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PREFACE 


This volume is formally the 2nd edition of Ramat and Ramat 1998, itself an English trans- 
lation of Ramat and Ramat 1993. Although the idea and the scope of the volume have 
remained much the same, the volume itself has changed a lot (the new editor included), 
and not only due to the necessary updates, inescapable after almost a quarter of a century 
has passed since the first edition. 

The chapter “Proto-Indo-European: Comparison and Reconstruction” (1st ed.) is 
now replaced by three completely new chapters (“Proto-Indo-European Phonology,” 
“Proto-Indo-European Morphology,” and “Proto-Indo-European Syntax”). Instead of 
the chapter “Sanskrit” (1st ed.), there is a new chapter “Indo-Aryan.” Also, a separate 
short prechapter “Indo-Iranian” has been added. Instead of the chapters “Latin” and “The 
Italic Languages” (1st ed.), there is only the new chapter “Italic” now. A new prechapter 
“Balto-Slavic” has been added to precede “Baltic” and “Slavic.” A new chapter “Frag- 
mentarily Attested Indo-European Languages” was planned, but the text has unfortu- 
nately not been finished on time. The chapter “The Indo-Europeans: Origins and Culture” 
(1st ed.) is completely out. The order of the chapters is new. The only authors remaining 
from the 1st edition are Nicholas Sims-Williams (“Iranian”) and Patrick Sims-Williams 
(“Celtic”). Their chapters are updated versions of their chapters in the 1st edition; all the 
other chapters (written by different authors) are completely new. Some of the authors 
from the 1st edition have, unfortunately, passed away in the meantime, like the great 
Calvert Watkins (1933-2013) and Werner Winter (1923-2010). New maps and illustra- 
tions were also added. 

The idea of the volume is to provide a compact but informative introduction to and 
overview of Indo-European languages, with the stress on Proto-Indo-European, the old- 
est well attested IE languages, and the largest IE branches. The aim of the volume is to 
provide an introduction to beginners or non-Indo-Europeanists, but also to be used as a 
reference book for scholars already doing research in IE linguistics. However, this is not 
an introduction to historical linguistics, phonology, morphology, or syntax in general. 
It is assumed that the potential readers will already be familiar with basic linguistics, 
although the aim was to make the volume as reader-friendly as possible. The approach in 
the volume is mostly historical, as traditional in IE linguistics, and the volume is linguis- 
tics only - i.e., there are no chapters on IE culture, archeology, mythology, poetics, etc. 
In the last twenty years, a lot of historical-comparative grammars of IE in English have 
appeared (cf. the literature in the first three chapters), but usually by one author surveying 
all the languages - here, specialists in the field deal with one IE branch (or topic) only 
from a wider IE perspective. The chapters of the volume are meant to be complementary, 
but each chapter is supposed to be able to stand for itself (obviously, there is some repe- 
tition of certain facts). 

The editor has tried to make this volume more uniform (e.g., the same PIE “orthogra- 
phy” is used throughout the book) and internally coherent than the first one, but it was 
impossible to “tame” completely so many authors, each with their own writing style and 
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varying interests. Obviously, each chapter has the author’s personal touch (which is not 
bad - otherwise, all the introductions to IE linguistics would be the same). Naturally, not 
all literature and different views that would deserve it could be included in this volume. 
Also, the reader will sometimes note the conflicting views in the volume - not only was 
this impossible to avoid, but there was no reason to avoid it. This is something that the 
reader has to take into account when he or she encounters differing PIE reconstructions 
or interpretations of certain processes or phenomena. Rather than being a hindrance, this 
can allow a beginner to understand that historical linguistics is not a dogma, and that 
by applying its methods we can sometimes get differing results. It is exactly after the 
competition of different ideas that communis opinio is refined. In the end, the editor is 
glad to note that, although the basic idea of the volume is to provide an introduction and 
overview of IE linguistics and in spite of the fact that people have been doing IE histori- 
cal linguistics for quite some time now, the volume does indeed contain quite a few new 
insights and interpretations, which will make it an interesting read not only for beginners 
and non-experts but also for professional IE linguists. 

Finally, I would like to thank the publisher (especially the editor Samantha Vale Noya) 
for the opportunity to do this (though I have often, in times of despair, regretted the fact 
that I accepted the very difficult and often tedious editorial work), and all the authors who 
have worked hard in order for us to be able to produce such a wonderful volume. 
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Sanskrit 

Slav. 

Slavic 

Slk. 

Slovak 

Sin. 

Slovene 

Sogd. 

Sogdian 

Sp. 

Spanish 

Srb. 

Serbian 

Sw. 

Swedish 

Stok. 

Stokavian 

Thess. 

Thessalian 

Toch. 

Tocharian (A & B) 

Ukr. 

Ukrainian 

ULSorb. 

Upper/Lower Sorbian 

Umbr. 

Umbrian 

USorb. 

Upper Sorbian 

Ved. 

Vedic 

Ven. 

Venetic 

W 

Welsh 

W 

West . . . 

WGr. 

West Greek 

WSlav. 

Western Slavic 

YAv. 

Young Avestan (= Late Avestan) 


OTHER ABBREVIATIONS 

1 sg./pl. 

first person singular/plural 

2 sg./pl. 

second person singular/plural 

3 sg./pl. 

third person singular/plural 

abl. 

ablative 

ace. 

accusative 

act. 

active 

adj. 

adjective 

adm. 

admirative 

adv. 

adverb 

all. 

allative 

aor. 

aorist 

a.p. 

accentual paradigm 

arch. 

archaic 

caus. 

causative 

coll. 

collective 

comm. 

common (gender) 

comp. 

comparative 

cond. 

conditional 

dat. 

dative 

def. 

definite 

det. 

determinate 



ABBREVIATIONS xix 


dial. 

dialect(al) 

dim. 

diminutive 

du. 

dual 

end. 

enclitic 

f. 

feminine 

fut. 

future 

gen. 

genitive 

imp. 

imperative 

imperf. 

imperfective 

impf. 

imperfect 

ind. 

indicative 

indef. 

indefinite 

indet. 

indeterminate 

inf. 

infinitive 

inj. 

injunctive 

instr. 

instrumental 

interrog. 

interrogative 

intr. 

intransitive 

IPA 

International Phonetic Alphabet 

loc. 

locative 

m. 

masculine 

med. 

medial (= middle) 

med.-pass. 

medio-passive 

mid. 

middle (= medial) 

n. 

neuter 

norn. 

nominative 

opt. 

optative 

pass. 

passive 

perf. 

perfect 

perfect. 

perfective 

pers. 

person(al) 

pi. 

plural 

pluperf. 

pluperfect 

pres. 

present 

pret. 

preterite 

pron. 

pronoun 

prv. 

preverb 

ptcl. 

particle 

ptcp. 

participle 

sg. 

singular 

sigm. 

sigmatic 

St. 

stem 

subj. 

subjunctive 

suff. 

suffix 

sup. 

supine 

them. 

thematic 

trans. 

transitive 

voc. 

vocative 
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INDO-EUROPEAN 
LANGUAGES - 
INTRODUCTION 

Mate Kapovic 


Indo-European languages were originally, a couple of millennia ago, spoken from Europe 
to India. Now, in postcolonial times, they are spoken all over the world. English is thus, 
besides being a global lingua franca , spoken in North America, Australia, parts of Africa, 
and elsewhere; Spanish in Central and South America (Portuguese in Brazil); French in 
parts of Africa, etc. According to Ethnologue,' 437 IE languages (the number is in reality 
approximate) are spoken by almost 3 billion people in the world. With regard to the num- 
ber of speakers, this makes the IE language family the biggest in the world. Concerning 
the number of individual languages, IE is not the largest language family - for instance, 
the African Niger-Congo family with more than 1500 languages is much larger. In the 
world, there are around 7000 languages (this is not an exact number, since it is extremely 
difficult to define linguistically what a language actually is), so in this sense IE languages 
are less important, although IE is still one of the biggest language families. Historical 
reasons - primarily the concentration of economic power in Europe and later in North 
America, together with the culture, science, etc., that goes with it - make IE the best- 
known language family in the world. 

In the territories from Europe to India, where IE languages were spoken already a few 
thousand years ago, not all languages are IE. Most of the languages spoken in Europe 
today are IE, the exceptions (disregarding Balkan Turkish, the languages of the Cauca- 
sus, and recent migrant languages) being Basque, Maltese (an offspring of Arabic), and 
the Uralic languages (Hungarian, Estonian, Finnish, and a few smaller ones). Basque is 
the only remnant of pre-IE European languages. Other such cases (like Etruscan) have 
long disappeared. In the Middle East, we find both IE (like Persian/Farsi, Pashto, Kurd- 
ish, and Tajik) and non-IE languages (like Turkish, Azeri, Arabic, etc.). In the Indian 
subcontinent, many languages are IE (like Hindi/Urdu, Bengali, Punjabi, and Nepali), 
while many belong to the non-IE Dravidian family (like Tamil and Malayalam) and other 
non-IE language families (like smaller languages from the Tibeto-Burman and Austro- 
asiatic families). 

What is a language family? What does it mean when we say that English and Albanian 
belong to the IE language family? To put it succinctly, a language family consists of all 
the languages that have evolved from a single original proto-language. It is thus a group 
of genetically related languages (up to a certain point - it just may well be that all human 
languages are related, though this cannot be proven). In the IE case, it means that we can 
prove that all the IE languages that we know have evolved through many stages from a 
single parent language that we today call Proto-Indo-European. This proto-language was 
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MAP 1.1 IE LANGUAGES TODAY 

Source: Adapted from C. Watkins The Indo-European Languages (Routledge) 1st ed (1998) 


spoken some 6000 years ago and, like most proto-languages, is unattested (which means 
that we have no written records of PIE). This is not always the case; for instance, Latin (as 
the proto-language of all Romance languages) and Old Chinese (as the proto-language of 
all modern Chinese “dialects’Vlanguages) are attested. 

How can one language split into more than 400 languages? The answer is simple - 
through language change. All languages change all the time (even today with all our 
schools, means of communication, and mass media). Linguistic changes that occur during 
the life of an individual speaker are usually slow and gradual (although still perceivable, 
whether they have to do with changes in pronunciation, grammatical forms, or vocabu- 
lary), but in time languages change enough to become mutually incomprehensible and 
sometimes very different. This is made easy by migration - when two social groups sepa- 
rate, it is easy to imagine that a previously unifonn language can evolve in different direc- 
tions (cf. the differences between British, American, and Australian English, although 
they are still in contact, mutually comprehensible, and thus considered one language). 

That languages like English, Czech, and Welsh have all developed from a single 
proto-language was not always apparent. Already in ancient times, people noticed that 
there are many similar words in, for instance, Latin and Greek - cf. e.g. Lat. mater 
‘mother', pater ‘father’ and Gr. pf)xr|p /meter/, ;nxTfip /pater/ (or, for that matter, English 
mother and father). However, they did not know how to properly explain such similar- 
ities. Thus, there have been many unscientific hypotheses, such as that all languages 
are derived from Hebrew (considered a holy language) and so on. In the 15th and 16th 
centuries, there were a number of scholars, like Rodolphus Agricola/Roelof Huisman 
(1443/4-1485), Sigismund Gelenius/Zikmund Hruby z Jeleni (1497-1554), and Johann 
Elichmann (1601/2-1639), who noted correspondences between various IE languages. 
However, it is Marcus van Boxhorn (1612-1653), a professor at Leiden University, who 
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can be regarded as the first historical linguist and the father of IE linguistics. In 1647 
he advanced a theory that Greek, Latin, Persian, Old Saxon, Dutch, German, Gothic, 
Russian, Danish, Swedish, Lithuanian, Czech, Croatian, and Welsh (but not Hebrew) 
originally stemmed from “Scythian” (= PIE). His friend Claudius Salmasius/Claude de 
Saumaise (1588-1653) added Sanskrit to the list. Amazingly, van Boxhorn also noted 
the importance of recognizing false cognates, loanwords, systematic correspondences, 
plausible semantic agreement, morphological comparison, and synchronically irregular 
forms (all basic elements of historical linguistics now called the “comparative method”). 2 
The European discovery of Sanskrit played an important role in the beginnings of IE 
historical and comparative linguistics. Already in 1585, Filippo Sassetti (1540-1588), a 
merchant and scholar, noted in a private letter the similarity of some words in Sanskrit 
and Italian (e.g., Skr. nova and Ital. nove ‘nine’), and in the 18th century a couple of 
scholars wrote on the obvious similarity of Sanskrit to European languages. One of them 
was William Jones (1746-1794), who in a lecture in 1786 compared Sanskrit to Greek, 
Latin, Gothic, Celtic, and Old Persian, claiming that they “have sprung from some com- 
mon source.” Jones is usually wrongly considered the founding father of IE historical 
linguistics, although he was neither the first to make such a hypothesis nor the one with 
the clearest presentation of the problem. Shortly after, the first modem IE historical 
linguists - Jacob Grimm (1785-1863; famed as one of the Grimm brothers), Rasmus Rask 
(1787-1832), and Franz Bopp (1791-1867) - started doing serious historical linguistics, 
and others followed. The very term Indo-European was coined in 1 8 1 3 by Thomas Young 
(1773-1829). The discovery of Anatolian and Tocharian texts in the beginning of the 20th 
century gave new impetus for IE linguistics. 

The IE language family comprises ten principal branches (those being the ones suf- 
ficiently attested in ancient times and/or today) and a number of fragmentarily attested 
separate languages spoken in ancient times (like Lusitanian or Phrygian) that have since 
disappeared. Three main branches (Anatolian, Indo-Iranian, and Greek) are attested 
already in the second millennium BCE, two in the first millennium BCE (Italic, Celtic), 
and the other five branches later. Three of the principal branches consist of one lan- 
guage only (Greek, Armenian, and Albanian), although those comprise different dialects 
as well. Two major branches (Anatolian and Tocharian) are now extinct, and the same 
goes for all fragmentarily attested languages and certain languages from other primary 
branches (like Gaulish). The ten principal IE branches are as follows. 

1. Anatolian - attested from the 19th/16th to the 1st century BCE 

Spoken, until its disappearance, in Asia Minor. Old Anatolian languages (spoken in 
the 2nd millennium BCE) are Hittite (the best attested, written in Akkadian cunei- 
form, the language of the great Hittite Empire), Cuneiform Luwian (Luvian), Hiero- 
glyphic Luwian, and Palaic. Hittite is attested from around the 16th century, but a 
few names and loanwords are found already in the 19th century BCE in Old Assyrian 
texts. New Anatolian languages (spoken in the 1st millennium BCE) are Lydian, 
Lycian (Lycian A), Milyan (Lycian B), Carian, Sidetic, and Pisidian. 

2. Indo-lranian - attested from the 15th 14th century BCE 

Spoken mostly in the northern part of the Indian subcontinent and the Middle East 
(Eastern Turkey, Caucasus, Iran, Afghanistan). As a result of old migrations, the 
Romani language dialects are spoken by the Roma people in Europe. Indo-Iranian 
consists of two great groups - Indo-Aryan (Indie) and Iranian. The third, smaller 
group is Nuristani (on the border between Afghanistan and Pakistan), and the fourth, 
disputably, is Dardic (traditionally considered part of Indo-Aryan). The two major 
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groups are attested from ancient times and represented by many languages, while 
the two small groups are attested only from modem times and consist of a few small 
languages spoken in remote parts of northern India, Afghanistan, and Pakistan. Indo- 
Aryan languages are chronologically divided into Old Indic/Indo -Aryan (Vedic - the 
language of the Vedas; Sanskrit - the classical language of India), Middle Indie/ 
Prakrits (Pali, Shauraseni, Maharashtri, Magadhi, etc.), and New Indie (many lan- 
guages - Hindi/Urdu, Bengali, Punjabi, Marathi, etc.). Certain words and names of 
Indo-Aryan origin were attested in the 1 5th — 14th century BCE in Hurrian texts. The 
Rigveda (the oldest part of the Vedas) is usually dated to the second half of the 2nd 
millennium BCE but was transmitted orally for a long time. Iranian languages are 
chronologically divided into Old Iranian (Avestan - the language of Zarathustra, 
from the first half of the 1st millennium BCE, or perhaps even earlier but at first 
transmitted orally; Old Persian - the language of the Achaemenid Empire), Middle 
Iranian (Pahlavi - the language of the Sassanid Empire; Sogdian; etc.), and New 
Iranian (New Persian/Farsi - the official language of Iran; the Kurdish languages; 
Pashto - the official language of Afghanistan; etc.). 

3. Greek - attested from the 1 5th 14th century BCE 

Spoken from ancient times until the present in mainland Greece and neighboring 
islands. The first attestations were written in the Linear B script in Mycenaean Greek, 
the earliest recorded Greek dialect. From the 8th century, Greek has been written in 
the Greek alphabet. In the classical period (around the 5th century BCE), the fol- 
lowing Greek dialects existed - North-West dialects, Doric, Aeolic (controversial), 
Arcado-Cypriot, and Ionic-Attic (the details of the subgrouping are disputed). Later 
stages of Greek are Hellenistic/Koine, Byzantine/Medieval, and Modern Greek. 

4. Italic - attested from the 7th century BCE 

Spoken in ancient Italy. Italic consists of two groups - Latino-Faliscan and Sabellian 
(Osco-Umbrian). The Latino-Faliscan group consists of Latin (very well attested) 
and Faliscan. The Sabellian group consists of Oscan, Umbrian, and a number of 
smaller and poorly attested Italic languages. All Italic languages except Latin disap- 
peared already in Roman times owing to the expansion of Latin as the language of 
the Roman state. Latin later developed into numerous modem Romance languages 
(Portuguese, Spanish, Catalan, French, Italian, Romanian, etc.). 

5. Celtic - attested from the 6th century BCE 

Now spoken in peripheral areas of the British Isles (Ireland, Wales, Scotland) and 
France (Brittany), previously spoken in wide areas of Europe (from the Iberian Pen- 
insula and France all the way to Asia Minor). The Celtic branch can be divided 
(though controversially) into three groups: Celtiberian, Continental Celtic, and Insu- 
lar Celtic. Celtiberian is a one-language group, attested in the 2nd to 1st century 
BCE in present-day Spain. Continental Celtic languages - Gaulish (in France) and 
Lepontic (in northern Italy) - are now extinct. Insular Celtic consists of Brythonic 
(Welsh, Breton, and extinct Cornish) and Goidelic (Irish/Gaelic, Scottish Gaelic, and 
extinct Manx). Of all currently existing Celtic languages, only Welsh has a more or 
less secure future. 

6. Germanic - attested from the 3rd (?) century BCE 

Nowadays spoken in North-West Europe. The first Germanic words are attested per- 
haps already in the 3rd (?) century BCE on the so-called Negau helmet (found in 
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Zenjak in present-day Slovenia). The earliest runic inscriptions date from the first 
centuries CE, and the Gothic translation of the Bible from the 4th century. Germanic 
languages comprise three groups: East, North, and West. The extinct East Germanic 
group consists of Gothic, Vandal, and Burgund. The Northern languages are Old 
Norse and, in modem times, the Scandinavian languages. The Western languages 
are Old High German, Old Saxonian, Old Low Franconian (Old Dutch), Old Frisian, 
and Old English (in modem times, respectively, German, Low German, Dutch, Fri- 
sian, and English). 

7. Armenian - attested from the 5th century CE 

Spoken in present-day Armenia and elsewhere in the Caucasus and Middle East as 
a minority language, previously also in wide areas of present-day eastern Turkey. 
Modern Armenian consist of two dialects, Eastern and Western, both standardized. 

8. Tocharian - attested from the 4th-5th to the 8th-10th century CE 

Spoken in merchant cities on the ancient Silk Road, in the present-day Xinjiang 
province of western China (now inhabited by Turkic Uyghurs). There were two 
closely related Tocharian languages - Tocharian A and Tocharian B. 

9. Balto-Slavic - attested from the 9th (Slavic) and 16th (Baltic) century CE 
Spoken in the Baltic, the Balkans, and Central and Eastern Europe. The Balto-Slavic 
branch consists of West Baltic, East Baltic, and Slavic. The extinct Old Prussian is 
West Baltic, and Lithuanian and Latvian are East Baltic (there are a few other poorly 
attested and extinct Baltic languages). Slavic consists of East (Russian, Belorussian, 
Ukrainian), West (Polish, Czech, Slovak, etc.), and South Slavic (Slovene; Bosnian/ 
Croatian/Montenegrin/Serbian - traditionally known as Serbo-Croatian; Macedo- 
nian; Bulgarian). The first attested Slavic language, Old Church Slavic, was based 
on the Thessaloniki Old Macedonian dialect. 

10. Albanian - attested from the 15th century CE 

Spoken in present-day Albania, Kosovo, West Macedonia, South Montenegro, 
North-West Greece, and South Italy. There are two dialects - Gheg in the north and 
Tosk in the south, the latter being the basis of Standard Albanian. 

Fragmentarily attested IE languages can be divided into two basic groups: those that 
have at least one attested text/inscription and those that are attested only through onomas- 
tics or individual words in texts written in other languages. In the first group, there are 
Lusitanian (Portugal), Venetic (North-East Italy), Messapic (the “heel” of Italy), Thracian 
(Bulgaria), and Phrygian (Minor Asia). In the second group, we find languages like Ligu- 
rian (North-West Italy), the Illyrian languages (North-West Balkans), Dacian (Romania), 
and Macedonian (no relation to present-day Slavic Macedonian). Apart from fragmen- 
tarily attested IE languages, there are also possible non-attested IE languages, whose 
traces can perhaps be seen in loanwords in other languages (like the supposed North-West 
Block language in Benelux), but these are rather speculative. 

Many elements can be reconstructed for the last stage of PIE. However, what we 
obtain through reconstruction will never be a full picture of the language, since elements 
(sounds, grammatical endings, words, meanings, etc.) can disappear with no trace or 
leave so few traces that it is not possible to reconstruct them plausibly. Thus, the around 
1500 PIE lexical items that have been reconstructed are certainly just a small part of the 
actual PIE vocabulary, which must have had something like 30,000-50,000 words, just 
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MAP 1.2 IE LANGUAGES IN THE 1ST MILLENNIUM BCE 

Source: Adapted from C. Watkins The Indo-European Languages (Routledge) 1st ed (1998) 

as much as any language today. 3 Likewise, we can, for instance, reconstruct the ending 
variants *-mes, *-mos, *-mesi, *-men, etc. in the PIE 1st person plural present indicative 
(p. 93), but it is impossible to know how to interpret them exactly. Were these just random 
variants (not likely), were they sociolinguistic variables, did their use depend on dialect, 
gender, social status, etc.? Sometimes it is also very difficult to separate different layers - 
should an element be reconstructed for PIE in general, dialectal PIE, post-Anatolian PIE 
(i.e., PIE after the early split of Anatolian), early PIE (i.e., PIE including Anatolian), 
post-PIE time, etc.? 

Like most languages that cover any substantial ground, PIE probably also had dialects. 
The dialectal picture of PIE is not completely clear because it is sometimes difficult to 
distinguish PIE dialectal features from post-PIE shared innovations and accidental inde- 
pendent innovations. It is now widely accepted that Anatolian was the first branch to 
separate from the rest of PIE (and that Tocharian was the next). However, which Anatolian 
characteristics (like the lack of feminine gender) are archaisms, and which are innova- 
tions, is still a matter of dispute (p. 62, 66, 159, 178). Dialects can share one trait with one 
dialect, and another with another dialect. For instance, Greek, Indo-Iranian, and Armenian 
all have the augment *h[e- in past tense verbal forms, which is not present in other IE lan- 
guages. At the same time, Armenian and Indo-Iranian are satem languages, while Greek 
is a centum language (p. 27). In Greek, Armenian, and Phrygian, initial pre-consonantal 
laryngeals (*H-) yield vowels (p. 48), while this is not the case in Indo-Iranian. Indo- 
Iranian, on the other hand, shares the so-called RUKI rule (p. 206) with Balto-Slavic 
(p. 29, 495, 526-527), a satem branch, while Balto-Slavic shares the ending *-mos in the 
dative plural with Germanic (as opposed to *-b h os in Italic, p. 63), a centum branch, etc. 

The question of the PIE homeland (the “original” territory where speakers of PIE 
lived) is still a subject of debate. The most commonly accepted hypothesis in the last few 
decades has been the “Kurgan hypothesis” (Kurgans being a type of burial mound), which 
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puts Proto-Indo-Europeans in the steppes of present-day Ukraine and South Russia, north 
of the Black Sea. The position of the PIE homeland is often deduced from certain linguis- 
tic indications, and by trying to connect linguistic evidence with archeology, paleobotany, 
and other disciplines. The way linguistic evidence is used in trying to deduce the place 
of the PIE homeland follows a certain logic. For instance, since one can reconstruct a 
PIE word for ‘snow’ (*snoyg wh s), we can suppose that PIE was spoken in a place where 
there was snow (which by itself does not amount to much). Evidence has to be carefully 
evaluated - for instance, Lat. mare, OCS morje, and OIr. muir ‘sea’ would all point to 
PIE *mori ‘sea’ (and thus to a PIE homeland by the sea). However, Goth, marei (cf. Eng. 
mere, marsh) means both ‘sea’ and Take’, which brings the PIE meaning ‘sea’ in doubt 
(it is not unimaginable that the original meaning was Take, standing water’ and that the 
meaning ‘sea’ is secondary, though there can be no certainty in any of the possibilities). 

Comparative linguistics has proven beyond reasonable doubt that all IE languages 
stem from the same proto-language. However, the question of a deeper genetic relation- 
ship of IE with other language families is still a matter of heated debate (though most 
IE-ists, curiously, ignore the problem, choosing not to go outside of inner-IE problems). 
The most likely IE relative is Uralic. There are some striking cognate-looking examples, 
cf. e.g. Proto-Uralic *weti ‘water’ and PIE *wodr (this is not a word easily loaned), 
or Proto-Uralic verbal endings 1 sg. *-mi, 1 pi. *-me, 2 pi. *-te, which are practically 
identical to PIE *-m(i), *-me, *-te (p. 93). Although the Indo-Uralic hypothesis is still 
not strictly proven, grammatical correspondences like these make this proposition quite 
probable. On a wider scale, IE often connected with “Nostratic” languages, which include 
IE, Uralic, Kartvelian (like Georgian), Altaic (a suspicious family by itself, consisting 
of Turkic, Mongolic, Tungusic, and possibly other languages), Dravidian, Afro-Asiatic 
(consisting of Semitic, Egyptian, etc.), and possibly other families/languages (the “cast” 
is not always the same). There is a lot of work left to be done in long range comparison, 
but it is not at all certain that we will ever get satisfactory answers to these problems. 
Many linguists are, with good reason, highly skeptical concerning the very methodologi- 
cal basis of such reconstruction, and simply do not believe that there is enough linguistic 
material preserved (since sounds, forms and words not only change but can also disappear 
leaving no trace) to let us plausibly reconstruct proto-languages so distantly in the past. 

Up till now, we have seen examples of cognate words and reconstructed PIE forms 
(marked with an asterisk to indicate them not being attested). But how do linguists actu- 
ally know what words/forms to compare, and how do they get the idea that two or more 
languages are genetically related in the first place? Looking for similar words with similar 
meanings is obviously not enough, since similar words with similar meanings in two 
languages can easily be the result of linguistic borrowing. Compare Modern English and 
French pairs like city/cite, people! peuple, or partlpart. These are all French loanwords 
in English, which can be proven by historical data (French loanwords were brought to 
England by French-speaking Normans after their 1066 conquest). When looking for 
genetically related languages, it is safer to look for similarities in grammar (morphology, 
i.e., nominal or verbal endings, etc.), since morphological elements are borrowed much 
less frequently than vocabulary. Looking for morphological similarities may be a prob- 
lem in languages which are not rich with morphology (like Chinese or even English), but 
not in older IE languages, which have quite complex morphology. Some of the similar- 
ities, clearly seen in ancient IE languages, are visible even today. For instance, compare 
the plural ending -i in Italian lupi ‘wolves’ and Russian eojuai /volki/. However, one does 
need to reconstruct vocabulary as well. When looking for old words inherited directly 
from PIE, one has to find words that are likely to have been in a language already a 
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couple of millennia ago (thus, not words like computer or tennis) and words that are not 
so likely to be borrowed. Although in principle any word can be borrowed (and all kinds 
of examples from all around the world can be found), in some types of vocabulary one 
often finds old inherited words, for instance, in words for family relations (cf. Eng. father, 
mother, brother and related Ital. padre, madre, fratello), numbers (cf. Eng. two, three, 
seven and related Ital. due, tre, sette), body parts (Eng. nose, beard, ear and Ital. naso, 
barba, orecchio), etc. Genetically related words are sometimes obviously similar (Eng. 
mother ~ Ital. madre), sometimes less obviously so (Eng. wolf ~ Ital. lupo - the / is the 
same, and the sounds / and p are phonetically close), and sometimes almost completely 
different (Eng. hornet ~ Ital. calabrone). Of course, similarity of form and meaning may 
be completely accidental, cf. e.g. non-related pairs such as Eng. much and Span, mucho, 
Eng. iron and Span, hierro, Eng. day and Span, dia, etc. (this is more apparent when one 
considers phonologically more archaic Germanic cognates such as Scots mickle, Gennan 
Eisen, or Swedish dag respectively, or the Latin forms multum,ferrum, dies that yield the 
cited Spanish forms). 

But similarity alone is not enough. Not only does it not guarantee that two words 
are related, it is also not scientific. What historical linguists look for are not necessarily 
similar sounds, words, or forms, but systematic sound correspondences. This means that 
certain phonemes (or clusters of them) in purportedly genetically related languages have 
to systematically correspond to each other, whether they are phonetically close or not. 
For instance, in initial position we find in English and Italian the correspondences n — 
n- (no/no, night/notte, nine/nove), h — c- {hornet/ calabrone, hound/cane, heart/ cuore), 
t — d- {two/due, ten/dieci, tooth/dente), etc. Careful examination and analysis of such 
correspondences is one of the primary tasks of historical linguistics and a prerequisite for 
proving a genetic relationship between two or more languages. Describing such corre- 
spondences will be an important part of the next chapter. 

Note: I would like to thank David Mandic and Petra Sostaric for reading the first draft of the chapter. 
Of course, all the mistakes are just mine. 


FURTHER READING 

Lehmann 1967 provides a rich selection of translated primary texts (with commentary) 
from the beginnings and early IE historical linguistics. Pedersen 1959 is also still very 
useful. For the problems of the IE homeland, culture, and archeology from a mainstream 
IE-ist perspective see, for instance, Mallory 1989 and Mallory & Adams 1997. A lot of 
information on IE culture can be found in the famous but controversial and idiosyncratic 
Gamkrelidze & Ivanov 1995. For more on the “Kurgan hypothesis” from the pen of its 
originator see Gimbutas 1997. Salmons & Joseph 1998 is an informative look at Nostratic 
from different perspectives. There are a number of good handbooks and introductions to 
historical linguistics, many of them using a lot of IE material. Trask 2007 may be a good 
place to start, as well as Campbell 2004. Joseph & Janda 2003 and Ringe & Eska 2013 
are a bit more demanding. Labov’s trilogy on the principles of linguistic change (1994, 
2001, 2010), written mostly (but not exclusively) from the perspective of quantitative 
sociolinguistics, is invaluable and inspiring, though perhaps not a usual IE-ist read. For a 
list of recent introductions to IE, see the “Further reading” sections in the chapters on PIE 
phonology and PIE morphology in this volume. 
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NOTES 

1 http://www.ethnologue.com/statistics/family [last approached: Feb. 2, 2016]. 

2 For van Boxhorn and other early scholars cf. Van Driem 2001: 1039-1051. 

3 Cf. Mallory & Adams 2006: 117-119. 
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CHAPTER 1 


PROTO-INDO-EUROPEAN 

PHONOLOGY 

Mate Kapovic 


INTRODUCTION 

Proto-Indo-European phonology is reconstructed on the basis of its reflexes in Indo- 
European languages - mostly older ones, generally preserving more archaic features, i.e., 
closer to PIE. The principal languages referenced in this chapter for the reconstruction 
of PIE are Old Indic/Indo -Aryan (Vedic and Sanskrit), Avestan (Old/Gatha- and Young/ 
Late Avestan), (Ancient) Greek, Latin, Old Church Slavic, Lithuanian (Old and Modem), 
Gothic, Hittite, Old Irish, Tocharian (A and B), (Classical) Armenian, and Albanian. Of 
course, other languages (Sabellic, Latvian, Old Prussian, Old High German, Old English, 
Luwian, Gaulish, Old Persian, etc.) are also used when necessary or when they have a 
form or feature not present in a language from the first group. Ideally, PIE reconstruction 
should take into account all the relevant information present in any IE language/dialect. 
This overview tries to present the reconstructed PIE phonological system and its main 
reflexes in major ancient IE languages. A more detailed overview of reflexes for specific 
languages/branches can be found in separate chapters in this volume. 

The following phonological system can be reconstructed for the last phase of PIE: 


STOPS 

*p 

*t 

*k 

*k 

*k" 

[voiceless] 


(*b) 

*d 

*g 

*g 

*g" 

[voiced] 


*b h 

*d h 

* g h 

*g h 

*gwh 

[aspirated] 

FRICATIVES 

*s 

*h. 

*h 2 

*h 3 



RESONANTS 

*m/m 

*n/n 

*1/1 

*r/r 

*w/u 

*y/i 

and GLIDES 







VOWELS 

(*a) 

*e 

*0 




(including 


*e 

*0 




diphthongs) 

(*ay) 

*ey 

*oy 

(*aw) 

*ew 

*ow 



*ey 

*oy 


*ew 

*ow 


The phonemes *b and *a (together with *ay and *aw) can be considered marginal, or at 
least infrequent, phonemes (p. 18, 42), *a also being an allophone of *e when in contact 
with *h 2 (p. 43). Reconstruction of *a not stemming from original *eh, is very ques- 
tionable (p. 42). Short *i and *u most likely had long allophones in monosyllabic forms 
(p. 55). 
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PIE stops were voiceless, voiced, or (voiced) aspirated. The velar stops were unmarked 
(*k, *g, *g h ), palatalized (*k, *g, *g h ) (IPA [k J , g j , g jh ]), or labialized (*k w , *g w , *g wh ). 
Besides *s, the “laryngeals” *h 2 and *h, (marked numerically) were almost certainly 
fricatives as well (some kind of A-type sounds). The *h, “laryngeal” (provisionally labeled 
a fricative in the above scheme) might also have been a fricative, but this is less certain 
(p. 30). The sign *H is used for all three laryngeals when it is not known which one 
is to be reconstructed or when we need a mark for all three laryngeals. Resonants and 
semivowels were either asyllabic (*m, *n, *1, *r, *w, *y (IPA [j])) or syllabic (*m, *n, 
*1, *r, *u, *i) (IPA [m, n, 1, r]). The vowels *i and *u act phonologically in the same 
way as syllabic resonants - the relation of *i/u to *y/w is the same as that of *m/n/l/r to 
*m/n/l/r. Some reconstructable segments appeared only as allophones - cf. the non-pho- 
nemic fricatives *z (p. 29), *[) (IPA [0]), and *5 (and *d h ) (p. 53) and the vowel *3 
(p. 46-47, 52). 

What is immediately typologically noticeable in the PIE phonological system, among 
other things, is the paucity of “real” vowels (if one does not take into account syllabic 
glides, long vowels, and diphthongs), the very unusual marginality (or even complete 
lack of) the vowel a, and the complete lack of affricates (no phonemic *c [Is], *c [tf], 
*3 [(fc], etc.). On the other hand, the stop system was rather rich (with 15 of them). In 
later IE languages, phonological systems usually developed in the following direction: 
the vowel system became typologically more common, with a becoming more frequent 
(additionally, i and u lost their phonological relation to j and w/v, while diphthongs often 
became monophthongs); syllabic resonants (*m, *n, *1, *r) disappeared everywhere (later 
they appeared again in some languages); a lot of languages developed new affricates; 
the “laryngeals” (*h„ *h 2 , *h,) disappeared in most languages in most positions, but 
new fricatives appeared; the stop system simplified everywhere; and languages preserved 
only one additional distinctive feature - aspiration or labialization. Velar palatalization as 
such was nowhere preserved - it either completely disappeared or was the origin for new 
affricates and fricatives. 

Note on the PIE “orthography”: various authors/schools write reconstructed PIE pho- 
nemes differently. Palatalized velars can also be written as *k’, *k, *k j , *k y , etc.; *k y can 
be written instead of *k w ; aspirated stops can be written as *bh, *dh, *gh; laryngeals can 
be written as *H[, *H 2 , *H, or *3 1; *s 2 , *3, (the latter usually for “syllabic” laryngeals), 
and an unknown laryngeal as *h x (instead of *H); the glides *y, *w can be written as *i/j, 
*u; *eu/eu instead of *ew, etc. 


TABLE 1.1 PIE STOPS 



voiceless 

voiced 

aspirated 

labials 

*p 

*b 

*b h 

dentals 

*t 

*d 

*d h 

velars 

*k 

*g 

* g h 

palatalized velars 


*g 

*g h 

labialized velars 

*k w 

*g w 

* gwh 


No modem IE language has preserved PIE palatalized velars as such. On the other 
hand, labiovelars are preserved even in some modem IE languages - cf. the phonetics 
of English queen [k w i:n] (p. 25). PIE aspirated stops are traditionally reconstructed as 
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voiced because of their reflexes, which are mostly voiced (p. 28). However, voiced aspi- 
rated stops are typologically less common than voiceless aspirated stops - cf. the initial 
allophonic aspirated [p h ], [t h ], [k h ] in English pin, tin, kin. In PIE, velars could be both 
palatalized and aspirated (*g h ) and labialized and aspirated (*g wh ) at the same time. 

PIE voiceless stops do not change in most of the languages (p, t, k ), the main excep- 
tions being Germanic (where they change to voiceless fricatives f p, x) and Armenian 
(where they change to voiceless aspirates *p c , f, kb). The phoneme *p weakens and dis- 
appears in Celtic. In most languages, PIE voiced stops remain unchanged ( b , d, g). How- 
ever, in Germanic and Armenian (perhaps also in Anatolian) they become voiceless p, t, 
k (in Tocharian all stops merge into p, t/ts, k). The phonemes traditionally reconstructed 
as voiced aspirated stops change in most IE languages. They are preserved only in Indo- 
Aryan (bh, dh, gh); in Greek they remain aspirated but become voiceless (cp [p h ], fl/0 [t h ], % 
[k h ]); in Italic they change to fricatives (voiceless initially: *f-, *|>, *x-; voiced medially: 
*-(3-, *-6-, *-y-), and in other branches they change to plain voiced stops ( b , d, g). PIE 
palatovelars change to plain velars in some languages (k, g) and to affricates (c[ts]/c[tc]/ 
c[t|l, dz[di\l dz[d&]l dz[d$\ . . .) and, further on, to fricatives (s/i[G]/.s[J], z/z[z]/z[3] . . .) in 
others. PIE labiovelars remain labialized velars in some languages (k w , x"', g"), or change 
further to full labials (p, b/w), while in others (mostly those where PIE palatovelars yield 
affricates/fricatives) they become plain velars ( k , g). 


Voiceless stops 

*P 

PIE *pods ‘ foot’ > Ved. pat (YAv. pad-), Gr. jiotk; (cf. via Lat. Eng. podium), Lat. pes 
(=> Eng. pedal), OCS pesb ‘on foot’, Lith. pesetas ‘pedestrian’, Goth .fotus (Eng. 
foot), OIr. is ‘below, under’ < *pedsu, Hitt, pat-. Arm. otn ‘foot’, het ‘foot(print)’, 
Toch. B paiyye 

PIE *pek w - ‘cook, bake’ > Ved. pacanti ‘they cook’ (YAv. pacata ‘it is cooked’), 
Gr. nsooco, Lat. coquo < *k w ek w - (assimilation), OCS pesti, Lith. kepti < *pekti 
(metathesis), MW pobi < *k w ok w - (assimilation), Arm. hacf ‘bread’, Alb. pjek, 
Toch. AB pak- 

PIE *peh 2 wr/n- ‘fire’ (p. 78) > Gr. 7tup, Urn hr. pir, Cz. (arch.) pyr ‘ash, fire’, OPruss. 
panno, Goth .fan {Eng. fire), Hitt, pahhur, Ann. hur, Toch. B puwar 

PIE *p is reconstructed because it reflects as p in most languages. In Gennanic, 
where all stops change their articulation, we find /'(medially only following the stressed 
syllable). In Armenian the reflex is originally aspirated *p h -, which yields either h- or 
nothing word-initially (#h- of any origin is unstable in Armenian) and w word-intemally 
(p. 426^127). In Celtic PIE *p disappears through a *p > *f > *h type process (p. 363). 
In Italic and Celtic (before the *p > *h change in Celtic), *p-k w (a *p and a *k w in the 
same word) assimilate regularly to *k w -k w , cf. PIE *penk w e ‘five’ > Lat. quinque, OIr. 
coic (*k w > c in Old Irish). In Hittite the reflex is p- initially and a geminate spelling 
-pp- internally (p. 177). 


*t 


PIE *tod ‘that’ > Ved. tat (GAv. ta-), Gr. to, OCS to, Lith. tas, Goth, pata (Eng. that. 
Germ, das), OIr. to ‘yes’, Alb. ke-ta. Arm. da 
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PIE *tnh,us ‘thin’ > Ved. tanus, Gr. ravuq, Lat. tenuis, OCS tbirbcaje ‘thinner’, Lith. 

tgvas, OIc. punnr (Eng. thin. Germ, diinn), OIr. tanae 
PIE *pet- ‘fly’ > Ved. patati ‘flies’ (YAv. patorjti ‘they fly’), Gr. Jisiopou ‘I fly’, Lat. 
peto ‘I go after’, OEng .feder < Gmc *fepro (Eng. feather), OIr. en ‘bird’ < *pet-no- 
(Gaul. Etnosus (name of a deity)), Hitt, pattai- 

PIE *t is reconstructed because it reflects as t in most languages. In Germanic it 
changes to p (Eng. <th>, Gmc *p > d) initially and medially following the stressed syl- 
lable (p. 20). In Armenian the reflex is an aspirated t' (cf. Ann. farsamim ‘I wither’, Lat. 
torred ‘I dry’, Eng. thirst < PIE *ters-), but the evidence is scarce, with d- appearing in 
unaccented words (cf. Arm. du ‘thou’, Lat. tu < PIE *tu, p. 438). In Hittite we find initial 
t- and medial -tt-. 


*k 


PIE *lewk- ‘light’ > Ved. rokas (GAv. raocah- with palatalization), Gr. Xcoicoq ‘light, 
white’ (=> Eng. leukocyte), Lat. liix, Russ. CS1. lucb (with palatalization), Lith. 
laukas ‘pale’, Goth, liuhap (Eng. light), Mir. luach ‘glowing white’ (Gaul. Leucus 
(pers. name)), Hitt, lukk- ‘to light up’, Toch. AB luk- 
PIE *krewh,s ‘raw meat, blood’ > Ved. kravl$- ‘raw/bloody meat’, Gr. Kpeaq ‘meat’, 
Lat. cruor ‘(wound) blood’, OCS krbVb ‘blood’, Lith. h-aujas ‘blood’, OEng. 
hreaw (Eng. raw) 

PIE *kes- ‘comb, scratch’ > Ved. ktwaumi ‘I whet’ (*ksnew-), Gr. ^eco ‘I scrape’, PS1. 
*kosa ‘hair’ (BCMS kosa), Lith. kasa ‘braid’, OIr. clr ‘comb’ (also cass ‘curl’, 
Gaul. Cassi- (pers. name)), Hitt./CLuw. kis- ‘to comb’, Ann. Eos ‘scab, itch’, (?) 
Toch. B kaswo ‘eruption, inflammation of the skin’ 

PIE *k is reconstructed because it reflects as k in most languages (cf. PIE *kes- > Alb. 
kohe ‘time’, OCS cash ‘time, moment’). Parallel to *p and *t, *k yields *x in Germanic 
(medially only following the stressed syllable) and aspirated E in Annenian. In Hittite we 
find initial k- and medial -kk- (written also as <gg>). 


Voiced stops 

*b 


PIE *h 2 ebol ‘apple’ > OCS abhko, Lith. arch./dial. obuolas, OEng. ceppel (Eng. 

apple), OIr. ubull, (?) Lat. (from Osc .) Abella (name of a town) 

PIE *bel- ‘strong’ > Ved. balam ‘strength’, Gr. PtAiepoq ‘better’, Lat. de-bilis ‘weak’, 
OCS bolii ‘better’, Low Germ, pall ‘unmovable’, (?) OIr. bale ‘strong’ 

PIE *d h ewb- ‘deep’ > Gr. Pufloq ‘depth’ (< *Sup6<;, metathesis), OCS dbbn ‘valley, 
ravine’ < *d't.bn>, Lith. dubiis, Goth, diups (Eng. deep), Toch. B tapre ‘high’ 

PIE *b is reconstructed because it reflects as b in most languages. However, in Ger- 
manic, Armenian, Tocharian, and Anatolian, the reflex is p. In Germanic and Armenian 
this change is just a part of a larger chain shift (see below), while in Tocharian all stops 
merge into voiceless ones. The exact phonetics of Hitt, p-, -p- < PIE *b (as opposed to p-, 
-pp- < PIE *-p-) is not clear (p. 177). 

There are not a lot of words containing PIE *b. It seems to have been a marginal 
phoneme in PIE (it might have been completely absent in pre-PIE). The roots containing 
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PIE *b are often attested in only a few languages, and sometimes this *b seems to be 
of secondary origin. Clear cases of secondary *b are found in forms where it originates 
from the *ph 3 cluster - cf. the PIE root *peh 3 - ‘drink’ (Ved. apat ‘he drank’, Lat. potus 
‘drunk’), where after reduplication one gets *pi-ph 3 - > *pib- ‘drink’ (Ved. pibati ‘drinks’, 
Lat. bibo ‘I drink’ < *pibo, OIr. ibid ‘drinks’). In some cases, it seems likely that a PIE *b 
originates in older *m. Thus, the root *h,ebl- ‘apple’ can be related to another PIE word 
for ‘apple’ - *meh 2 lom (Lat. malum, Gr. prjA.ov, Alb. molle; (?) Hitt, mahlas ‘branch of a 
grapevine’). Here, one can assume a metathesis *meh 2 l- > *h,eml- and then the change 
*-ml- > *-bl- in northern IE dialects, where the word is attested. This kind of change 
can probably be seen in PIE *bel- ‘strong, better’ (see above) as well - cf. Lat. melior 
‘better’ < PIE *mel-. A secondary *b (presumably from older *p) can perhaps be seen in 
PIE *bol- (Lat. blatea ‘swamp’, OCS blato ‘mud’, Lith. bald ‘mud’, OEng. pol > Eng. 
pool. Alb. balte ‘mud’), compared to PIE *pol- (Skr. palvalam ‘pool, pond’, Lat. palus 
‘swamp’, Lith. palios ‘mud’). In some cases, there is even a variation of *p/b/b\ cf. 
*d h ewp- (PS1. *dupa ‘hole’, ONor. dufa ‘dive’, Eng. dive - but see p. 54), *d h ewb- (see 
above), and *b h ud h - ‘bottom’ (Ved. budhnas, Gr. 7tuflpf|V, Lat. fundus, OEng. bodan) with 
metathesis. See p. 18 for more on PIE *b. 


*d 


PIE *h,don(ts) ‘ tooth’ (< ‘eating’) > Ved. dan (Av. dantan-), Gr. 68or><;, Lat. dens (=> 
Eng. dentist), PS1. *d§sni ‘gums’ (BCMS desni), Lith. dantis, Goth, tunpus (Eng. 
tooth), OIr. det, Hitt, adant- ‘eating, eaten’, Ann. atamn 
PIE *deks- ‘right (side)’ > Ved. daksinas (YAv. dasina-), Gr. 8e^ioq, Lat. dexter (=> 
Eng. dexterity), OCS desnu, Lith. arch, desinas, Goth, taihswa, OIr. dess (Gaul. 
Dex(s)iua (name of a deity)), Alb. djathte 

PIE *doru ‘tree, wood’ > Ved. daru ‘wood’ (YAv. da"ru ‘tree trunk’), Gr. 8opu ‘tree’, 
OCS drevo ‘tree’, Lith. derva ‘resin, tar’, Goth, triu ‘tree trunk’ (Eng. tree), OIr. 
daur ‘oak’, Hitt, ta.ru ‘wood’, Alb. dru 

PIE *d is reconstructed because it reflects as d in most languages. The exceptions are 
again Germanic, Armenian, and Anatolian, with t, and Tocharian, with ts (cf. Toch. tscim- 
‘grow’, Gr. Sepco ‘I build’ < PIE *demh 2 -, p. 454). 


*g 

PIE *gerh 2 n/w- ‘crane’ > Gr. yfpuvoc, Lat. grus, PS1. *zeravb (OCz. zerciv) (with 
palatalization), Lith. gerve, OEng. cran (Eng. crane), MW/MBret. garan (Gaul. 
tri-garanos ‘with three cranes’), Arm. krunk 
PIE *gem- ‘squeeze, (com)press’ > Gr. yepco ‘I am full/laden’, Lat. gemo ‘I groan’ 
(< *‘groan under pressure/a load’), CS gQStb ‘dense’, Lith. gamalas ‘lump’, (?) 
Ole. kum(b)la ‘to bruise, wound’, Mir. gemel ‘shackle’, Toch. AB kama- 
PIE *gel-/gl-ut- ‘ swallow; throat’ > Skr. galas ‘throat, neck’, Lat. glutto ‘glutton’ (=> 
Eng. gluttony), ORuss. glutati ‘to swallow’, OEng. ceole ‘throat’, (?) OIr. geilid 
‘grazes, eats’, Arm. klanem ‘I swallow’ 

The phoneme *g was, strangely, rather rare in PIE. It reflects as g in most languages 
(see p. 28 for palatalizations). The exceptions, with the voiceless reflex k, are, as usual, 
Germanic, Armenian, Anatolian, and Tocharian. 
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PIE *b and glottalic theory 

The fact that (early) PIE seems not to have had a phoneme *b is typologically unusual, 
since this kind of gap (a language with p but no b) seems to be practically unattested 
elsewhere 1 (languages having a b and no p, like Arabic, do exist - Proto-Celtic was such 
a language as well). Together with the curious constraint in PIE that a root could not 
have two voiced stops (p. 54), this has led some researchers (Gamkrelidze & Ivanov 
1995 being the most famous) to believe that PIE voiced stops ((*b), *d, *g) should be 
phonetically reinterpreted as glottalized (ejective) stops ((*p), *t, *k) (while the tradi- 
tional *p, *t, *k and *b h , *d h , *g h would be reinterpreted as *p ,h) , *t (h) , *k (h) and *b <h) , *d (h) , 
*g (h) - cf. p. 20-21). Since some languages with ejectives (like Navajo) are missing the 
bilabial stop p (though this is not the case in the majority of such languages - Quechua 
and Georgian have it, for instance), and sometimes have a constraint on having two ejec- 
tives in one word (though there are plenty of languages that do not), this could explain 
why (early) PIE had no *b and no **deg-type roots. The problem with this theory is that 
the reflexes of traditionally reconstructed PIE *b, *d, *g are indeed voiced stops in the 
majority of IE languages/dialects. In cases where this is not true, the voiceless reflexes 
are hardly relevant - Germanic and Armenian have gone through complete consonantal 
shifts, and in Tocharian all stops became voiceless. Assuming an independent nontrivial 
change of glottalized voiceless to voiced stops in almost all IE branches is highly implau- 
sible, while the empirical evidence in favor of reconstructing PIE glottalized stops (cf. 
Kortlandt 1985) is interesting but hardly compelling. Thus, it seems best to stick with 
the traditional reconstruction of voiced (*b), *d, *g for the last phase of PIE, though it is 
possible that pre-PIE indeed had glottalized *t, *k. Cf. also p. 154. 


Voiced aspirated stops 

The opposition of voiced and voiced aspirated stops is preserved in Indo-Aryan, in Italic 
(through fricativization of aspirates - p. 326-327), and, via complete consonant shifts, in 
Germanic and Armenian (p. 392-393, 426). Elsewhere (Anatolian, Celtic, Balto-Slavic, 
Albanian, Iranian), the aspiration is lost and the two series merge (traces of the old dis- 
tinction are seen indirectly through Winter’s Law in Balto-Slavic - p. 481). 


*b h 

PIE *b h reh,ter ‘brother’ > Ved. bhrata (Av. bratar-), Gr. (ppaipp ‘member of a 
brotherhood’, Lat. frater (=> Eng. fraternity), OCS bratn, Lith. broterelis Tittle 
brother’, Goth, bropar (Eng. brother), OIr. brathir (Gaul. Bratronos (pers. name)), 
Arm. elbavr, Toch. B procer 

PIE *b h ewH- ‘be, become, grow’ > Ved. bhavati ‘is, becomes’ (GAv. bavat), Gr. 
(puovxai ‘they grow’, Lat./«r ‘I were’, OCS bvti ‘to be’, Lith. buti ‘to be’, Goth. 
bauan ‘to dwell, inhabit’ (Eng. be), OIr. biid ‘is wont to be’, Arm. busanim ‘I grow’ 

PIE *gomb h os ‘ tooth ''' Ved. J ambhas , Gr. yo pxpoc, OCS zij hi, , Lith. zcnnbas corner, 
edge’, ONor. kambr ‘comb’ (Eng. comb). Alb. dhemb 

In most branches (Germanic, Armenian, Iranian, Celtic, Balto-Slavic, Albanian), the 
reflex is a plain voiced b (with a merger with the reflex of PIE *b, except in Germanic 
and Armenian, where PIE *b > p). Anatolian p-, -p- has the same reflex as PIE *b (cf. PIE 
*-p- > -pp-). Indie has voice together with aspiration (bh), while the reflex in Greek is a 
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voiceless aspirate (p [p h ]. The aspiration is indirectly seen in Italic, where the reflexes are 
fricative, cf. Latin initial voiceless f- and medial voiced -b- (Proto-Italic fricative *-(3-, 
p. 326-327). In Tocharian PIE *p, *b, and *b h merge into p. All this points to a PIE voiced 
aspirate (or breathy-voiced; p. 20-21) *b h , with most languages preserving voice and 
some aspiration. 


*d h 

PIE *h,wid h ew- ‘widow’ (perhaps originally: ‘unmarried’) > Ved. vidhava (YAv. 
viSava ), Gr. f|llteo<; ‘bachelor’, Lat. uidua, OCS vbdova, OPruss. widdewu, Goth. 
widuwo (Eng. widow’), Olr.fedb, Hitt. w(i)dati- 
PIE ' h | r u d h ro s / h , ro wd h o s ‘red’ > Ved. rudhiras, Gr. tpulfpoc (=> Eng. erythrocyte), 
Lat. ruber, OCS ndn, Lith. raudas ‘reddish’, OEng. read (Eng. red), OIr. ruad 
(Gaul. Roudius (pers. name)), Toch. B ratre 
PIE *d h uh 2 mos ‘smoke’ > Ved. dhumas, Gr. Oupoq ‘soul, spirit’, Lat .Junius (=> Eng. 
fume), OCS dynvb, Lith. diimas, OHG doum, Mir. dumacha ‘fog’, Hitt, tuhhui-, 
Toch. B tweve ‘dust, ashes’ 

Again, Indie has a voiced aspirated dh and Greek a voiceless aspirated 0 (written also 
0) [t h ]. In Balto-Slavic, Celtic, and Albanian, PIE *d h > d, which merges with PIE *d. The 
merger is also seen in Hittite initial t- and medial (non-geminate) -t- (written also as <d>). 
Germanic and Armenian (cf. p. 427) have voiced d with a consonantal shift, and in Toch- 
arian PIE *d h and *t merge into t. Proto-Italic has initial *]> and medial *-3-, reflected in 
Latin as f- and -d- (in most conditions) or -b- (before/after r, after u, and before /). 


* g h 

PIE *steyg h - ‘go, climb’ > Ved. ati-stigham ‘to climb up’, Gr. oiei^co ‘I walk’, OCS 
po-stignQti ‘to attain, to catch up with’, Latv. steigties ‘to hurry’, Goth, steigan ‘to 
climb’, OIr. -tiagait ‘walks’, Alb. shteg ‘footpath’ 

PIE *h 3 meyg h -, *h 3 mig h leh, ‘fog, cloud’ > Ved. meghas ‘cloud, rain’ (YAv. maeya-), 
Gr. opixA,r| ‘mist, fog’, OCS nibgla ‘fog’, Lith. migla ‘fog’, Dutch miggelen ‘to 
drizzle’, Ann. meg ‘fog’, Alb. mjegull ‘fog, mist’ 

PIE *g h l(e)h 2 d h - ‘ smooth’ > Lat. glaber, OCS gladbkb, Lith. glodiis, OHG glat ‘clear’ 
(Eng. glad) 

PIE *g h yields gh in Indie and % [k h ] in Greek. In Iranian, Balto-Slavic, Celtic, and 
Albanian PIE *g h (together with PIE *g) yields g, the same as in Gennanic and Armenian 
but in the latter two as a part of a general sound shift. In Hittite one finds k-, -k- (written 
also as <g>), and in Tocharian all velars merge into k. In Latin we find h in most cases 
(the exceptions being gr-/gl-,fu-, and -ng-). 


Grassmann s Law 

In Indo-Aryan and Greek, there is an independent dissimilation rule that causes the deas- 
piration of the first aspirate in a word: C h -C h > C-C h . Cf. PIE *b h ewd h - > Ved. bodhati 
‘is aware’ (not **bhodhati), Gr. jieufiopat ‘I give notice’ (not **(peu9opat). The fact that 
Greek exhibits voiceless deaspirated stops proves that the deaspiration occurred after the 
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Greek devoicing of PIE aspirated stops (which have a voiced reflex in Indie). This means 
that the deaspiration in Greek is younger and independent of Indie. 


Consonantal shifts in Germanic and Armenian 

In Germanic and Armenian, all stops experience a very similar chain shift, in which all 
PIE stops weaken. Voiced aspirates lose aspiration, voiced stops lose voicedness, and 
voiceless stops change to fricatives (in Germanic) or to voiceless aspirates (in Armenian): 


PIE 

gh 

*b, *d, *g 
*p, *t, *k 


Germanic Armenian 

b, d,g 
p, t, k 

fp,h *p c , t c , k 


Palatovelars and labiovelars change as well (see below). This seems to be an indepen- 
dent change in these two branches. In Germanic it is traditionally called Grimm’s Law 
(p. 392-393, cf. p. 426 for Armenian). In Germanic PIE voiceless stops have a voiceless 
reflex if preceded by a fricative ( s,f h,p), cf. PIE *nok w ts ‘night’ (Lat. nox) > Goth, nahts 
(not **nah]}s) or PIE *pisk- (Lat. piscis) > OEng .fisc (Eng .fish). 

In Germanic, Grimm’s Law operates always in initial position. However, in the mid- 
dle of the word it does so only if the syllable preceding PIE *p/t/k was stressed in 
pre -Proto-Germanic (pre-PGmc had a free stress, like Vedic or Greek, which it subse- 
quently lost - p. 396). If not (i.e., if the stress was on the syllable following *p/t/k or 
elsewhere), Verner’s Law (p. 393) operated, by which PGmc *f, *J), *x changed to 
voiced fricatives *(3, *6, *y (similar to Spanish <b, d, g> in most positions). In Gothic 
these yield b, d, g. 

PGmc *(3ro]3er ‘brother’ (Ved. bhrata, Gr. (ppaiqp) > Goth, bropar 
PGmc *fa6er ‘father’ (Ved. pita, Gr. jraifip) > Goth .fadar 

Grimm’s and Vemer’s Laws operate on PIE palato- and labiovelars as well (p. 21-25). In 
Old High German (and Modem Standard German), there is another chain shift, called the 
OHG sound shift, where Germanic stops and fricatives change once again (p. 393-394). 


The problem of the PIE voiced aspirates 

PIE voiced aspirates are reconstructed as such because this is what the reflexes point to - 
to aspiration or traces of it in some languages, and to voicedness in most of them. Indie is 
thought of as the most archaic since it preserves both voice and aspiration, though what 
are usually called “voiced aspirates’’ seem actually to be murmured/breathy-voiced stops 
(acoustically similar to aspirates) in many modem Indo-Aryan languages. Greek also has 
aspirates (though voiceless), and fricative reflexes in Italic (p. 326-327) point to original 
aspirates as well. In other branches, aspiration is lost, but the reflexes are always voiced 
(except in Tocharian, which is irrelevant). In Anatolian, where the exact pronunciation of 
stops is not clear (p. 177), one can still see that the reflexes of voiced and voiced aspirate 
stops are the same (non-geminate), while different from voiceless stops (geminate medi- 
ally). Even in Germanic and Armenian, with a complete chain shift of stops, the internal 
logic of the shift (with each series “weakening” by one step - losing aspiration, voiced- 
ness, and plosiveness respectively) points to voiced aspirates. The typological parallel for 
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the development of PIE voiced aspirates can be found in Indo-Aryan, where Old Indie 
voiced aspirates yield voiced aspirates (breathy-voiced), voiceless aspirates, or voiced 
stops in various modern Indo-Aryan languages. 

Thus, the reconstruction of PIE *b h , *d h , *g h based on its reflexes is quite solid. How- 
ever, the problem is again of typological nature - the PIE system of *T, *D, *D h , i.e., the 
system with voiceless, voiced, and voiced aspirated stops (and no voiceless aspirates), is 
very unusual. It is a near universal in world languages that a language has to have voice- 
less aspirates if it has voiced aspirates (like in Indo-Aryan languages), or that, if it has just 
one set of aspirates, those are voiceless (like in Ancient Greek) - there is no reason for 
the double markedness of voiced aspirates (voice and aspiration) if voiceless aspirates are 
lacking in a language. However, as we have seen, the reflexes in IE languages do indeed 
point to voiced aspirates. Though the problem can hardly be considered solved, it may 
just be that PIE, with its *T/D/D h system, was an (unstable) typological exception. And it 
is not the only one: there is at least one similar system in the world (with voiced aspirates 
but no voiceless aspirates) - in a dialect of the Austronesian Kelabit language. 2 Cf. also 
p. 18 and p. 153-154. 

There is no real compelling evidence for the reconstruction of PIE voiceless aspirated 
stops, although they used to be reconstructed and sometimes still are by certain authors/ 
schools. Indie voiceless aspirates originated in the combination of PIE *p/t/k and laryn- 
geals (but not *ph 3 - p. 17), cf. PIE gen. sg. *pntHos ‘path’ > Ved. gen. sg. pathas, Gr. 
natoq (p. 77), *h,esth, (p. 30), etc. For the laryngeal-aspiration of voiced plosives in 
Indo-Aryan cf. p. 30. 


Palatalized velars 

Concerning the reflexes of PIE *k, *g, *g h , which were palatalized velars (IPA [k J ], [g j ], 
[g jh ]) rather than palatal stops (IPA [c], [j], [j h ]), the IE languages can be divided into two 
groups - centum languages (after Lat. centum ‘hundred’ < PIE *(d)kmtom) and satem lan- 
guages (after Av. satam ‘hundred’). In the centum group, PIE palatalized velars depalatalize, 
i.e., reflect in the same way as the original PIE *k, *g, *g h . Most of the centum languages/ 
branches are positioned to the west - Greek, Italic, Celtic, Germanic, and some fragmen- 
tarily attested languages (like Lusitanian and Venetic). The only exception is Tocharian, 
positioned to the east. In the satem group, PIE palatovelars usually reflect as either affricates 
(like ts, ts, dz, dz . . .) or fricatives (like s, s, z,z,p,d. . .), the fricatives usually being the 
further reflexes of older affricates (through processes like ts > s, dz> z, etc.). Meanwhile, 
Balto-Slavic and Indo-Iranian (and fragmentarily attested Thracian, for instance) as satem 
languages are positioned more to the east. This is also true of Albanian, Armenian, and a 
part of Anatolian (Luwian and Lycian), which can be considered satem because of their 
reflexes of palatalized velars (but see p. 176). Considering that we find velars as reflexes 
in one group of languages and mostly affricates/fricatives (not depending on specific vow- 
els, etc.) in other languages, the reconstruction of palatalized velars for the proto-language 
seems to be the most reasonable solution. For more on centum! satem, cf. p. 27-28. 


*k 

PIE *Eer(d), *Erd- ‘ heart’ > Ved. srad-dha ‘faith’, Gr. icrjp, KapSia/icpaSia (=> Eng. 
cardiology), Lat. cor (=t> Eng. cordial), OCS srudnce, Lith. sirdis (Latv. sirds, 
OPruss. seyr), Goth, hairto (Eng. heart), OIr. cride, Hitt, ker, kard(i)- (Pal. kart-, 
CLuw. zart-, HLuw. zart(i)-), Ann. sirt, Toch. B karyah ‘hearts’ 
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PIE *h,ekwos ‘horse’ > Ved. asvas (YAv. aspa-), Gr. mnoc, (=> Eng. hippo-drome, 
hippo(-potamus), Myc. i-qo), Lat. equus (=t> Eng. equestrian), OLith. esva ‘mare’, 
Goth, aihwa-tundi- ‘thornbush, bramble’ (lit. ‘horse-tooth’, OEng. eoh ‘war- 
horse’), OIr. ech (< Ogham Irish EQO-DDI, CIb. Ekua-laku (pers. name), Gaul. 
Equos ‘the ninth month’, Epos (pers. name)), HLuw. azu- (Lye. esb-), Arm. es 
‘donkey’, Toch. B yakwe, Ven. ace. sg. ekwn 
PIE *h,nek- ‘get, reach’ > Ved. nas- ‘attain’ (GAv. nqsat ‘reaches’), Gr. eveyisxiv ‘to 
carry’, Lat. nancTscor ‘I get’, OCS nesti ‘to carry’, Lith. nesti ‘to carry’ (Latv. 
nest), Goth, ga-nah ‘to suffice’ (Eng. enough), OIr. ro-ic ‘reaches’, Arm. hasanem 
‘I arrive’, Toch. B erik- ‘take, grip’ 

In centum languages, the reflexes are the same as for PIE *k (p. 1 6). In satem languages, 
we find affricates and fricatives - OInd. s, Av. s, OPers. 6 (cf. OPers. vid- ‘house(hold)’ 
to Ved. vis- ‘settlement’, Lat. ulcus ‘village’ < PIE *wik-/woyk-) (p. 206), Slav./Latv./ 
OPruss. 5, Lith. s (< PBS1. *s), Alb. th [0] (cf. Alb. them ‘I say’, Ved. samsati ‘praises’, 
Lat. censed ‘I suggest, propose’ < PIE *ken/ms-), and Ann. 5 (PIE *kw > s, p. 434). In 
Luwian we find z [Is], and in Lycian s from PIE *k before *e/i/y/w (p. 176, cf. always 
k in Hittite). 


*g 

PIE *ge/onu ‘knee’ (p. 74) > Ved .janu (YAv. zanu-), Gr. yovu, Lat. genii, Goth, kniu 
(Eng. knee), Hitt, genu. Arm. cunr, Toch. A kanwem ‘knees’ (du.) 

PIE*h 2 egros ‘field’ > Ved. ajras, Gr. dyp6<; (=> Eng. agro-nomv), Lat. ager (=> Eng. 
agri-culture), Goth, akrs (Eng. acre) 

PIE *gerh 2 - ‘ripen, mature, old’, *grh 2 nom ‘grain, com’ > Ved .jarati ‘grows old’, 
jirnas ‘old’ (YAv. zar'ta- ‘decrepit’), Gr. yspcov ‘old man’ (=> Eng. ger-iatrics), 
Lat. granum ‘grain’ (=> Eng. grain), OCS zureh ‘ripe’, zruno ‘grain’, Lith. zirnis 
‘grain’ (Latv. zirnis, OPmss. syrne), Goth, kaiirn (Eng. corn), OIr. gran ‘grain’, 
Arm. cer ‘old’ 

In centum languages, the reflexes are the same as for PIE *g (p. 17). In satem lan- 
guages, we find mostly affricates and fricatives - OInd.y [ck], Av. z, OPers. d with depala- 
talization (cf. OPers. daustar- ‘friend’, YAv. zaosa- ‘pleasure’, Ved. yo,y- ‘like’, Lat. gusto 
‘I taste’ < PIE *gews- ‘taste, like, choose’) - p. 206, Slav./Latv. z (written as <s> in 
OPmss.), Lith. z (< PBS1. *z), Alb. dh [3] (cf. Alb. dhemb ‘tooth’ < PIE *gomb h os, 567), 
Arm. c [Is] (with regular devoicing as with non-palatalized stops, p. 426). 


*g" 

PIE *g h eym- ‘winter’ > Ved. heman ‘in winter’ (cf. himds ‘cold, snow’ > Hima-laya; 
Av. zdtn-, Pers. dai), Gr. ^etpcbv, Lat. hiems, OCS zima, Lith. ziema (Latv. ziema, 
OPmss. semo), ONor. goi ‘month from the middle of February to the middle of 
March’, OIr. gam (Gaul. Giamos (pers. name)), Hitt, gimm(ant)-, Alb. dimer. Arm. 
jiwn ‘snow’, Toch. A sarnie 

PIE * g h ew- ‘call’ > Ved. havate ‘invokes’ (YAv. zavaiti ‘calls’), Gr. Kau-^aopai ‘I 
speak loudly, boast’, OCS zuvati, Lith. zaveti ‘charm, fascinate’, (?) Goth, gup 
‘god’ (lit. *‘the one who is called’, Eng. god), OIr. guth ‘voice’ (Gaul, gutu-ater ‘a 
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kind of druid’ < *‘father of invocations’), Arm . jaunem ‘I devote’, Toch. B kwa- 
‘call out to, invite’ 

PIE *h 2 em/iig h - ‘bind, squeeze; narrow’ > Ved. amh- ‘compress’, anihas ‘trouble’ 
(YAv. qzah- ‘difficulty’), Gr. oiyxco ‘I squeeze’, Lat. ango ‘I squeeze/choke’, angus- 
tus ‘narrow’ (=> Eng. anguish), OCS Qz-bkb ‘narrow’, Lith. ankstas ‘narrow’ (with k 
instead of z), Goth, aggwus [ljg] ‘narrow’ (Germ. Angst => Eng. angst, ONor. angr 
=> Eng. anger), OIr. cum-ung ‘narrow’ (MW ing), Hitt, hamank- ‘tie’, Ann. anjuk 
‘narrow’, Toch. B entse ‘greed, envy’ < *enkse 

In centum languages, the reflexes are the same as for PIE *g h (p. 19). In satem lan- 
guages, we find mostly affricates and fricatives - OInd. h (from older *jh), Av. z, OPers. 
d, Slav./Latv. z (OPruss. <s>), Lith. z, Alb. initially d- (cf. Alb. done, Gr. %sip, Ann .jern 
< PIE *g h esr- ‘hand’) or dh-, medially mostly -dh- (p. 567-568). In Armenian we find 
j [dz], a voiced pair of c < PIE *g. In languages where the reflex of PIE *g is the same as 
the reflex of *g h , PIE *g h has the same reflex as *g, the only exception being Albanian. 


TABLE 1.2 THE REFLECTION OF PIE PALATOVELARS IN SATEM LANGUAGES 


PIE 

Luw. 

Lyc. 

OInd. 

Av. 

OPers. 

Lith. 

Latv. 

Slav. 

Arm. 

Alb. 


z [ts]/yt 

slk 

s 

s 

e 

s 

s 

s 

s 

th [0] 

*g 

0 k/y/0 ) 


j [<fc] 

z 

d 

z 

z 

z 

c 

dh [6] 

*g h 

(k/y/0) 


h 

z 

d 

z 

z 

z 

j [*] 

d/dh-, -dh- 


Labiovelars 

The centum/ satem distinction is relevant concerning the PIE labialized velars (*k w , *g w , 
*g wh ) as well. In the centum group (where *K = *K), they usually yield different reflexes 
than the regular velars (*k, *g, *g h ). The labiovelars either are preserved (as k w , g"', x w , 
etc.) or are (subsequently) changed to full labials (p, b, w) or plain velars ( k , g). Tocharian 
seems to be the only centum language that shows no distinction between old plain and 
labialized velars (p. 454). In the satem group (where *K ± *K), the labiovelars usually 
reflect as plain velars (*K = *K W ) - this is the case in Balto-Slavic and Indo-Iranian 
(although there are perhaps some indirect traces of labiovelars in Indo-Iranian). But 
in Albanian (p. 567) the reflexes of plain and labialized velars are distinct before front 
vowels (*K ± *K W ), and the same is true for *k ^ *k w before front vowels in Armenian 
(p. 427). In Luwo-Lycian, which preserves a special reflex of *k in certain conditions 
(p. 176), the PIE labiovelars have separate reflexes as usual in Anatolian. Traditionally, 
the centum group is represented by the formula *k = *k ^ *k w and the satem group by 
*k = *k w ^ *k, but Armenian, Albanian, and Luwo-Lycian (and Tocharian with k from all 
PIE velars) cannot be neatly subsumed into these formulas (see p. 27). 


*k' v 

PIE *k"etwores ‘four’ > Ved. caU’aras (YAv. caOparo) (with palatalization), Gr. 
Teiiapcq, Lat. quattuor, OCS cetvre (with palatalization), Lith. keturi, Goth, fidwor 
(Eng .four), OIr. ceth(a)ir (Gaul, petuar-, OW petguar), (?) Alb. kater (if not from 
Lat.), Arm. c c o;W (with palatalization), Toch. B stwer (with palatalization) 
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PIE *penk"e ‘ five’ > Ved. pahca (YAv. panda) (with palatalization), Gr. rcevxe (=> 
Eng . penta-gram), Lat. qumque, OCS pqtb < *penktis, lAih.penki, Goth. /zm/(Eng. 
five), OIr. coic (Gaul, pempe-, OW pimp). Alb. pese (with palatalization) 

PIE *k"el- ‘turn’, *k"ek"los ‘wheel’ (with root reduplication) > Ved. cakra- ‘wheel’ 
(=> Eng. chakra, YAv. caxra- ‘wheel’) (with palatalization), Gr. Kuickoq ‘ring, cir- 
cle’ (=> Eng. cycle), nokoq (< *k w olos) ‘axis’ (=> Eng. pole), teXoc, (< *k w elos) 
‘end, outcome’ (< *‘full circle’, => Eng. teleo-logy), Lat. colus ‘distaff’, OCS 
kolo ‘wheel, circle’, Lith. kaklas ‘neck’, OEng. hweol (with Grimm’s Law, PGmc 
*x w ex w l-, cf. Eng. wheel), hweogul (with Vemer’s Law), Mir. cul ‘chariot’, Toch. B 
kokale ‘cart, wagon’, Alb. sjell ‘bring’ (with palatalization) 

In centum languages, PIE *k w remains unchanged in part of Italic (Lat. <qu> but Sabel- 
lian p), Anatolian (Hitt./Luw. <ku>), Mycenaean Greek (<q>), and some Celtic languages 
(Ogham Irish <Q>, CIb. <ku>). Proto-Greek *k w changes to ji, x, or k in Attic, depending 
on what comes next (na, no, nC, xi, xe, ku - p. 292). In Q-Celtic languages, Proto-Celtic 
*k w remains unchanged or loses the labialization later (OIr. c), while in P-Celtic it 
changes to p (p. 357-358, 363). In Germanic the reflex is *x w (Goth. <fu>, OEng. <hw>, 
Eng. <wh> - still pronounced [h w ] in some dialects) by Grimm’s, or *y w by Vemer’s Law 
(p. 20, 392). In certain cases, one finds *f instead of *x w in Germanic (like in ‘four’ and 
‘five’ above, or ‘wolf’ - p. 35), which can be explained in different ways. Tocharian has 
k for all velars. In satem languages, PIE *k w usually yields k, just like PIE *k. However, 
Luwian preserves the distinction of ku < PIE *k w , k < PIE *k, and z [c] < PIE *k in certain 
conditions (p. 176). One finds different outcomes of PIE *k w , *k, and *k before front 
vowels (PIE *e, *i) in Albanian and Armenian as well - Alb. s : k:th and Ann. c c : t'\ 5 
(see p. 427, 567). 


The reflex of PIE *kw/Kw in centum languages 

PIE *k w seems to have been distinct from PIE *kw (*k + *w, likewise *g w(h) *g (h) w), 
though the latter was rare. In Greek, PIE *kw/kw > kk (though this is disputed), cf. Gr. 
kcuocoq ‘pond’, OCS lokv ‘puddle’, Lat. lacus ‘lake’ < PIE *laku/w- (or *lok-/lk-). The 
reflex of PIE *kw in Latin is not clear (perhaps it is c in initial position before a), cf. 
Lat. equus (p. 22) and queror ‘I lament’ (Ved. svasiti ‘blows’ < PIE *kwes-), but caseus 
‘cheese’ (OCS kx’asu ‘leaven’ < PIE *kweh 2/3 /o(t)s-) and canis ‘dog’ (OInd. sva < PIE 
*kwon-). PIE *kw/kw yields PCelt. *k w (cf. PCelt. *ek w os ‘horse’), but not in Gennanic 
and Tocharian (cf. Goth, aihwa-, Toch. B vakwe, p. 22). 


*g" 

PIE *g w ows ‘cow’ > Ved. gaus (Av. gaus), Gr. Pouq, Lat. bos (a Sabellian loanword, 
should be **u5s in Lat.), OCS govgzdb ‘bovine’, Latv. giiovs, OEng. cu (Eng. 
cow), OIr. bo (Gaul. Bo-marus (pers. name)), HLuw. wawa/i- (Lye. wawa-), Ann. 
kov, Toch. B ke u 

PIE *g w en(e)h 2 ‘ woman’ > Ved .jam (with palatalization), gnas ‘divine woman’ (GAv. 
jmi-, gona-), Gr. yuvf) (=> Eng. gyne-cology, dial. Boeotian Pavoi), OCS zena, 
OPruss. genno, Goth, qino (cf. Eng. queen with semantic change), OIr. be/ben 
(Gaul. gen. pi. bnanom), CLuw. wana-, Ann. kin, Toch. B sana (with palataliza- 
tion), Messapic benna 
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PIE *g w e/orH-/*g w rH- ‘mountain’ > Ved. giris (YAv. ga‘ri-), Gr. 8eipd<; ‘ridge of a 
chain of hills’, Popery ‘north wind’, OCS gora (Slav, also ‘forest’), Lith. giria 
‘forest’, Alb. gur ‘stone’ 

In centum languages, PIE *g w as such is not preserved anywhere (except partially in 
Latin after -/?-). In Latin the usual reflex is it [w] (the exceptions being gr-lgl- and -ngu-, 
while in Sabellian one finds b- p. 327). Hittite has <ku>, and Luwian has w. Proto-Greek 
*g w changes to P, 8, y in Attic (Pa, Po, PC, Pi, 8e, yu - p. 292), but not in complete paral- 
lelism to voiceless counterparts (thus Pi < *g w i, but n < *k w i). PIE *g w yields PCelt. *b 
(while PIE *k w is preserved and PIE *g wh > PCelt. *g w , p. 363), and, according to Grimm’s 
Law, PGmc *k w (Goth. <q>, OEng. <cw>/Eng. <qu>, OHG <qu>, ONor. <kv>). In 
satem languages (including Armenian but not Albanian before front vowels), PIE *g w is 
reflected the same as PIE *g. 

*gwh 

PIE *sneyg" h - ‘snow’ > Ved. snih- ‘wet(ness)’ (‘snow’ in Prakrits, YAv. snaeza- 
‘snow’, with palatalization), Gr. ace. sg. vupa, Lat. nix (< *nig-s), gen. sg. niuis, 
ninguit ‘it snows’ (with -n- infix, p. 95), OCS snegb, Lith. sniegas, Goth, snaiws 
(Eng. snow), OIr. snige 

PIE *g wh en- ‘strike, kill’ > Ved. ghnanti ‘they strike, kill’ (< *g wh nenti), hcinti ‘strikes, 
kills’ (<*g wh enti, with palatalization) (YAv. ja'nti ‘strikes, kills’), Gr. (povEUCO ‘I 
kill’, fleivco ‘I strike’, Lat. de-fendo ‘I ward off, repel’ (=> Eng. defend), OCS goniti 
‘to chase’, zgti ‘to reap’ (with palatalization), Lith. giiiti ‘to drive (animals)’, ONor. 
gunnr ‘war, battle’, OIr. gonaid ‘strikes, kills’, Hitt, kuenzi ‘kills’, Arm. gan ‘beat- 
ing’, Toch. B kdsk- ‘scatter to destruction’ 

PIE *d h eg wh - ‘ burn’ > Ved. ni-daghas ‘heat’, dahati ‘bums’ (YAv. dazaiti ‘burns’, with 
palatalization), Gr. xecppa ‘ashes’, Lat. febris ‘fever’ (=> Eng . fever), foueo ‘I (keep) 
warm’, OCS zegg ‘I bum’ (< *geg- < *deg- with assimilation), Lith. degti ‘to burn’, 
Goth, dags (Eng. day), OIr. daig ‘flame’, Alb. djeg, Toch. AB tsdk- ‘burn up’ 

In centum languages, the labialization of PIE *g wh is usually not preserved (except in 
Latin and Gothic after -/?-). In Proto-Italic one expects *h w -, *-y w -. In Latin this yields initial 
f- and medial -u- (but -ngu- and -br-). Proto-Greek *k wh (with Greek devoicing of aspi- 
rates) changes to <p, ft, % in Attic (<pa, (po, <pC, (pi, He, jp - P- 292), in parallelism to voiced 
counterparts. PIE *g wh yields PCelt. *g w (> OIr. g, p. 363). In Germanic, PIE *g wh yields 
w or g (before u, o, and consonants), although not everything is clear. After n, the expected 
*g w is seen (cf. Goth, saggws [sang w s] ‘song’, Gr. oj-Kpr) < PIE *song wh -). In some cases, 
b is also posited as a reflex by some; e.g., both Goth, brinnan (Eng. bum) and OEng. wearm 
(Eng. warm) are sometimes derived from PIE *g wh erm- ‘burn’ (Gr. flEppot; ’hot’, hdt.for- 
mus ‘warm’, etc.), but alternative etymologies exist as well. In Anatolian, Balto-Slavic, and 
Albanian, PIE *g wh reflects the same as *g w (in Armenian, *g wh reflects as *g h ). In Albanian, 
PIE *g w(h) merges with PIE *g <h) before back vowels (and yields g), while in front of *e/i it 
yields z (cf. PIE *g wh ermo- > Alb. zjarr ‘fire, fever’) unlike normal velars. 


Loss of labialitv 

In some centum languages that preserve PIE labiovelars in one way or another, there is a 
tendency toward loss of labiality in some positions - i.e., in front of rounded vowels ( u 
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and sometimes o - a case of phonetic dissimilation). This explains Gr. kd, yu, (with 
plain velars before *u) and examples like Lat. cultura ‘cultivation’, incola ‘inhabitant’, 
but inquilinus ‘tenant’. In Latin the labiality is lost before consonants as well, cf. Lat. 
reliquiae ‘remains’ but relictus ‘left behind’. 


The problem of three velar series in PIE 

According to the most usual and traditional reconstruction, dating from the 19th century 
and Karl Brugmann, PIE had three velar series (regular, palatalized, and labialized): 

*k, *g, *g h 
*g, *g h 

*g w *gwh 

This kind of system is complex and not typologically too frequent, but it is nonetheless 
attested - e.g., in the modem Iranian language Yazgulyam 3 (where it is an innovation of 
no historical relation to the PIE three series). There is a long tradition of denying the exis- 
tence (or originality) of the three velar series in PIE, starting with Antoine Meillet and, 
surprisingly, still alive today. In this approach, the palatovelars are usually considered 
somehow secondary. 

The main argument against the PIE distinction of *K : *K : *K W is that the major- 
ity of the languages distinguish only the reflexes *K : *K or *K : *K W . However, as 
already said, the traditional formulas (*K = *K ^ *K W for centum and *K = *K W ^ *K 
for satem languages) are not valid for all languages. Albanian (p. 567) and Armenian 
(p. 426^-27) distinguish all three series before front vowels (Armenian only in voice- 
less stops), while Luwian and Lycian distinguish the reflexes of PIE *k : *k : *k w (in 
voiceless stops) in certain conditions (p. 176), cf. CLuw. kis- ‘to comb’ < PIE *kes- 
(p. 16), CLuw. zart- ‘heart’ < PIE *krd- (p. 21), CLuw. kui- ‘who, what’ < PIE *k w i- 
(Lat. quis, p. 88). 4 The adversaries of the PIE three series usually offer no plausible 
alternate solutions for these, but simply hold, if only implicitly, that Anatolian evi- 
dence is scarce and inconclusive, or that Armenian and Albanian are too innovative and 
attested too late to be convincing. 

A convincing argument for the three original series is found in the fact that PIE *K 
: *K : *K W cannot be predicted from phonemes in their surroundings (though some 
tendencies do exist). Thus, we find palatovelars in contact with front vowels (direct or 
indirect, before or after them): *kerd- (p. 21), *g h eym- (p. 22), *h,ekwos (p. 22), *gneh r 
‘know’ (Ved . janati ‘knows’, OCS znati ‘to know’, Goth, kann ‘to get to know’, etc.), 
etc. However, we also find palatovelars when not in contact with front vowels: *kwo(n) 
(p. 41), *klownis (p. 51), *kolh 2 m-/klh 2 m- ‘straw’ (Gr. Kakapoq ‘reed, stalk’, Lat. culmus, 
PS1. *solma > BCMS slama, OEng. healm > Eng. haulm), etc. In cases like *gomb h os 
(p. 18), one could argue the connection with the verbal stem *gemb h -, etc. Likewise, reg- 
ular velars are found even when front vowels follow - *kes- (p. 16), *kelh 3 - ‘rise’ (Lat. 
ex-cello ‘I rise’, Lith. kelti ‘to lift, raise’, etc.), *gerh 2 nAv- (p. 17), etc., though, of course, 
one can always assume analogies in some cases (to variants *kos-, *kolh r ). There are 
also pairs like *leyg h - (p. 32), *steyg h - (p. 19), *sneyg wh - ‘to snow’ (p. 25), and even “min- 
imal pairs” like *b h erg h - ‘take care of’ (OCS ne-bregg ‘I neglect’, Goth, bairgan ‘to pre- 
serve, keep’) and *b h erg h - ‘tall, high’ (Ved. brhant-, OHG berg ‘hill’, Arm. barjr ‘high’). 5 
All this proves that palatovelars were phonemic in the last phase of PIE, and that they did 
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not depend synchronically on the supposed phonetic environment prone to palatalization. 
Certain possible PIE variants like *lewk-/lewk- ‘shine’ (cf. Ved. rokas ‘light’ but rusant- 
‘shining’) are probably just remnants of some pre-PIE leveling, perhaps similar to PIE 
variants with *p/b/b h (p. 17). 

One usual argument against three series of velars in PIE is the unexpected frequency 
of these three series. One would expect that the unmarked plain velars would be the most 
frequent, while in reality they are the least frequent. However, this in no way proves that 
palatovelars are “secondary” or something of the sort. It just means that the system was 
unstable, which is clear from the fact that it was not preserved in any of the IE languages. 
The low frequency of plain velars is surely to be accounted for by certain conditions 
in the PIE prehistory. A frequent proposition is that PIE plain velars historically derive 
from uvular velars (thus pre-PIE *q > PIE *k, while, for instance, pre-PIE *k > PIE *k 6 ). 
Another possible hypothesis is that the PIE velar system is to be derived from an older 
system with plain velars only and a number of vowels, similar to Proto-Uralic, which is 
often mentioned as the most likely relative of IE (p. 7). It is imaginable that a system like 
that, after a massive merger of vowels (p. 42 — 43), would result in what we have in PIE. If, 
let us suppose, PIE *k w (e) is to be derived from pre-PIE *ku, *ko, PIE *k(e) from pre-PIE 
*ki, *ke (and possibly from *ku and *ka as well), and PIE *k(e) from pre-PIE *ka only, 
it is logical to expect that PIE *k would be the least frequent. 

As for the so-called centum reflexes in satem languages, which are sometimes adduced 
as proof of the secondary /loose nature of the three series, these seem to be mostly of sec- 
ondary, post-PIE nature - due either to later depalatalizations in certain satem languages 
(like the regular depalatalization before *m/l/r and back vowel in Balto-Slavic, cf. Lith. 
akmuo (p. 41) but asmenys ‘edge, blade’ 7 ), or loanwords from centum languages (like 
OCS breg-b ‘bank, shore’, Slav, also ‘hill’ < PS1. *bergr> probably from Germanic - see 
above for PIE *b h erg h -). Some of the centum reflexes may also be explained by rare PIE 
palatalized/non-palatalized variants like *lewk-/lewk (see above). 


Centum and satem languages 

As already mentioned (p. 21, 23), the centum! satem distinction in IE is based on differ- 
ent treatment of PIE *K and *K W in IE branches/languages. This used to be considered 
a major dialectal isogloss in PIE that divided PIE into western ( centum ) and eastern 
(satem) dialects. However, this view is nowadays untenable. The west/east divide was 
broken already by Tocharian (which is the easternmost IE language), but more import- 
ant is the fact that the traditional formulas (*K = *K ^ *K W for centum and *K = *K W ^ 
*K for satem languages) do not work for Armenian, Albanian, and Anatolian. Although 
other interpretations are possible (since this is essentially a question of terminology, 
not essence), the centum/satem distinction could be explained as follows - in satem 
languages, PIE *K yields affricates/fricatives, while the labiality of PIE *K W is usually 
(though not always and everywhere) lost. The centum group would then consist of those 
languages where PIE *K > K (while *K W is usually, though not always and forever, pre- 
served). Thus, Albanian, Annenian, and Luwian would be satem and Tocharian would 
be centum. The fact that in Anatolian one finds both satem (Luwian, Lycian) and centum 
languages (Hittite, Palaic), and the fact that often very close PIE dialects (like Indo- 
Iranian and Greek) are divided by this isogloss would point to the conclusion that this is 
not a crucial dividing point in PIE, as was once thought, but is a rather trivial phonetic 
innovation/variable. 
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centum languages 

satem languages 

centum/satem languages 


MAP 1.1 THE CENTUM-SATEM BRANCHES OF IE 


Later palatalizations 

In many IE branches, sub-branches, and languages, later palatalizations of the PIE velars 
(and, less often, other phonemes like dentals) occur before front vowels - e.g., in Indo- 
Iranian (p. 226, 270), Slavic (p. 528-529), Latvian (p. 496), Germanic, Anatolian (e.g., 
Hitt. *ti > zi, *di > si), Armenian (p. 427), Albanian (p. 569), Tocharian (p. 459-460), and 
most Romance languages (but not in Classical Latin). The change of *Cy > C j , which 
can also be interpreted as a kind of palatalization, can also be seen in, e.g., Lithuanian 
(p. 496), Slavic (p. 529-530), and Greek (p. 293). 


TABLE 1.3 BASIC REFLEXES OF PIE STOPS (palatalizations and other special changes, like 


Vemer’s Law, are not included) 
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bh 
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b 

b 

b 

P 

P 

*b 
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b 

P 
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dh 

d 

9 
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d 

d 

d 

t 

t 
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d 
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1 
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gh 

g 

X 
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g 

g 

g 

k 
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*g 

g 

g 
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s 

s 
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kk 

z, k 

*k 

s 
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z 
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z 
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k-, -0- 
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k 
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k 
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k 
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g 
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u 

g 

g 

q 

ku 

w 

*b 

k 

g 

k 
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gh 

g 
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g 

g 

w 

ku 

w 

*gW 

g 

g 
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PIE FRICATIVES 

PIE fricatives were *s and the so-called laryngeals (*h,, *h 2 , *h 3 ), or at least *h 2 and *h 3 , 
which were almost certainly some kind of /7-sounds. In PIE, *z was a voiced allophone 
of *s, like in *nisdos [nizdos] ‘nest’ (OCS gnezdo, OEng. nest). For PIE allophones *J>, 
*6, *6 h see p. 53. 

In IE languages, PIE *s either remains unchanged or changes (in various conditions) 
to s, h (glottal fricative), x (velar fricative), z, or (z > z >) r. The change to h, typologi- 
cally common (cf. many varieties of modem Spanish or many Polynesian languages), 
occurs independently in Iranian, Greek (initially and originally intervocalically), and 
Armenian (similar to Greek), and in some sandhi-conditions in Old Indie and Insular 
Celtic (cf. also Slavic, where PIE *s > *s > x in certain conditions - p. 526-527). The 
change to r (rhotacism), also typologically common (cf. the sporadic intervocalic z > r 
in Western South Slavic, e.g., Sin. more ‘can’ to OCS mozetb), occurs independently 
in Latin, North-West Germanic, Eretrian Ionic Greek dialect, and in certain sandhi- 
positions in Old Indie. 

*S 

PIE *septm ‘seven’ > Ved. sapta (YAv. hapta), Gr. zmd, Lat. septem (=t> Eng. 
September), OCS sednib, Lith. septvni, Goth, sibun (Eng. seven), OIr. secht" 
(Gaul, sextan-), Hitt, sipta-miva- ‘seven-drink’, Arm. ewfn, Alb. shtate, Toch. 
A spat 

PIE *(sti-)steh 2 - ‘stand’ > Ved. stha-, tisthati ‘stands’ (YAv. histaiti ‘stands’), Gr. 
i'oTT|pi ‘I make to stand’, oiaxoq ‘placed, standing’, Lat. sto ‘I stand’, sisto ‘I place 
(cause to stand)’, OCS stati ‘to stand’, Lith. stoti ‘to stand’, Goth, standan ‘to 
stand’ (Eng. stand), OIr. sissidir ‘stands’ (CIb. SISTAT ‘puts’), Hitt, istantave- 
‘stay put’, Toch. B ste ‘is’ 

PIE *muHs ‘mouse’ > Ved. mis-, Gr. put; - gen. sg. puoq, Lat. mus - gen. sg. muris, 
OCS mvsb, OEng. mus (Eng. mouse) 

Except in special cases, PIE *s does not change in Anatolian (written as <s> in Hittite), 
Continental Celtic (but cf. 365-366), and Tocharian. In Balto-Slavic and Indo-Iranian, 
the so-called RUKI mle (p. 206, 495, 526-527) operates, according to which PIE *s after 
*r, *u, *k, *i (and *r, *w, *k w , *y - in Hr. after *k as well) turns to *s, which then yields 
OInd. s (written also as <s>), Av. s (p. 269), Lith. s, Latv./OPmss. 5, Slav, x (cf. also 
PIE *rs > rs in Ann. - p. 428). In Slavic, x (cf. CS jwxa, Ved. vus-, Lat. ius - all ‘broth’, 
p. 33) can later be again palatalized to s (or s) (p. 528-529), while Lithuanian shows many 
exceptions from the law (p. 495). In Iranian, *s generally changes to h (except in *sC) - 
p. 264. In Greek (p. 293), *sC and final *-s# are preserved. Elsewhere, *s yields /?, written 
as <’> initially and preserved intervocalically in Mycenaean, but disappearing in Classi- 
cal Greek (gen. sg. Gr. vsepsoq ‘cloud’, OCS nebese ‘heaven’ < PIE *neb h ese/os). Inter- 
vocalic *s changes to r in Latin in historical times (p. 327); otherwise, it is unchanged. In 
Germanic, *s becomes voiced word-finally (later again devoiced in Gothic) and medially 
in the case of Vemer’s Law (p. 393). This *z is preserved in East Germanic, but changes 
to r in North-West Germanic (cf. Goth, batiza to Eng. better). In Armenian (p. 427^-28), 
PIE *s is preserved in certain clusters (Ann. mis, OCS mgso < PIE *mems- ‘meat’) but, 
as in Greek, yields h- (or drops) initially (cf. Ann. at ‘salt’ ~ Lat. sal but bin ‘old’ ~ Lat. 
senex) and disappears intervocalically. In Albanian, initially we get gj- (cf. Alb. gjume, 
OCS Sbnb, Gr. uwvoq < PIE *supnos ‘sleep’) or sh-, and medially perhaps -sh- or -h- (cf. 
p. 568). 
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A-mobile 

Some PIE roots appear both with initial *s- and without it. This facultative *s- is called 
s-mobile (mobile s ). An example is PIE *(s)poHimn- ‘foam’ > Ved. phena-, OCS penv 
(nom. pi.), OEng. jam (Eng. foam) but Lat. spuma, Lith. spaine. S-mobile is usually 
explained through some kind of pre-PIE sandhi reanalysis (e.g., *wlk w os pekye(ti) ‘wolf 
sees’ > *wlk w os (s)pekye(ti) - cf. Lat. specio ‘I see’ but Ved. pasyati ‘sees’). 


Consonantal reflexes of PIE laryngeals 

The exact phonetics of PIE “laryngeals” (*h b *h 2 , *h 3 ) are controversial. The name 
“laryngeals” is usually taken to be a traditional misnomer, though it is not impossible 
that *h, was, for instance, indeed a laryngeal/glottal [h]. One possibility is that they were 
velar *x, *x, *x w (parallel to stops), which would agree with their vowel-coloring charac- 
teristics (*h 2 e > *h 2 a and *h 3 e > h 3 o, p. 43); the other that *h, was glottal/laryngeal *h or 
palatal *g, 8 that *h, was a velar *x (uvular or pharyngeal *h are far less likely), and *h 3 
a labialized voiced velar *y w (or uvular *k w , for instance) or voiced pharyngeal (cf. the 
change *ph 3 > *b (p. 17) that would point to a voiced *h 3 ), etc. Other phonetic interpreta- 
tions are also possible. The *h 2 was the most frequent. The laryngeals disappear in most 
languages and positions, leaving only indirect traces (cf. p. 481, 523 for BSL), which is 
the reason they will be dealt with together with vowels (p. 43). Here, we shall only take a 
look at their consonantal reflexes (as h or aspiration). 

In Anatolian, *h 2 (and probably initial *h 3 -) reflects as h in many positions. In Hittite, 
initial *h 2 - yields h- (Hitt, hartaggas, Gr. apietoq < PIE *h,rtkos ‘bear’). Medially, -/?/?- is 
preserved in many conditions (Hitt, pahhur, Gr. Jiup < PIE *peh 2 wr ‘fire’), but not all 
(p. 177). Many suppose that initial *h 3 - also yields h- in Anatolian (Hitt, hastai, CLuw. 
has-, Ved. asthi, Lat. os < PIE *h 3 esth, ‘bone’), at least sometimes, though some contest 
that (and reconstruct PIE *h,osth, and the like instead). Medial *-h 3 - generally disappears 
in Anatolian (but cf. p. 177). 

In Indo-Iranian, laryngeals (mostly *h 2 ) aspirate preceding stops, both voiceless 
(p. 21) and voiced. Cf. PIE *d h ugh 2 ter ‘daughter’ > Ved. duhitd (with *-gh 2 - > *-g h - > -/?-), 
Gr. Suydirip (p. 47), *megh 2 > Ved. mahi (p. 53, 80), or *h,egHom > aham (p. 82) (with 
*gH > * 3 h > h). Initial *h 2/3 - seems to yield h- in some cases in Armenian, though the exact 
conditions are unclear and not all scholars accept this (p. 430), and less convincingly in 
Albanian. Cf. Ann. haw, Lat. auis < PIE *h 2 ew- ‘bird’ and Alb. herdhe ‘testicles’, Gr. 
opxn; ‘testicle’ < PIE *h 3 erg h -. 

The laryngeals apparently disappeared already in PIE medially before *y (Pinault’s 
Law ), 9 cf. PIE *krewh 2 s (with -i- in Ved. b-avis- and -a- in Gr. Kpeaq) (p. 16) but adjec- 
tive *krewh 2 yos > *krewyos (Ved. h-avvas ‘bloody’, cf. also the accent in Lith. kraujas, 
which points to no laryngeal). For initial *Hy- cf. p. 34. There were perhaps some other 
possible very specific PIE synchronic rules for laryngeal deletion - e.g., in *VHTRV 
position (cf. PIE *h,weh,- > Ved. vdti ‘blows’ but PGmc *wedran ‘weather’, OCS vedro 
‘nice weather’). 

PIE RESONANTS AND GLIDES 

PIE segments were syllabic (vowels), asyllabic (stops, fricatives), or syllabic/asyllabic 
(resonants + glides). PIE asyllabic resonants were *m, *n, *1, *r and glides *y, *w. 
When in syllabic position, they are written as *m, *n, *1, *r and *i, *u (the last two were 
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functionally obviously vowels). Syllabic variants appear mostly in predictable positions: 
-CRC- (gen. sg. *krde/os ‘heart’), #RC- (gen. sg. *udne/os ‘water’), and -CR# (*yek"r 
‘liver’). In other positions, asyllabic variants appear: -VRC- (*kerd ‘heart’), -CRV- (*g h ' 
wehp ‘beast’), #RV- (*wodr), -VR# (*ph,ter ‘father’). However, the occurrence of an 
a- syllabic variant was not always completely predictable - cf. the initial resonant clusters 
like *wr- (not **ur-) or *ml- (not **ml-) (p. 52). The general rule in case of two potential 
syllabic resonants in a coda (-CRRC- or -CRR#) was that the second one became syllabic 
(“the *RR principle”), cf. instr. pi. *kwnb h is ‘with dogs’ (Ved. svabhis, not **kunb h is) 
(p. 41). The exceptions here were cases with the present infix *-n- (p. 95), which was 
always asyllabic (cf. *yung- > Lat. iungo ‘I yoke’, *link w - > Lat. linqud ‘I leave’ and 
not **iwng-, **lynk w -); accusatives *-im/-um (not **-ym/-wm, p. 72-73) and pi. 
*-ins/-uns/-trns (not **-yns/-wns/-trns, p. 72, 74—75); and cases with Stang’s Law like the 
acc. sg. *dyewm (not **dyewm) > *dyem (p. 72). All this means that syllabic resonants 
were really separate phonemes in the last phase of PIE and not just allophones of their 
asyllabic variants (although still regularly alternating with them in many cases). In any 
case, the reflexes of syllabic and asyllabic resonants are completely divergent and have 
to be analyzed separately. 

Asyllabic resonants 

Asyllabic resonants were mostly stable in IE languages. They remained unchanged in 
most languages and positions. 

*m 

PIE *men- ‘think’ > Ved. manas- ‘mind’ (YAv. manah- ‘thinking’), Gr. psvoq ‘spirit’, 
Lat. memim ‘ I remember’ (with root reduplication, => Eng. memory), OCS mbneti 
‘to think’, Lith. menu ‘I remember’, Goth, man ‘to think’, OIr. muinithir ‘thinks’, 
Arm. imanam ‘I understand’, Toch. B maftu ‘desire’ 

PIE *med h u ‘mead’ > Ved. madhu (YAv. madu- ‘berry wine’), Gr. psflu ‘wine’, OCS 
medv ‘honey’, Lith. medus ‘honey’, OEng. me(o)du (Eng. mead), OIr. mid (Gaul. 
Medu-genos, CIb. Mezu-kenos (pers. names)), Toch. B nut ‘honey’ 

PIE *med h y- ‘middle’ > Ved. madhyas (GAv. ma'dya-), Gr. peo(a)oq (=t> Eng. Meso- 
zoic), Lat. medius (=> Eng. medium), OCS loc. pi. mezdaxu ‘alleys’ (PS1. *medja 
‘boundary’), Lith. mede ‘forest’ (< *‘forest as a border’), Goth, midjis (Eng. 
mid(dle)), OIr. mide, Arm. mej. Alb. midis ‘in the middle of’ 

Generally speaking, PIE *m does not change in any of the languages. The typologi- 
cally frequent (cf. Arabic, Finnish) change of final -m# > -n# occurs in many languages, 
e.g., in Greek, Hittite, Germanic, Armenian, Old Prussian, Phrygian, and many Celtic 
languages (cf. PIE neuter nom. sg. *-om > OInd. -am, Lat. -um but Gr. -ov, Hitt, -an, 
PGmc *-an, OPruss. -an, p. 65). 

*n 

PIE *neb h os ‘cloud, cloudy sky, mist’ > Ved. nabhas ‘cloud, mist’ (GAv. nabah- 
‘cloud’), Gr. vscpoq ‘cloud’, Lat. nebula ‘mist, fog’ (=> Eng. nebula), OCS nebo 
‘heaven’ (Slav, also ‘sky’), Lith. debesis ‘cloud’ < *nebesis, OHG nebul ‘fog, 
mist’, Hitt, nepis ‘sky, heaven’, OIr. nem ‘heaven, sky’ 
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PIE *new(y)os ‘new’ > Ved. nav(y)as (YAv. nauua-), Gr. veoq (=> Eng. neo-), Lat. 
nouus, OCS novb, Lith. naujas, Goth, niujis (Eng. new), Hitt, newa-, OIr. nuae 
(Gaul. Nouio-dunum (toponym)), Arm. nor, Toch. B huwe 
PIE *nog wh ts ‘night’ (p. 54) > Ved. nakt- (YAv. upa-naxtar- ‘almost night’), Gr. vu^, 
Lat. nox (=> Eng. nocturnal), OCS nostb, Lith. naktis, Goth, nahts (Eng. night), 
Hitt, nekuz mehur ‘at night’, OIr. innocht ‘tonight’ (Gaul, tri-nox(tion) ‘three 
nights’), Alb. nate, Toch. A nokte ‘at night’ 

Generally speaking, PIE *n does not change in any of the languages, but cf. late *VnV 
> r in Tosk Albanian (p. 564-565) and Slavic nasal vowels (p. 521). In cases like *pepk w e 
(p. 24), PIE probably had an allophone *[q] before velars like most languages. 


*r 


PIE *swesor ‘sister’ > Ved. svasar- (YAv. x'aijhar-), Gr. (Hesychius) eop ‘daughter, 
first cousin’, Lat. soror (=> Eng. sorority), OCS sestra, Lith. sesuo - gen. sg. sesers, 
Goth, swistar (Eng. sister), OIr. siur (Gaul, instr. pi. suiorebe), Arm. tovr, Toch. 
B ser 

PIE *treyes ‘three’ > Ved. travas (YAv. dravo), Gr. xpeiq (=> Eng. tri-logy), Lat. tres, 
OCS trije, Lith. trys, Goth .preis (Eng. three), Hitt, teri-, OIr. tri (Gaul. In'-), Arm. 
ereld, Alb. tre, Toch. B trai 

PIE *mer- ‘disappear > die’ > Ved. maranti ‘they die’ (YA v.fra-mirvete ‘dies’), Gr. 
popxoq ‘mortal’, Lat. morior ‘I die’ (=> Eng. mortal), OCS mreti ‘to die’, Lith. mirti 
‘to die’, Goth, maurpr (Eng. murder), Hitt, merzi ‘disappears’, Arm. meranim ‘I 
die’ 

Since it had a syllabic variant (*r - p. 35), PIE *r was originally phonetically probably 
a trill (as it still is, e.g., in Slavic). Generally speaking, PIE *r does not change in any of 
the languages - except phonetically to a different type of rhotics (rarely, one finds an / as a 
reflex in some Indo-Iranian languages, p. 206, 271). It is possible that there was no initial 
*#r- in PIE (as in Proto-Uralic, most non-IE Caucasian languages, Korean, etc.), which 
would be preserved as a phonotactic constraint in Old Anatolian languages, Armenian, 
Phrygian, and Greek (for Gr. #p- [hr] < *wr-, *sr-, cf. p. 290). If so, an initial laryngeal 
would have to be reconstructed even when there is no direct attestation of it, e.g., PIE 
*HrotHos ‘wheel’ (Ved. rathas ‘chariot’, Lat. rota, Lith. ratas). 


*1 


PIE *Ieyg h - ‘ lick’ > Ved. redhi (younger), ledhi Ticks’ (YAv. riz-, Modern Pers. 
listan ‘to lick’), Gr. kei^co ‘I lick up’, Lat. lingo ‘I lick’, OCS lizaase Ticked’, Lith. 
liezti ‘to lick’, OEng. liccian ‘to lick’ (Eng. lick), OIr. ligid Ticks’, Ann. lizem 
‘I lick’ 

PIE *seh,wl- ‘sun’ (cf. p. 78) > Ved. siiiyas (GAv. huuarS), Gr. pkioq (=> Eng. 

helium), Lat. sol (=> Eng. solar), OCS sfanbce, Lith. saule, Goth, sauil, MW haul 
PIE *h 3 el(n)- ‘elbow’ > Ved. aratnis (YAv. frara&ni-drajah- ‘elbow (length)’), Gr. 
cbk£vr|, Lat. ulna, OCS lakbtb, Lith. alkiine, Goth, aleina ‘cubit’ (Eng. ell, el-bow), 
OIr. uilen ‘elbow, angle’ (Gaul. Olina (river name)), Arm. olok° ‘shin’, Alb. (Gheg) 
llane ‘fore-arm’, Toch. B alviye ‘palm of the hand’ 
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Generally speaking, PIE *1 is preserved in most languages. The only exception is 
Indo-Iranian, where it yields r in most languages (and in most words), but not always 
(p. 206, 271). 


*w 

PIE *weg h - ‘drive, convey’ > Ved. vahati ‘drives’ (YAv. vazanti ‘they puli’), Gr. 
o%oc, ‘carriage’ (Myc. wo-ka ‘wagon’, Pamph. fs%stco ‘should bring’, Cypr. 
‘brought’), Lat. uehd ‘I bear, carry, convey’ (=> Eng. vector, vehement), OCS vezg 
‘I drive, lead, convey’, Lith. vezti ‘to lead, convey’, Goth, ga-wigan ‘to move’ 
(Eng. weigh', Eng. wagon <= Dutch), OIr .fen ‘wagon’ (W gwairi), (?) Alb. vjedh 
‘rob, steal’, Toch. A wkdm ‘way, manner’ 

PIE *wert- ‘turn, spin’ > Ved. vartate ‘turns’ (YAv. var’ta- ‘turn’), (?) Gr. (Dor., Hes- 
ychius) paidvav Topuvr|v ‘stirrer, ladle’, Lat. uerto ‘I turn’ (=> Eng. con-vert ), OCS 
vratiti sg ‘to return, turn’, Lith. versti ‘to turn over’, Goth, wairpan ‘to become’, 
OIr .frith- ‘against’ (cf. Lat. uersus ‘against, towards’ => Eng. ad-verse), Toch. A 
wart- ‘throw’ 

PIE *welh,- ‘want’ > Ved. vcirat ‘wants, chooses’ (GAv. var'nte ‘chooses’), Gr. \<h 
‘I want/wish’, Lat. uolo ‘I wish/want’ (=> Eng. volunteer), OCS veleti ‘to want’, 
OLith. velti ‘to let, allow’, Goth, wiljan ‘to want’ (Eng. will), Gaul, uelor ‘I wish’ 
(MW gwell ‘better’) 

Generally speaking, PIE *w is preserved in many languages in most positions. How- 
ever, the original PIE bilabial semivowel pronunciation (like Eng. w) is preserved only in 
some languages (Classical Latin but not Romance, some Ancient Greek dialects, English 
but not most Germanic languages, etc.), while elsewhere it usually changes to a fricative 
v (Lithuanian, most Slavic languages, Albanian, most Germanic and Romance languages, 
etc.). A labiodental approximant [u] as a reflex is less common (as in some Dutch and 
South Slavic Stokavian dialects, Slovene, as one of the realizations in Hindi, etc.). 

Mycenaean still has a w, and so do some Greek dialects (written as p); this p disappears 
early in Attic-Ionic (p. 289, 292). PCelt. *w is preserved in Gaulish and Celtiberian (later 
changed in Insular Celtic, p. 362, 365-366). In Armenian, -w is preserved word-finally 
(cf. Ann. naw ‘ship’, Lat. nauis < PIE *neh 2 w-), while initially it changes to g (via *gw, 
similar to Welsh), cf. Ann. egit ‘found’, Ved. avidat ‘found’ < PIE *widet (p. 429). In 
Albanian, PIE *w is preserved initially, and otherwise it disappears (p. 568). 


*y 

PIE *yuh 3 s- ‘soup, broth’ > Ved. yds, Gr. ^upr| ‘leaven’, ^ropoq ‘soup’, Lat. ius ‘broth, 
sauce’, CSA.juxa, Lith. arch, fuse ‘fish soup’ 

PIE *h 2 yowHn- ‘young’ > Ved. yuvan- (YAv. yauua ‘youth’), Lat. iuuenis ‘young 
man’ (=> Eng. juvenile, junior), OCS jum>, Lith .jaunas, Goth.y'uggs (Eng. young), 
OIr. oac (MW ieuanc, Gaul. Iouinc-illus (pers. name)) 

PIE *yek"r ‘liver’ (p. 78) > Ved. yakrt (YAv. vakara), Gr. rjnap (=> Eng. hepatitis), 
Lat. iecur, CS ikra ‘roe’ (< PS1. *jbkra), OLith. jeknos 


Generally speaking, PIE *y is often preserved but not universally. In Greek and Latin, 
intervocalic *y disappears (PIE *treyes ‘three’ > Gr. ipeiq [tres], Lat. tres), *-Cy- has 
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various outcomes (p. 293), and in initial position one finds either Gr. /h-/ or which 
is an old and disputed problem - some claim that one is the outcome of *y- and the other 
of *Hy- (since there are no “prothetic vowels” - p. 48 - before *y), but which reflex is 
derived from which source is also disputed. PCelt. *y disappears completely in Old Irish 
(p. 362, 365-366). In Armenian, intervocalic *y disappears (PIE *treyes > Ann. erelf), 
*-Ry- yields Rj (cf. Arm. olj, OIr. uile < PIE *h 3 elyos ‘whole’), and the reflex of initial 
*y- is not completely clear (p. 429). In Albanian, medial *y is lost (PIE *treyes > Alb. 
tre ), while initial *y- yields gj-, cf. Alb. gjesh ‘gird’, Gr. ijcoGifip ‘a warrior’s belt’, Lith. 
juosta ‘waist-band, belt’ < PIE *yeh 3 s- (p. 568). 


TABLE 1.4 THE REFLEXES OF PIE ASYLLABIC RESONANTS 
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y 

y 
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j 

j 

j 

y 

0 
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Syllabic resonants 

Syllabic resonants *m, *n, *1, *r are completely unstable diachronically in IE and change 
everywhere (the sole exception being PIE *r > OInd. r). Their reflexes are of the follow- 
ing types: 

a) the reflex is a syllabic resonant (*R > R) - only in PIE *r, *1 > OInd. r 

b) the resonant obtains a preceding vowel (*R > VR) - e.g., PIE *R > Gmc *uR 

c) the resonant obtains a following vowel (*R > RV) - e.g., PIE *r > Gr. pa (also ap) 

d) the resonant becomes a vowel (*R > V) - e.g., PIE *m, *n > OInd./Alb. a, Gr. a 

The type b) is the most frequent by far. In some languages/groups, all resonants have the 
same type of development (cf. Gmc *umAin/ul/ur). In others, the type of development of 
*m/n differs from that of *l/r (cf. Lat. em/en but ol/or). Reflexes in *-CRHV- (before a 
disappearing laryngeal) are often different than in *-CRC(V)- (p. 39). As already said, *i 
and *u phono logically acted just like syllabic resonants. 


PIE *deKm ‘ten’ > Ved. dasa (YAv. dasa ), Gr. Ssica (=> Eng. decade), Lat. decern (=> 
Eng. December), OCS desqtb, Lith. desimt, Goth, taihun (Eng. ten), OIr. deich" 
(Gaul, decan-). Arm. tasn < *tasan 

PIE *(d)Kmtom ‘ hundred’ > Ved. satam (YAv. satam), Gr. eKaxov, Lat. centum (=> 
Eng. cent), OCS Sbto, Lith. simtas, Goth, hunda (Eng. hundred), OW cant (CIb. 
kantom, OIr. cet), Toch. A kdnt 

PIE *g"mtis ‘walk’ > Ved. gatis (YAv. a‘pi-ga‘ti- ‘entrance’), Gr. pdaiq ‘step’ (=> Eng. 
basis), Lat. uentio ‘coming’ (=> Eng. con-vention), Goth, ga-qumps ‘gathering’ 
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*n 

PIE *mntis ‘thought, mind’ > Ved. matis ‘thought’ (Av. ma‘ti-), Gr. duio-poiToq ‘spon- 
taneous’ (=» Eng. automatic), Lat. (mens) - gen. sg. mentis ‘mind’ (=> Eng. mental), 
OCS pa-mqtb ‘memory’ (Slav, also ‘mind’), Lith. mintis ‘thought’, Goth, ga-munds 
‘remembrance’ (Eng. mind) 

PIE *h|iiewn ‘nine’ > Ved. nava (YAv. nava), Gr. ewsa, Lat. nouem < *nouen (by 
analogy to decern, => Eng. november), OCS devgtb, Lith. devintas ‘ninth’ (OPrass. 
newTnts), Goth, niun (Eng. nine), OIr. noi" < PCelt. *nowan, Arm. inn < *inan 

PIE *h,neh 3 mn ‘name’ (p. 48) > Ved. nama (YAv. nqma), Gr. ovopa (=> Eng. ono- 
mastics), Lat. nomen (=> Eng. nominal), Hitt, laman 

The reflexes of *m and *n are always parallel. Final -m changes to -n in some lan- 
guages as usual (p. 31), and assimilatory m > n before dentals is also common in some 
languages. In Indo-Iranian, Albanian, and Greek, the reflex is a/a (cf. Alb. shta-te ‘seven’ 
< *s(ep)ta- < PIE *septm). Latin has em/en (initially ma-, na-); Germanic *um/un; Arme- 
nian, Hittite, and Celtic am/ an (with further development in separate Celtic languages - 
p. 358-359); and Tocharian dm/dn. In Balto-Slavic, the reflexes are *im/in (less fre- 
quently and in unclear conditions *um/un). Lithuanian preserves im/um, in/un, and in 
Slavic *im/in > g, *um/un > g (*um can less often and in unclear conditions also yield r>, 
cf. ‘hundred’ above) in closed syllables. 


PIE *wlk w os ‘wolf’ > Ved. vfkas (YAv. vohrka-), OCS vlbkb < PS1. *vtlk'b, Lith. 
vilkas, Goth, wulfs (Eng. wolf), (?) Hitt, walkuwa- (meaning something negative), 
Toch. B walkwe 

PIE *mldus ‘soft’ > Ved. mrdus, Gr. pLaSeL; ‘powerless’, Lat. mollis < *moldwis, 
MW blydd < PCelt. *mlidos 

PIE *plth,us ‘broad, wide’ > Ved. prthus (GAv. por^Hu-), Gr. jiLorruq, OBret. litan 
‘broad’ (Gaul. Litana (toponym)), Arm. v-alf 


*r 

PIE *mrt(w)os ‘dead’, *mitis ‘death’ (cf. *mer-, p. 32) > Ved. mrtas ‘dead’ (YAv. 
mor'ta-), Gr. (Aeol.) Ppoxoq ‘mortal’, Lat. mors ‘death’ (=> Eng. mortal), mortuus 
‘dead’, OCS mrbtvb ‘dead’ < *mi>rtv'b, Sb-mrbtb ‘death’ < *ST>-mbrtb, Lith. mirtis 
‘death’, Goth, maurpr (Eng. murder). Arm. mard ‘man’ < *‘mortal’, MW marw 
‘dead ’ 

PIE • h.rtkos > *h 2 rktos [-kp] ‘bear’ (p. 53) > Ved. rksas (YAv. arsa-), Gr. apictoq 
(=> Eng. Arctic), Lat. ursus, Lith. irst\’d ‘bear-den’, OIr. art (Gaul. Artio (name of 
a deity)), Hitt, hartaggas, Ann. arj, Alb. ari 

PIE *k"rmis ‘wonn’ > Ved. kfmis, OCS crwb < *ci>rvi>, Lith. kirmis, OIr. cruim 
< PCelt. *k w rimis, Alb. krim(b) (cf. PIE *wnnis > Goth, waurm ‘snake’ (Eng. 
worm)) 

The reflexes of *1 and *r are always parallel. In Old Indie we find r, in Avestan mostly 
or 3 (p. 267). In Greek (Attic), we find aWka and ap/pa in unclear conditions (cf. PIE 
*krd-, p. 21); in Latin or/ol (initially ra-, la-). Balto-Slavic has *ir/il (less frequently 



36 MATE KAPOVIC 


*ur/ul) - unchanged in Lithuanian; Slavic reflexes are *br/br and *bl/bl (p. 480), pre- 
served in Old Russian but written as <rb/rb> and <lb/lb> indiscriminately in OCS. Ger- 
manic has *ul/ur (> ullaur in Gothic, p. 391). In Celtic we find *ri/li before stops and *m, 
and *ar/al otherwise (p. 362). Armenian and Hittite have ar, al, Albanian usually ri and 
li, and Tocharian al, dr. 


PIE *misd h - ‘reward, pay’ > Ved. rmdham < *mizdham ‘spoils, gain’ (MPers. mizd), 
Gr. piofloq ‘hire, pay, reward’, OCS nibzda ‘pay’, Goth, mizdo 
PIE *h,esmi ‘am’ > Ved. asmi (YAv. ahmi), Gr. eipi, OCS jesnib, Hitt, esmi 
PIE *-is (nom. sg., /-stems, p. 72) > Ved. -is, Gr. -tc, Lat. -is, OCS -b, Lith. -is, 
Hitt, -is 


*u 


PIE *yugom ‘yoke’ > Ved. yugam (GAv. vaogat ‘harnesses’), Gr. ijuyov, Lat. iugum 
(=> Eng. conjugation ), OCS igo < *jbgo, Lith. jiingas, Goth .juk (Eng. yoke), Mir. 
cuing < *kom-yungi- (Gaul. Ver-iugus (pers. name)), Hitt, iukan, Ann. luc 
PIE *snusos ‘daughter-in-law’ > Ved. snusa, Gr. (Horn.) vuoc, Lat. nurus, CS snbxa, 
OHG snura, Ann. nu 

PIE *supnos ‘sleep’ > Pali supina- ‘dream’, Gr. uwvoq (=t> Eng. hypnosis), OCS Sbnb, 
Alb. gjume 

In most languages/families, PIE *i and *u are preserved (at least in the oldest stages). 
Greek u originally had the value [u] but changed to [ii] later in Attic (p. 295). In Slavic, 
*u > b and *i > b (originally probably back and front very short, centralized/schwa-like 
vowels respectively). In Gothic, *u changes to ait [o] (written originally just as <au>, the 
same as the diphthong [aw]) before h, Iv, r. In Tocharian, both *i (also *e) and *u yield a, 
but a from *i/e causes palatalization (p. 455, 459). 


TABLE 1.5 THE REFLEXES OF PIE SYLLABIC RESONANTS 
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Syllabic resonants and laryngeals 

In traditional IE linguistics, PIE reconstruction included long syllabic resonants *m, *n, 
* r, *1, *1, and *u (in *CRC position). After the discovery of laryngeals, it turned out that 
these were in fact *mH, *nH, *rH, *1H, *iH, and *uH. One has to distinguish two basic 
positions of *RH - one is the pre -consonantal (*CRHC) and the other is the pre -vocalic 
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(*CRHV). In *CRHC, what one gets in IE languages (if the reflexes are not the same as 
those of the regular syllabic resonants) are sequences like RV, VR, or, less frequently, 
VRV. In *CRHV, the reflexes are everywhere of the VR type. We deal first with the 
*CRHC type. 


*CRHC (pre-consonantal position) 

*mH 

PIE *dmh 2 tos ‘ tamed’ > Ved. dantas (with analogical -/?-), Gr. a8apa(o)xoq ‘uncon- 
quered’, aSpqxoq ‘unbroken’ 


*nH 

PIE *gnh,tos ‘bom’ > Ved . jatas (YAv. zata-), Gr. Kaai-yvqxoq ‘brother’, Lat. (g) 
natus, Goth, airpa-kunds ‘born of the earth’, Gaul. Cintu-gnatus (‘first-bom’ (pers. 
name)) 

PIE *gnh,tos ‘known’ > Gr. yvcoxoq, Lat. gnarus ‘knowing’ (< *gnh 3 ros), Lith. pa-zin- 
tas, Goth, -lamps, OIr. gnath 

There are few words with *nH/mH (mostly *-tos participles of verbs of *C VRH- type), 
especially with *mH. In Vedic, a is the reflex (though this is not completely certain for 
*mH). In Greek, the reflexes depend on the laryngeal and possibly (though disputably) 
on stress position. In unaccented position, *mh[ > pq, *mh 2 > poi (> Att. pq, p. 295), 
*mh 3 > pro and *nh, > vq, *nh, > voi (> Att. vq), *nh 3 > vco. When stressed, the reflexes 
seem to be epe/apa/opo and Eve/dva/ovo. Cf. pairs like wo^uKpqxoq ‘wrought with much 
toil’ - Kapaxoq ‘toil’ (*kmh,tos) or Kaoiyvqxoq and yEveaiq ‘origin’ (*gnh,tis). But forms 
like yvqaioq ‘belonging to the race’ would then have to be secondary. In Germanic, the 
reflexes are identical to those of *m/n, and the same goes for Balto-Slavic (disregarding 
the prosodical differences, see below). Latin has ma, na, and so does Celtic (OIr. ma, na; 
cf. here also Toch. <ma>, <na>), while in Armenian we find ama, ana (somewhat similar 
to Greek). 

*1H 

PIE *plh,nos ‘ full’ > Ved. pumas (GAv. psr’na-), OCS pl-bm, (BCMS pun), Lith. 
pilnas, Goth .fulls (Eng. full), OIr. Ian 

PIE *(h,)wlh ] neh 2 ‘wool’ > Ved. urna (YAv. var’na-), Gr. A.qvoq, Lat. lana, OCS 
vl-bna (BCMS vuna), Lith. vilna, Goth, wulla (Eng. wool), MW gwlan 

PIE *dlh,g h os ‘long’ > Ved. dirghas (GAv. dar'ga-), OCS dl-bgb (BCMS dug), Lith. 
ilgas (< *dilgas), Goth, tulgus ‘firm’, Alb. gjate (< *glag(V)-t-) 


*rH 

PIE *grh 2 no- ‘old, ripe; grain’ (cf. *gerh 2 -, p. 22) > Ved .jTrnas ‘old’ (YAv. zar’ta- 
‘decrepit’), Lat. granum ‘grain’ (=> Eng. grain), OCS znno ‘grain’ (BCMS zrno), 
Lith. zirnis ‘grain’, Goth, kauni ‘grain’ (Eng. com), OIr. gran ‘grain’ 
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PIE *krh 2 sro(n) ‘ hornet’ > Lat. crabro, CS snsenu, Lith. sirse, OEng. hyrnetu (Eng. 
hornet) < PGmc *xurzn- 

PIE *strh 3 t/nos ‘laid down’ > Ved. stirnas, Gr. oipcoioq, Lat. stratus (=> Eng. 
street) 

PIE *rH/lH merge and yield ir/ur in Vedic (fir usually after labials - p,ph, bh, m, v) and 
ar 1 in Avestan. In Greek once again we find A,r|/pr|, Xd (> Att. A,r|)/pd, tao/pco and eXe/epe, 
dXa/dpa, oXo/opo. In Germanic *r/l = *rH/lH; the same is true for Balto-Slavic except 
on prosodic level - cf. BCMS pun, Lith. pilnas, Latv. pilns for PIE *plh[nos (p. 37), 
but BCMS vuk, Lith. vilkas, Latv. vilks for PIE *wlk w os (p. 35). Latin and Celtic have 
la, ra (written as la, ra in OIr. - p. 362 - cf. also Toch. <la>, <ra>), while in Armenian 
we find ala, ara. In Albanian we find possible al, ar reflexes (cf. Alb. pare ‘first’ < PIE 
*prh 2/3 wo-) but also la, ra (cf. gjate above). 

*iH 

PIE *wiHros ‘man’ > Ved. viras (Av. \nra-), Lat. uir (with shortening, cf. p. 357), 
Lith. \yras, Goth, wair (with shortening, cf. Eng. were-wolf), OIr .fer (with short- 
ening), Toch. A wir ‘young’ 

PIE *g"ih 3 wos ‘alive’ >Ved.jivas, Gr. (3ioq ‘life’ (with pre -vocalic shortening, => Eng. 
bio-logv, etc.), Lat. uTuus (=> Eng. vivisection), OCS zivt, Lith. gvvas, Goth, qius 
‘alive’ (with shortening, Eng. quick), OIr. beo (Ogham BIVI-TI, with shortening) 

PIE *g w riHweh 2 ‘neck’ > Ved. griva (YAv. griva), OCS grivbna ‘necklace’ (BCMS 
griva ‘mane’), Latv. griva ‘(river) mouth’ 


*uH 

PIE *suHnus ‘ son’ > Ved. s units (GAv. hunu- ‘offspring’), Gr. dial, uiuq (< *suHyus), 
OCS sym>, Lith. sunus (ace. sg. siinq), Goth, sunus (with shortening, Eng. son) 

PIE *muHs ‘mouse’ > Ved. mits-, Gr. piic, Lat. mus, OCS mysb (BCMS mis), OEng. 
mus (Eng. mouse), Ann. mukn. Alb. mi 

PIE *suHs ‘ sow, swine’ > Ved. sukaras ‘wild boar’ (YAv. hu- ‘swine’), Gr. uq, Lat. 
sus ‘pig, sow’, OEng. sit ‘sow’, Alb. thi ‘pig’ (with secondary th-), Toch. B suwo 
‘pig, hog’ 

Traditionally reconstructed *1 and *u can be reinterpreted as *iH and *uH in almost 
all cases, while the only real instances of long *T and *u appear in cases of monosyllabic 
lengthening (p. 54—55). Generally speaking, long *1 and *u remain unchanged in IE lan- 
guages. Long 7 is written as <y> in Lithuanian, as <ei> in Gothic, and as <i> in Old Irish. 
Old *u changes to y in OCS (most likely a id-type diphthong originally - thus written 
as a combination of and i in Glagolitic/Cyrillic script), and to i in Albanian in final/ 
only syllable (probably y otherwise). In Germanic, Italic, and Celtic, *1 and *u shorten 
in pretonic position in many cases (Dybo’s Law). Distinctive length is lost in Armenian, 
Albanian, and Tocharian. According to some linguists (cf. p. 431), in some cases in Greek 
(and Tocharian and Armenian) there is a so-called laryngeal breaking of *i/uh, /3 , where 
it is not *i/u that is syllabic but the laryngeals (*y/wh 2/3 ) - cf. Gr. ^co(p)6<; ‘alive’ (=> Eng. 
zoo-logv) from *g w yh 3 wos (if not from *g w yeh 3 -?). The reflexes in Greek are thus *y 
(palatalization)/^ + a > r| or co (also Toch. yd, wa). 
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TABLE 1.6 

THE REFLEXES OF PIE CRIIC 









PIE 

OInd. 

Av. 

Gr. 

Lat. 

ocs 

Lith. 

Goth. 

OIr. 

Arm. 

Alb. 

Toch. 

*mH 

a 

a 

pr|/pa > (ir)/|ico, 8(i8/ajia/6(io 

ma 

?,<? 

im/um 

um 

ma 

ama 


ma 

*nH 

a 

a 

vrj/va > vrj/vco, eve/ava/ovo 

nd 

?,<? 

in/un 

un 

nd 

ana 


nd 

*1H 

ir, ur 

ar 3 

X,r|/X,ot > AjjAxo, tXdbXaloko 

la 

hJlh 

il/ul 

ul 

la 

ala 

al,la 

la 

*rH 

ir, ur 

ar 3 

pr|/pa /pco, epc/apa/opo 

ra 

rb/rb 

ir/ur 

aur 

ra 

ara 

ar, ra 

ra 

*iH 

T 

i 

T 

T 

i 

y 

i 

l 

i 

i 

7 

*uH 

u 

u 

i3 

u 

y 

u 

u 

u 

u 

ly 

u 


*CRHV (pre-vocalic position) 

In this position, syllabic resonants are not in the same syllable with laryngeals, so this is 
not really the case of “long syllabic resonants”, and reflexes in most languages are dis- 
tinct from both *CRI 1C and *CRC types: 


TABLE 1.7 THE REFLEXES OF PIE *CRHV 


PIE 

OInd. 

Av. 

Gr. 

Lat. 

ocs 

Lith. 

Goth. 

OIr. 

Arm. 

Toch. 

*mH 

am 

am 

ap 

em 

bm/wn 

im/um 

um 

am 

am 

am/ am 

*nH 

an 

an 

av 

en 

bn/bn 

in/un 

un 

an 

an 

an/an 

*1H 

ir/ur 

ar 

aX 

al 

bl/bl 

il/ul 

ul 

al 

al 

dl/al 

*rH 

ir/ur 

ar 

ap 

ar 

br/br 

ir/ur 

aur 

ar 

ar 

ar/ar 


Cf. PIE *tnh,us (p. 16), *g w rH- (p. 25) or PIE *g w rh,us ‘heavy’ > Ved. gurus, Gr. Papin;, 
Goth, kaurus. The -ur- in Ved. gurus (cf. also ur after labials - p. 38) is often taken as a trace 
of the old labiovelar in Ilr., though there are counterexamples (cf. girls - p. 25). 


PIE VOWELS 

PIE had the following vowels (not counting the diphthongs): *e, *e, *o, *o, *a. The pho- 
nemes *i and *u were also vowels but phonologically speaking, at least originally, syllabic 
glides (p. 30-31) (phonetically speaking, j/i and w/u are generally the same except for the 
difference in tongue hight/syllabicity). Vowel *a was a marginal phoneme in PIE (p. 42), 
with a position somewhat similar to that of *b (p. 16-17). Some IE-ists even believe 
that there is no need to reconstruct a PIE *a. What seems to point to long *a is actually 
always *eh 2 (p. 45). PIE long *e and *o were rather rare, limited to certain morphological 
(p. 61-62) and derivational forms and contractions (p. 65). The reflexes adduced below 
are sometimes relevant only for non-final and/or accented syllables (e.g., in Latin). 

*e 


PIE *(h,)nepots ‘grandson, nephew’ > Ved. napat ‘grandson’ (YAv. napa ‘grand- 
son’), Gr. dve\|no<; ‘cousin’, ven o5ec, ‘descendants’, Lat. nepos ‘grandson, descen- 
dant’ (=> Eng. nepotism), CS netii ‘nephew’, OLith. nepuotis ‘grandson’, OEng. 
nefa ‘grandson, nephew’ (Eng. nephew <= French), OIr. necht ‘niece’, Alb. nip 
‘grandson, nephew’, mbese ‘granddaughter, niece’ 
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PIE *der- ‘tear, flay’ > Ved. dar- ‘split, blow up’, Gr. Sepco ‘I flay’ (Seppa ‘(flayed) 
skin’ => Eng. dermatology), OCS denp ‘I flay’ (Slav, also ‘tear’), Lith. dial, derii ‘I 
flay’, Goth, dis-tairan ‘to tear asunder’ (Eng. tear). Arm. terem ‘I flay’, Toch. AB 
tsar- ‘be separated’ 

PIE *b h er- ‘bear, carry’ > Ved. bharati ‘bears, brings, keeps’ (YAv. bara‘ti ‘brings’), 
Gr. (pep© ‘I bear, carry’, Lat .fern ‘I bear, carry’, OCS berQ ‘I gather, select’, Goth. 
bairan ‘to bear, carry’ (Eng. bear), OIr. berid ‘carries, brings’, Ann. berem ‘I bear, 
bring’, Alb. bie ‘carry, bear, bring’, Phryg. a(3(3ep£T ‘brings’, Toch. AB par- ‘bear 
(away), carry (off)’ 

PIE *e remains unchanged in Greek, Latin (in the fonnerly stressed first syllable, 
where there is no vowel reduction - p. 328), Balto-Slavic, Celtic, Armenian, and Anato- 
lian (cf. Hitt, nepis with lengthening under accent -p. 31), except in specific conditions. 
In Indo-Iranian, PIE *e causes palatalization (p. 206) and then merges with *o and *a into 
Ilr. a. In Germanic, PIE *e yields *e or *i (p. 395) - in Gothic it is always i (cf. Goth, itan 
‘to eat’ but ONor. eta ‘to eat’, Lat. edo ‘I eat’ < PIE *h,ed-), merging with PIE *i (the new 
i changes to ai [e] before h, Iv, r). In Albanian the reflexes ( e,ja , ie,je, i) depend on the 
phonetic surroundings but are not completely clear (p. 561). PIE *e merges with *i into 
’a in Tocharian (p. 455). 

*o 


PIE *potis ‘lord, husband’ > Ved. pads, paty- (GAv. paid-), Gr. took; ‘husband’, Lat. 
potis ‘(cap)able’, OCS gos-podb ‘lord’, lAth.pats ‘husband, self’, Goth, brup-faps 
‘bridegroom’, Toch. B pets ‘husband’ 

PIE *domh 2 os ‘house, home’ > Ved. damas ‘house’, Gr. 56goc ‘house’, Lat. domus, 
OCS donrb, Lith. namas ‘house’ < *damas (assimilation) 

PIE *g h ostis ‘stranger, guest’ > Lat. hostis ‘stranger, enemy’ (=> Eng. hostile), OCS dat. 
pi. gostemb ‘to guests’, Goth, gasts ‘stranger, guest’ (Eng. guest <= ONor. gestr), (?) 
Lep. Uvamo-kozis (‘having the highest guests’ (pers. name)), (?) Luw. kasi- ‘visit’ 

IE languages can be divided into two groups when it comes to the reflex of PIE *o. 
In one group, PIE *o generally remains o - Greek, Latin (in the first syllable - p. 328), 
Celtic, Armenian (in most cases, p. 424), Lusitanian, etc. In the other languages, PIE 
*o yields a (and thus merges with the reflex of PIE *a/h,e) - in Indo-Iranian (*e/o > a), 
Balto-Slavic (in Slavic, BS1. *a shifts back to o), Germanic, most of Anatolian (including 
Hittite, but not in Proto-Anatolian and all Anatolian languages - p. 176), Albanian (cf. 
nate, p. 32), Messapian, etc. In Indo-Iranian, PIE *o > a in open syllables (BrugmaniTs 
Law), cf. Ved. ddru (p. 17). In Tocharian, PIE *o yields e (Toch. B), a (Toch. A), and 
sometimes a (p. 455). 

*e 

PIE *meh 2 ter ‘mother’ > Ved. mata (GA v.pta < *ph 2 ter, p. 46), Gr. pf)TT|p, Lat. mater 
< *mater (=> Eng. maternal), Lith. arch, mote, OIr. mathir (< PCelt. *matlr), Toch. 
B macer 

PIE *weg h s- ‘conveyed’ (cf. p. 33, 96) > Ved. a-vaksur ‘they drove’, Lat. uexi ‘ I car- 
ried, conveyed’, OCS vesr> ‘I conveyed’ 

PIE *dyem ‘day sky’ (ace., cf. p. 56, 70, 72) > Ved. dvam, Gr. Zpv ‘Zeus’, Lat. diem 
‘day’ (with shortening) 
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In general, PIE *e reflects just like PIE *eh[ (p. 45). PIE *e is preserved in Greek (<r|>), 
Latin (shortened outside of the first syllable), Hittite (shortened when not accented), and 
Germanic. In Lithuanian (closed e, p. 488) and Slavic (*e, p. 522), it yields a type of long 
e. In Indo-Iranian, PIE *e (like *5 and *a) yields a. PIE *e changes into *1 in Celtic and i 
in Armenian. In Albanian the reflex is o (p. 561), and in Tocharian PIE *e merges with *o 
into e (Toch. B)/a (Toch. A), but only old *e causes palatalization (p. 455). 

*o 

PIE *Kwo(n) ‘dog’ > Ved. svS (YAv. spa), Gr. kucov, Lith. sud, OIr. cu, Ann. sun 
PIE *h 2 ekmo(n) ‘ stone’ > Ved. asma, Gr. oncpcov ‘anvil, meteoric stone’, Lith. akmud 
PIE *h 2 owyom ‘egg’ > Pers. xava, Gr. raov, Lat. ouum (=> Eng. ovulation), PS1. 
*aje (> BCMS jaje), Arm . ju (with an unexpected j-), OW ui (< PCelt. *awyo-), 
Alb. ve 

In general, PIE *o reflects just like PIE *eh 3 or *oH (p. 46). PIE *o is preserved in 
Greek (<co>), Latin, and Gennanic (cf. Goth, fotus, p. 15). Proto-Balto-Slavic *o yields 
uo in Lithuanian/Latvian and a in Slavic. In Indo-Iranian, PIE *5 (like *e and *a) yields 
a. In Celtic, PIE *5 gives *a in non-final (OIr. <a>) and *u in final/only syllables. 
Proto-Anatolian *5 yields Hitt, a (accented) or a (unaccented), cf. Hitt. pi. widar ‘water’ 
< *wedor (p. 179). In Armenian the reflex of PIE *o is u, and in Albanian it is e (p. 561). 
In Tocharian PIE *o merges with *a and *H into a (p. 455). 

*a 


PIE *dap(n)- ‘sacrificial meal, feast’ > Gr. Soottco ‘I devour, consume’, 5omdvr| ‘cost, 
expenditure’, Lat. daps ‘sacrificial meal, feast’, ONor. tafn ‘sacrifice’, Hitt, tap- 
pala- ‘person responsible for court cooking’, Ann. town ‘feast’, Toch. B tapp- ‘eat’ 
PIE *kapros ‘he-goat, boar’ > Ved. kaprth- ‘penis’, Gr. Kourpot; ‘(wild) boar’, Lat. 

caper ‘he-goat, buck’, ONor. hafr ‘he-goat’, W caer-iwrch ‘roebuck’ 

PIE *bak- ‘staff, stick’ > Gr. pdierpov (=> Eng. bacterium), Lat. baculum, (?) Lith. 
baksnoti ‘to poke’, OIr. bacc ‘billhook, angle, bend’ 

A separate reflex of PIE *a (preserved almost everywhere as a) can be seen only in IE 
languages where there is no *o > a change, i.e., no merger of PIE *a and *o. Languages/ 
branches without merger are Greek, Italic, Celtic, Annenian, Tocharian, and originally 
Anatolian (though merger occurs later in some languages). In Tocharian, PIE *a merges 
with PIE *o and *H into Toch. a. In Indo-Iranian, Balto-Slavic, Germanic, Albanian, and 
some Anatolian languages (like Hittite), PIE *a and *o merge into a. 


TABLE 1.8 THE REFLEXES OF PIE VOWELS 


PIE 

OInd. 

Av. 

Gr. 

Lat. 

ocs 

Lith. 

Goth. 

OIr. 

Arm. 

Alb. 

Toch. 

*e 

a 

a 

8 

e 

e 

e 

i 

e 

e 

eljalieljeli 

’a 

*0 

a 

a 

0 

0 

0 

a 

a 

o 

o 

a 

A a, B e 

*e 

a 

a 

T1 

e 

e 

e 

e 

i 

i 

o 

A ’a, B ’e 

*6 

a 

a 

(0 

0 

a 

uo 

0 

a 

u 

e 

a 

*a 

a 

a 

a 

a 

0 

a 

a 

a 

a 

a 

a 

*a 

a 

a 

a > r| 

a 

a 

0 

0 

a 

a 

0 

A a, B o 



42 MATE KAPOVIC 


The problem of PIE *a (and *ay, *aw) 

In some cases where the classical IE linguistics reconstructed PIE *a, one now recon- 
structs *h 2 e (p. 44), e.g., 1 sg. perf. *-h,e (p. 98) (cf. also the voc. sg. *-e(h 2 ), p. 66-67). 
The sequence *h,e- is now almost always reconstructed when it comes to initial *a-, cf. 
*h 2 ekmo(n) (p. 41), even if the laryngeal is not strictly provable - many believe that PIE 
roots, or at least nouns and verbs, always had consonantal onset, and thus a laryngeal is 
reconstructed on structural grounds. In the laryngealist reconstruction of PIE, *a is often 
reduced to a rather infrequent and slightly suspicious peripheral vowel. Some IE-ists 
even deny its existence (except as an allophone and perhaps in some loanwords) in PIE 
in general (cf. Lubotsky 1989), though this appears to be too extreme. The position of 
PIE *a (and the diphthongs *ay and *aw, p. 50-51) is somewhat similar to PIE *b (which 
is typologically strange in both cases) - it is a marginal phoneme (though this depends 
on how one goes on to reconstruct a number of specific roots), appearing often in loan- 
words and in dialectal words (attested only in a couple of branches), and often in spe- 
cific phonological contexts (e.g., near velars, cf. *kap- ‘take’ > Lat. capio ‘I take’, Goth. 
haban ‘to have’ or *kan- ‘sing’ > Lat. cano ‘I sing’, OIr. canaid ‘sings’). PIE *a, *ay, 
and *aw never appear in grammatical morphemes/affixes and are not a part of the usual 
PIE ablaut system (p. 61). However, there are a couple of forms where an *a/a quantita- 
tive alternation is traditionally reconstructed: cf. PIE *nas- ‘nose’ (Ved. gen. du. nasos, 
CS nosy, OEng. nasu) and *nas- (Ved. nom. du. nasa, Lat. nasus, Lith. nosis ), and PIE 
*sal- ‘salt’ (Ved. salildm ‘sea’, Gr. akc,, Lat. gen. sg. salis, OCS soln) and *sal- (Lat. sal, 
Latv. sal). Some would now prefer to reconstruct a *neh,s-/*nh,es- (though one could 
expect **nh,es- here) and *seh,l-/*sh 2 el- type alternation here (*nh,s- and *sh 2 l- do not 
work for Old Indie, which would yield **nis-/**sil-, p. 47) or the like (OInd. nas- could 
also be explained as secondary). Possible instances of *e/a/a are few, cf. *h,wes- (Hitt. 
huiszi ‘lives’, Ved. vasati ‘dwells’, Goth, wisan ‘to be’), which can hardly be unrelated to 
*h 2 wastu- (Gr. oigtu ‘town’) and (?) *h,wastu- (Ved. vastu ‘residence, house’) - a recon- 
struction *h,weh 2 stu-/h 2 wh 2 stu- would be difficult to relate to verbal *h,wes- (though 
* e/a/a “ablaut” is, of course, also problematic). 10 Theoretically, when the word is not 
attested in Old Indie (where syllabic *H yields i), instead of *a one could reconstruct 
*dh 2 p-, *bh 2 k-, *kh,p-, etc., but this is hardly convincing in all cases - why would it 
always be *h 2 (needed to account for Greek a, p. 47) and why would there be no ablaut in 
these words? In spite of differing opinions, one can conclude that *a (and perhaps *a) did 
exist as an infrequent phoneme in the final phase of PIE. However, it is also quite possible 
that pre-PIE had no phoneme *a (nor *ay, *aw). 


The problem of the PIE vowel system 

Traditionally, the PIE vocalic system was reconstructed as having the usual *a, *e, *f, *d, 
*u (plus diphthongs). In any case, at least phonetically speaking, the vowel system of PIE 
in its last phase was not problematic. However, taking laryngeals into account and disre- 
garding marginal phonemes (*a), syllabic glides (*i, *u), diphthongs, and length leaves 
us with *e and *o as the only phonological vowels. This kind of analysis would be typo- 
logically problematic, since rare two-vowel languages (like Abkhaz) always have /a/ and 
/a/ as phonemes, and three-vowel ones have /a/, /i/, and /u/ (like Classical Arabic), though 
they always have more allophones. To further complicate matters, the role of *e and *o 
in PIE is also suspicious, since *e is much more frequent and they are part of the same 
ablaut pattern (p. 61); i.e., they are mostly dependent on morphological and derivational 
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conditions and are rarely, if ever, used to differentiate separate noun/verb roots. Though 
there are roots occurring with *o only (like *potis or *g h ostis - p. 40), *e is usually 
thought to be the “basic” vowel in a root (especially verbal ones), which means that all (or 
almost all) forms that occur with *o also have an *a-variant of the morpheme. Although 
long-range comparison is highly controversial, it is curious that PIE vowels rarely seem 
to have any kind of role in it (the supposed correspondences are usually found among 
consonants only). These and similar facts lead some to speculative claims that pre-PIE 
perhaps had only one phonological vowel (or even none - cf. Lehmann 1993: 138), which 
is again typologically and theoretically highly problematic. 11 


Laryngeals and vocalism 

We have already seen the consonantal reflexes of PIE laryngeals (p. 30). In most cases, 
though, laryngeals left traces in vocalism by changing neighboring vowels qualitatively 
or quantitatively. In some positions in certain languages, laryngeals themselves changed 
to vowels. Thus, we analyze them in the context of vocalism. 


The influence of laryngeals on neighboring vowels 
Laryngeals influence neighboring vowels in two ways: 

a) *h 2 and *h, change the color of the preceding/following *e 

b) disappearance of laryngeals in closed syllables lengthens all preceding syllabic 
segments 

The first change is already present in PIE (where the change is still just allophonic), while 
the second one is a post-PIE process that nonetheless occurs everywhere except for some 
cases in Anatolian (p. 177). For PIE *e, the following formulas generally stand: 

*h,e > (*h,e >) *e *eh[C/# > *e 

*h 2 e > *h 2 a > *a *eh,C/# > *a 

*h,e > *h 3 o > *o *eh 3 C/# > *5 

Thus, *h, changes the preceding/following *e to *a, and *h 3 changes it to *o, while *h 3 
does not influence vowel color. Only *e (and *ey/ew) are influenced by *h, and *h 3 , 
while other vowels (*o, *e, *o + *i, *u, and other diphthongs) are not (possible origi- 
nal *aH/Ha cannot be distinguished from *eh,/h 2 e). Laryngeals that disappear in closed 
syllables - word-intemally (*-VHC-) and word-finally (*-VH#, originally perhaps only 
before a pause or in cases when the following word began with a non-syllabic segment) - 
lengthen compensatorily all preceding syllabic segments (except already long *e, *5): *e, 
*o (p. 45^16), *i, *u (p. 38), *m, *n, *1, *r (p. 37-38). Laryngeals in intervocalic position 
(*VHV) color the surrounding vowels but after that drop without compensatory length- 
ening (cf. gen. sg. *-eh,es > *-ah,as > *-aas > *-as, p. 66). 


*HV (laryngeal + vowel) 

In certain cases, an initial laryngeal (*#HV-) can be directly reconstructed (e.g., if it 
yields #/?- in Anatolian, p. 177). In others, one can infer it indirectly (see below). Yet in 
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some cases, an initial laryngeal is reconstructed because of theoretical/structural reasons 
only - e.g., in words like *h,ekwos (p. 22). Here, one could as easily reconstruct *ekwos 
(because *e- and *h,e- would reflect the same), but nowadays *h,ekwos is often recon- 
structed because it is believed that most PIE words began with a consonant (p. 42, 52). 
The combination *HV can occur word-intemally (*-C/VHV-) as well. 


*h,e 

PIE *h,esti ‘is’ > Ved. asti (Av. asti), Gr. ecm, Lat. est, OCS jestb, OLith. esti, Goth. 
ist (Eng. is), Hitt, eszi, OIr. is. Arm. e 

PIE *h,edmi ‘I eat’ > Ved. admi, Gr. sbco, Lat. edo (=> Eng. edible), OCS jamb (< 
*edmi>, with Winter’s Law lengthening - p. 481), Lith. esti ‘to eat (for animals)’, 
Goth, itan ‘to eat’ (OEng. etan > Eng. eat), Hitt, etmi 

PIE *h,el(h,)(n)- ‘deer’ > Gr. ekcupoc,, OCS jelem, Lith. elni(a)s, OIr. elit ‘doe, hind’, 
Arm. eln ‘deer-cow, hind’, Toch. Aval ‘gazelle’ 

The development of ^e is basically the same as that of *e (p. 40). In many cases (like 
*h,el(h 1 )(n)-), the initial *h,e- is reconstructed on structural grounds alone. However, this 
is not always the case: in *h,es- and *h,ed- the existence of initial *h,- can be proven. 
It is proven by the initial a- of Hitt. 3 pi. pres, asanzi < *h[Sonti (Ved. sand, p. 47, 93), 
adanzi < *h,donti, etc. The initial laryngeal is also indicated by the lengthened negative 
a- in Ved. ptcp. a-sat- ‘which is not’ < PIE *n-h,snt-. 

*h 2 e 

PIE *h 2 erh 3 - ‘crush > plough’ > Gr. apoco ‘I plough’, Lat. aro ‘I plough’, OCS orati 
‘to plough’, Lith. arti ‘to plough’, Goth, arjan ‘to plough’, OIr. airid ‘ploughs’, 
Hitt, harm- ‘grind, crush’, Ann. (h)arawr ‘plough’, Toch. B are ‘plough’ 

PIE *h 2 eks- ‘axle, axis’ > Ved. aksas (YAv. asa- ‘shoulder’), Gr. a^cov, Lat. axis, OCS 
osb, Lith. asis, OHG ahsa, MW echel (OIr. ais ‘back’) 

PIE *h,eK(r)- ‘sharp’ > Ved. asris ‘sharp side, corner’, Gr. aicpo<; ‘topmost, outer- 
most’, Lat. acutus (=> Eng. acute), OCS ostn, Lith. astriis, OEng. awel (Eng. awl), 
Gaul. Axro-talus (‘with high forehead’ (pers. name)), Ann. aseln ‘needle’, Toch. 
B akwatse 

Although *h,e is usually written for (mor)phonological reasons, this was most likely 
phonetically *h 2 a already in the last phase of PIE. PIE *h 2 e reflects just like PIE *a 
(p. 41), except for Anat. ha- {-ha-) and perhaps Arm. ha- (p. 430). Initial *a- is often 
reconstructed as original *h 2 e- (p. 42), even if there are no concrete traces of a laryngeal 
(as in *h 2 eks- or *h 2 ek(r)-). 

*h 3 e 

PIE *h,ek"- ‘eye’ > Ved. aksi- (Av. as-), Gr. oaae ‘two eyes’, Lat. oculus, OCS oko, 
Lith. akis, Ann. akn, Toch. B ek 

PIE 'h,ewis ‘sheep’ > Ved. avis, Gr. (Horn.) oiq, Lat. ouis, OCS ovbca, Lith. avis, 
Goth, awistr ‘sheepfold’ (Eng. ewe), Mir. oi, Hitt./CLuw. hawi-, Arm. hoviw ‘shep- 
herd’, Toch. B awi ‘ewes’ 
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PIE *h 3 ekto(w) ‘eight’ > Ved. astau (YAv. asta ), Gr. oktco (=> Eng. octo-pus ), Lat. 
ocfo (=> Eng. October ), OCS osmb, Lith. astuoni, Goth, ahtau (Eng. eight), OIr. 
ocht" (Gaul, oxtumetos ‘eighth’), Toch. B okt 

PIE *h 3 e reflects just like PIE *o (p. 40), except for Anat. initial ha- and Arm. ho- 
(p. 430). Alternatively, instead of *h 3 e, it is possible to reconstruct *Ho (*h 2 o- when we 
have Anat. ha-, p. 177). It is impossible to distinguish original *Ho from *h 3 e. 


*VH (vowel + laryngeal) 

In most cases, the reflexes of *eh, and *eh 3 are identical to the reflexes of *e and *o 
(p. 40-41) respectively. Most instances of *a seem to originate in *eh, (but cf. p. 42). 

*ehj 

PIE *meh,n(s)- ‘ month; moon’ > Ved. mas (GAv. ma ‘moon’), Gr. pijv ‘month’, Lat. 
mensis ‘month’, OCS mesgcb, Lith. memo ‘month’, Goth, mena (Eng. moon), 
menops (Eng. month), OIr. mi ‘month’, Ann. amis ‘month’, Toch. B mehe 

PIE *d h eh,- ‘put, make, etc.’ > Ved. da-dhami ‘I set, make’, Gr. Ti-flripi ‘I put, set up’ 
(=> Eng. thesis), Lat .fed ‘I made’, OCS deti ‘to do’, Lith. deti ‘to lay/put’, Goth. 
ga-deps (Eng. deed), Hitt, temi ‘I speak, state’ 

PIE *seh,- ‘sow’ > Lat. semen ‘seed’ (=> Eng. semen), OCS seti, Lith. seti, Goth. 
mana-seps ‘mankind’ (OEng. seed > Eng. seed), OIr. sil ‘seed’ 

Only Anatolian differentiates original *e from *eh, (p. 176). 

*eh 2 

PIE *meh 2 ter ‘mother’ > Ved. mata (Av. matar-), Gr. (Dor.) pdirip [a] (Myc. ma-te, 
Att. pfjxrip), Lat. mater, OCS mati, Lith. arch, mote (Latv. mate), OEng. mddor 
(Eng. mother), OIr. mathir (Gaul. dat. pi. matrebo). Alb. moter ‘sister’ (cf. also 
*b h reh 2 ter, p. 18) 

PIE *peh,(s)- ‘guard, graze’ > Lat. pasco ‘I feed, pasture’, OCS pasti ‘to pasture, 
herd, feed’, Goth .fodjan ‘to feed, nourish’ (Eng. food), Hitt .pahs- ‘guard’, (?) OIr. 
as ‘growth’, Toch. B pask- ‘guard’ 

PIE *k"eh 2 s- ‘cough’ > Ved. kas-, CS kasnljati ‘to cough’, Lith. koseti ‘to cough’, 
OEng. hwosan ‘to wheeze’, Alb. kolle, Toch. B kosi 

When comparative evidence points to *a, a PIE *eh 2 (phonetically, and written as 
such in reconstructions by some - cf. e.g. the chapter on Armenian, *ah 2 already in 
PIE) is usually reconstructed now. Traditionally adduced examples for a non-laryngeal 
*a (p. 42) are dubious (though not impossible). In some cases, the *h, in *eh, is sub- 
stantiated by Anatolian (cf. *peh 2 (s)-); in others (like *k w eh,s-) it cannot be strictly 
proven (BS1. intonations are not really conclusive in distinguishing long grade and 
* VH, as some think - p. 107), and in yet others *h 2 can be indirectly proven: *me-h 2 ter 
obviously has the same suffix as *p-h 2 ter (p. 46), *b h re-h 2 ter (p. 18), *d h ug-h 2 ter 
(p. 47), etc., with the laryngeal being directly attested in Indie as the aspiration of the 
preceding consonant in the last word (p. 30). 
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The reconstructed PIE *eh, yields *a in most languages (the exception being Hitt. 
-ah- in some positions). In Indo-Iranian this *a merges with *e/o into a. Latin and Celtic 
preserve a, and Armenian has the reflex a. In Greek dialects, a is also preserved, but it 
changes into r| in Attic-Ionic (but not after e, i, p in Attic, p. 295). BS1. *a is preserved in 
Slav./Latv. a but changes into o [o] in Lithuanian. The reflex o is seen also in Albanian 
and Germanic (PGmc *5 > Goth. <o> [5], OEng. <o> [5]). In Tocharian, PIE *a yields o 
(Toch. B) and a (Toch. A). 

*eh 3 

PIE *deh,- ‘give’ > Ved. da-dami ‘I give’ (GAv. da-dat ‘gives’), Gr. 81-8copi ‘I give’, 
Lat. donum ‘gift’ (=> Eng. donate), OCS dati ‘to give’, Lith. duoti ‘to give’, OIr. 
dan ‘gift’, Hitt, da- ‘take’, Arm. tur ‘gift’ 

PIE *gneh,- ‘know’ > Ved .jha- (GAv. -zan-), Gr. yvco- (if not from *gnh r ), Lat. (g) 
nosed ‘I get to know’, OCS znati ‘to know’, Lith. zinoti ‘to know’, Toch. A knan- 

PIE *peh,- ‘ drink’ > Ved. a-pat ‘he drank’, Gr. Jicopa ‘beverage’, Lat. potus ‘drunk’, 
lAth.puota ‘feast, banquet’, Hitt, pas- ‘swallow’ 

There is no difference between reflexes of *eh, (phonetically/allophonically *oh 3 
already in PIE), *oH, and *5 (p. 41). 


*CHC (interconsonantal laryngeal) 

Laryngeals between consonants (*CHC), in postconsonantal position word-finally 
(*-CH#), and in word-initial pre-consonantal position (*HC-) are called “syllabic” 
and thus sometimes marked as *H (in *CRHC it is always the resonants that are 
syllabic except with the present tense nasal infix - p. 31, 39). However, it is quite 
possible that they were not really phonetically syllabic but rather pronounced with a 
prop-vowel *9. 


*Ch,C 

PIE *h 2 enh[- ‘breathe’, *h 2 enh,mos ‘breath’ > Ved. ciniti ‘breathes’ (YAv. antva- 
‘ inhalation’), Gr. avepxx; ‘wind’, Lat. animus ‘mind, spirit’ (< *anamos, => Eng. 
animate), Goth, uz-anan ‘to breathe out’, MW anadl ‘breath’ < PCelt. *ana-tla, 
Arm. holm ‘wind’ (< *honm), Toch. B anask- ‘breathe, inhale’ 

PIE *genh,tor ‘begetter’ > Vcd.janitd ‘begetter, father’, Gr. yevexcop ‘begetter, ances- 
tor’, Lat. genitor ‘begetter, parent, father’ (< *genat5r); cf. OCS zgtb (BCMS zet), 
Lith. zentas ‘son-in-law’ 

PIE *wemhj- ‘vomit’ > Ved. vam(i)-, Gr. epeco ‘I vomit’, Lat. uomitus ‘vomiting’ (=> 
Eng. vomit), Lith. vemti ‘to vomit’, Goth, ga-wamms ‘unclean, stained’ (< PGmc 
*wammaz < *womh 1 n-) 

*Ch 2 C 

PIE *ph 2 ter ‘father’ > Ved. pits (GAv. pta, dat. sg . fodroi, YAv. pita, OPers. pita), 
Gr. jraxfip, Lat. pater, (?) CS stni ‘uncle’ (< *ptr-), Goth . fadar (Eng. father), OIr. 
ath(a)ir (Gaul. dat. pi. atrebo), Arm. havr, Toch. B pacer 
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PIE *d h ugh 2 ter ‘daughter’ > Ved. duhild (GAv. dug'dar-), Gr. Suyairip, Osc. futi'r, 
OCS dusti (< PS1. *dr>kt’i), Lith. dukte, Goth, dauhtar (Eng. daughter), Gaul, dux- 
tir (CIb. gen. sg. tuateros), Hitt. duttariyatH as ‘female functionary’ (4= Luw.?, 
HLuw. tuwatra/i-), Arm. dustr, Toch. B tkacer 
PIE *senh 2 - ‘(seek to) accomplish, reach’ > Ved. san(i)- ‘win, reach, get’, Hitt, sanhzi 
‘seeks, looks for, attempts’ 


*Ch 3 C 

PIE *h,erh 3 trom ‘plough’ (cf. *h 2 erh 3 -, p. 44) > Gr. dpoipov, Lat. aratrum (with ana- 
logical *a after arare ‘to plough’), Cz. radio, Lith. arklas ‘wooden plough’ (Latv. 
arkls), OIr. arathar (< PCelt. *aratro-) 

PIE *dh 3 -tos ‘given’ (cf. *deh 3 -, p. 46) > Ved. a-di-mahi ‘we shared’, Gr. 5oxoq, Lat. 
datus (=> Eng. date) 

PIE *ph 3 tos ‘drunk’ (participle, cf. *peh 3 -, p. 46) > Gr. no xoq ‘drinkable’ 

In certain languages and in some conditions laryngeals can drop entirely in this posi- 
tion- cf. the word ‘daughter’ with the vocalic outcome in Greek (as always there), Vedic, 
Tocharian, and likely Celtiberian, but with the zero-reflex in other languages. The only 
language that distinguishes *h,, *h 2 , and *h 3 in this position is Greek, where one gets 
e/a/o respectively. In Old Indie, the reflex is usually and in certain unclear conditions 
it is -T- (cf. Ved. braviti ‘speaks’ < *mlewh 2 -), and occasionally -0- (cf. Ved. dadhmas 
‘we place’ < *d h ed h h,mes). In Iranian, the laryngeals are much less often vocalized to i 
(p. 267). It is not clear why Old Indie sometimes shows the aspiration of the preceding 
stop (duhitd ‘daughter’) and sometimes does not (pith ‘father’) (p. 30). This is perhaps 
related to a different position of the prop-vowel: *d h ugh 2 [g]ter (p. 47) but *p[s]h 2 ter 
(p. 46). In Tocharian we find a, and in other languages, including Armenian (p. 430), it 
is a (unless dropped altogether). In Balto-Slavic, *VRHC yields *VRC with the acute 
accent (PS1. BCMS" Sin. 7” Lith. ', Latv. 7 ~) from the laryngeal. In Anatolian, h is 
preserved in some conditions. 


*#HC- (initial pre-consonantal laryngeal) 

*h,C- 

PIE *h,se/onti ‘they are’ > Ved. santi (GAv. hantT), Gr. (Myc.) e-e-si /ehensi/, Lat. 
sunt, OCS sqt'b, Goth, sind, Hitt, asanzi, Ann. en 

PIE *h,lewd h - ‘grow (up); people; free’ > Ved. rodhati ‘grows’ (GAv. raodanti ‘they 
grow’), Gr. e^euflrpo:; ‘free’, Lat. liber ‘free’ (=> Eng. liberty), IfberT ‘children’, 
OCS ljudie ‘people’, Lith. liaudis ‘people, nation’, Goth, liudan ‘to grow’ (OHG 
liut ‘people’) 

PIE *h,su ‘good’ > Ved. su- (Av. hu-), Gr. eu- (=> Eng. eu-thanasia), OCS st>- (in 
stj-dravb ‘healthy’, si>-mrbtb ‘death’), OIr. su- 

*h 2 C- 

PIE *h 2 ster ‘star’ > Ved. instr. pi. stfbhis (Av. star-), Gr. dcrcfip (=> Eng. asteroid), Lat. 
Stella (=> Eng. stellar), Goth, stairno (Eng. star), OIr. ser, Hitt, hasterza /hsterz/, 
Arm. astl, Toch. B scirye 
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PIE *h 2 weh r ‘blow’ > Ved. vati ‘blows’ (YAv. va'ti), Gr. cir||.ii ‘I blow’, Lat. uentus 
‘wind’, OCS vejati ‘to blow’, Lith. vejas ‘wind’, Goth, waian ‘to blow’, winds 
(Eng. wind), MW gwynt ‘wind’ (< PCelt. *wintos), Hitt, huwanza /hwanz/ ‘wind’, 
Toch. A want 

PIE *h,ner ‘ man’ > Ved. naras ‘men’ (GAv. na), Gr. dvip (=> Eng. Andrew, android), 
Osc. niir, MW ner ‘chief, hero’, (?) Hitt, in-narawawar ‘strength’, Arm. ayr. Alb. 
njer(i) ‘human being’, NPhryg. avap 

*h 3 C- 

PIE *h,neh 3 mii ‘name’ > Ved. nama (YAv. nqma), Gr. ovopa (=> Eng. onomastics), 
Lat. nomen (=> Eng. nominal), OCS img (< BS1. *inmen- < *h 3 nh,men-), Goth. 
namo (Eng. name), OIr. ainm (< PCelt. *anman < *h,nh 3 men-), Hitt, laman, Ann. 
anun, Toch. B iiem, NPhryg. ovopav 

PIE *h 3 b h ruHs ‘(eye)brow’ > Ved. bhriis (YAv. dat. pi. brvatbyqm), Gr. ocppuq, OCS 
instr. pi. brbVbmi, Lith. dial, brim's, OEng. bru (Eng. brow’), OIr. for-bru, Toch. B 
pdrwdne ‘brows’, Ancient Macedonian (Hesychius) aPpouia; 

PIE *h 3 neyd- ‘ revile’ > Ved. nindanti ‘they revile’ (YAv. naismi ‘I revile’), Gr. 
oveiSoq ‘reproach’, Lith. niedeti ‘to scorn, despise’, Goth, ga-naitjan ‘to revile’, 
Arm. anicanem ‘I curse’ 

Initial pre -consonantal laryngeals disappear in most languages. One exception is Ana- 
tolian, where *h 2 -/h 3 - > h- in some positions and *h, > a- (which is disputed) (p. 47). The 
other is the Greek-Macedonian-Phrygian-Annenian dialectal group, where *h,, *h 2 , h 3 
vocalize - the reflexes (traditionally called “prothetic vowels”) are e-, a-, 6- respectively 
in Greek (and Phrygian) and e- (> /-), a-, o- (> a-) respectively in Armenian (p. 429-430). 

Diphthongs 

In PIE, all combinations of vowels and resonants (even *er, *om, etc.) are often regarded 
as diphthongs (like in modern Lithuanian). However, only the combinations of vowels 
and *y/w were diphthongs in a strict sense: 

*ey *oy *ay 

*ew *ow *aw 

Only these have special reflexes in IE languages - e.g., PIE *ey does not reflect as a mere 
sequence of *e and *y, while *er does (cf. *wert-, p. 33), so there is no need to treat the 
latter separately in comparative phonology. Long diphthongs (*ey, *oy, *ew, *ow) are rare, 
just like the long-grade *e, *o (p. 40-41). The position of *ay and *aw is similar to that 
of *a (p. 42). Secondary *a- and *o-diphthongs can arise from laryngeal combinations 
(p. 43): *h,ey/w > *ay/w, *eh 2 i/u > *ay/w and *h 3 ey/w > *oy/w, *eh,i/u (or *oHi/u) > *oy/w. 
Cf. PIE *neh,us ‘ ship ’ (Gr. vauq) with *eh 2 u > *a(h 2 )u > *aw, but the clear presence of the laryn- 
geal in the ace. sg. *neh 2 wm (Gr. vrj(p)a, Ved. navam, Lat. nauem) with *eh 2 w > *aw. Only 
tautosyllabic (pre-consonantal) Vy/wC and (word-final) Vy/w# sequences are diphthongs - 
heterosyllabic (pre -vocalic) Vy/wV sequences are not. Cf. the special diphthongal reflex 
of *ey in PIE *weyd- ‘know’ (Ved. ved-, p. 49) or /-stem voc. sg. *-ey (Ved. -e, p. 72), but 
regular reflex of *e and *y in non-diphthongal PIE *treyes ‘three’ (Ved. travas). 

There are four basic kinds of development of diphthongs in IE languages: (a) an 
unchanged diphthong (e.g., PIE *ew > Gr. eu), (b) a different diphthong (e.g., PIE *ow > 
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Gmc *aw), (c) a monophthong (e.g., PIE *ey > OCS i), and (d) subsequent monoph- 
thongizations and diphthongizations (e.g., PIE *ey > East Baltic *e > Lith. ie). Both 
monophthongizations (of diphthongs) and diphthongizations (of long vowels) are typo- 
logically common. PIE diphthongs are best preserved in (the oldest stages of) Ancient 
Greek. 


*ey 

PIE *deywos ‘god’ > Ved. devas (GAv. daeuua- (pre-Zarathustra god), OPers. daiva- 
‘false god’), Lat. dTuus ‘godly’ (OLat. deiuos ‘god’; => Eng. divine), (?) OCS divb 
‘astonishment’, Lith. dievas ‘god’, delve ‘goddess’ (Latv. dievs, OPruss. deiw(a)s), 
OEng. TTw (name of a god, cf. Tues-dav), OIr. dia (Gaul. Deuo-gnata ‘bom of god’ 
(pers. name), CIb. Teiuo-reikis ‘god’s king’ (pers. name)) 

PIE *h,ey- ‘go’ > Ved. eti ‘goes’ (YAv. a-e‘ti, OPers. aitiv), Gr. eipi ‘I go’, Lat. Ts ‘you 
go’ (Paelignian pi. eite ‘go!’), OCS iti ‘to go’, Lith. eiti ‘to go’ (Latv. iet ‘to go’, 
OPmss. eit ‘goes’) 

PIE *weyd- ‘see; know’ > Ved. vedas ‘knowledge, insight’, Gr. eiSov ‘I saw’ (=> Eng. 
idol), Lat. uTs us ‘sight’, OCS videti ‘to see’, Lith. dial, veizdeti ‘to look’, Goth. 
fair-weitjan ‘to observe’, OIr .fiadu ‘witness’ 

PIE *ey remains unchanged in early Greek, Old Latin, Old Prussian, and sometimes in 
Lithuanian. In Indo-Iranian, where *e/o/a > *a, *ey/oy/ay merge into *ay (cf. *ew/ow/aw 
> *aw), which is preserved in OPers. ai, yields Av. ae in open and di in closed syllables 
(e finally), and becomes e in Old Indie (always long and still considered a “diphthong” 
in Old Indie grammar tradition). Earliest Greek ei /ei/ monophthongized to a closed long 
/e/ by the time of the Classical Greek in Athens in the 5th century BCE (and to /I/ even 
later) - p. 296. Old Latin still had ei (up to Plautus and Terence), which later yielded f. 
BS1. *ey was preserved in Old Prussian and in certain conditions in Lithuanian, while 
in others *ey > *e > ie (p. 488; traditionally, it is thought that *ey/oy/ay yield ie origi- 
nally only in stressed syllables, which would later be obscured by analogies). In Slavic, 
*ey > i (originally always long). In Germanic, PIE *ey > *T (written in Gothic as <ei>). 
In Celtic, *ey > *e (perhaps not in Celtiberian and Lepontic, p. 360, 362; OIr. e > ia). 
PIE *ey monophthongizes to PAnat. *e (Hitt, e, p. 176), to i in Tocharian and Albanian 
(p. 562), and to e in Armenian (p. 425). 


*oy (*h 3 ey, *eh 3 i) 

PIE *loyk"(n)- ‘ remaining’ > Ved. reknas ‘inherited property’ (GAv. raex’nah- 
‘inheritance’), Gr. 7.ouroq ‘remaining’, OCS ot'b-lekb ‘remainder’, Lith. at-laikas 
‘remainder’, OHG lehan (< PGmc *layx w naz, Eng. loan) 

PIE *woyk- ‘settlement’ > Ved. vesas ‘inhabitant’ (YAv. vaesa- ‘servant’), Gr. otKoq 
‘house’ (=> Eng. eco-nomy, etc.), Lat. ulcus ‘village’ (OLat. ueicus), Lith. vies-pats 
‘lord’ (OPmss. ace. sg. wais-pattin ‘housewife’) 

PIE *h,oyn/k/wos ‘one’ (p. 89) > Ved. ekas (Av. a-euua-, OPers. aiva-), Gr. oloc 
‘alone’, Lat. iinus (OLat. ace. sg. oino), Lith. vienas (Latv. viens, OPmss. ains), 
Goth, ains (Eng. one), OIr. den 

PIE *oy is preserved in Greek (ot), Old Latin, and Celtic (written as <oi>, <oe>, etc. 
in Old Irish). See *ey for Indo-Iranian. OLat. oi remains a diphthong (written as <oe>) in 
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seven words after labials (cf. foedus ‘treaty’) and yields u elsewhere in Classical Latin, (ini- 
tial *woy- > uT-, however). In Balto-Slavic, *oy and *ay merge into *ay, preserved as ai in 
Old Prussian and yielding, as with *ey above, ai in some positions in Lithuanian and ie (via 
*e) in others (p. 488). In Slavic, the reflex is e and sometimes -i finally. In Gennanic, *oy 
and *ay also merge into *ay (Goth. ai). Hittite has ai for PIE *oy before coronal consonants, 
otherwise e (p. 176). In Armenian, PIE *ey and *oy yield e (p. 425). In Albanian, PIE *oy 
(and *ay) yield e (p. 562). In Tocharian B, it is ei (> ai), and e in Tocharian A. 


*ay (*h 2 ey, *eh 2 i) 

PIE *kavkos ‘ one-eyed, blind’ > Skr. kekaras ‘squinting’, Lat. caecus ‘blind’, Goth. 
haihs ‘one-eyed’, OIr. caech ‘one-eyed’ 

PIE *laywos ‘left’ > Gr. kaioc,, Lat. laeuus, OCS levb, (?) Toch. B laiwo ‘lassitude’ 

PIE *h 2 eyd h - ‘burn’ > Ved. edhas ‘firewood’ (YAv. aesma-), Gr. otilko ‘I kindle’, Lat. 
aestus ‘heat’, Lith. iesme ‘firewood’, OEng. ad ‘funeral pile’ (< PGmc *ay6az), 
OIr. aed ‘fire’, Alb. (h)ethe ‘fever’ 

For the problem of *a(y), cf. p. 42. Initial *ay- is usually interpreted as original *h,ey-, 
while *ay is often assumed after velars (p. 42), though alternative reconstructions like 
*kh,eykos and *keh,ikos are not unimaginable. See *oy above for Indo-Iranian, Bal- 
to-Slavic, and Germanic. Old Latin still has ai, later spelled as <ae> (and pronounced as a 
diphthong by the upper classes). In Armenian (ay) and Celtic, the diphthong is preserved 
(Old Irish has a late merger with *oy, p. 366). In Hittite, ai is preserved before coronals; 
otherwise it is e (p. 176). Tocharian B preserves ai (cf. e in Tocharian A). 

*ew 

PIE *b h ewd h - ‘be awake/aware’ > Ved. bodhati ‘is aware, notices’ (Av. baoSa‘ti), Gr. 
neuflopai ‘I find out/inquire’, OCS bljusti ‘to watch/guard’, Goth, ana-biudan ‘to 
bid/command’ 

PIE *h,lewd h - ‘free; man’ > Gr. eT-rufiepoq ‘free’, OCS ljudie ‘people’, Lith. liaudis 
‘people, nation’, OEng. leod ‘man’ (< PGmc *lew6iz) 

PIE *h,ews- ‘singe, scorch’ > Ved. osati ‘singes’, Gr. euco ‘I singe’, Lat. urd ‘I scorch/ 
burn’ 

PIE *ew remains unchanged in Greek (eu). In Indo-Iranian, *ew/ow/aw merge into *aw 
(cf. *ey/oy/ay), preserved in OPers. au, yielding Av. ao in open and §u in closed syllables 
(-(uu)o finally), and o in Old Indie (always long and considered a “diphthong”). In Archaic 
Latin eu merges with ou < PIE *ow and later yields ii. BS1. *ew is perhaps preserved in 
OPruss. eu in some cases in Lithuanian it yields iau (p. 489, 498), and in Slavic (palataliz- 
ing) ’u. Gmc *ew yields Goth, iu (cf. p. 40). In Celtic, *ew tended to change to *ow (> OIr. 
mostly ua, p. 366). Monophthongization occurs in Anatolian (Hitt, u, p. 176), Albanian ( e ), 
and Tocharian ( ’«). In Annenian, *ew and *ow merge into ov. 

*ow 

PIE *b h owd h eye- ‘wake (somebody) up’ > Ved. bodhavati ‘wakes up’ (YAv. baoSaiie- 
‘ti ‘reveals’), OCS u-buditi ‘to wake up’, Lith. baudvti ‘to incite’ 
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PIE *klownis ‘haunch, thigh, hip, buttock’ > Ved. sroni- ‘hip and loins, buttocks’ 
(YAv. sraoni- ‘hip, buttock’), Lat. clunis ‘buttock, haunch’, Lith. slaunis ‘thigh, 
hip’, OIc. hlaun ‘buttock, haunch’, MW clun ‘thigh’ (< PCelt. *klownis) 

PIE *h,rowd h os ‘red’ > Ved. lohas ‘reddish’, Lat. rufus (a Sabellic loanword), Cz. 
rudy, Lith. raudas ‘reddish’, Goth, raups (Eng. red), OIr. ruad (Gaul. Roudius 
(pers. name)) 

For Indo-Iranian and Armenian see *ew. The earliest Greek still had a diphthongal 
oi), which became a long closed /o/ and later /u/ in Attic (p. 296). OLat. ou became 
Classical Latin u. In Balto-Slavic, *ow and *aw yielded *aw (preserved as Lith. au), 
which in turn gave Slav. u. Germanic also had merger of *ow and *aw (cf. Goth. au). 
PIE *ow is preserved in Celtic (> OIr. 6 > ua). Hittite has u but au before coronals 
(p. 176) as reflexes for *ow and *aw, the Albanian reflex is a, while Tocharian B has eu 
(later au, cf. Toch. A u). 

*aw (*h 2 ew, *eh 2 u) 

PIE *h 2 ews- ‘shine’ > Gr. aupiov ‘tomorrow’, Lat. aurora ‘dawn’, aurum ‘gold’, 
OCS utro ‘morning’, Lith. ausra ‘dawn’, au(k)sas ‘gold’, ONor. austr (Eng. east) 
PIE *h 2 ews- ‘ear’> Lat. auris, OCS uxo, Lith. ausis, Goth, auso (Eng. ear), OIr. auld 
(Gaul. Su-ausia (pers. name)) 

PIE *h 2 ewg- ‘grow’ > Ved. ojas ‘strength’ (GAv. aogo), Gr. au^co ‘I grow’, Lat. auged 
‘I increase’, Lith. augti ‘to grow’, Goth, aukandei ‘increasing’ 

Concerning the PIE status of *aw, cf. *a and *ay (p. 42). Initially, *h,ew- is usually 
reconstructed, while *aw occurs, for instance, in some loanwords (cf. *tawros, p. 57). For 
Indo-Iranian see *ew; for Balto-Slavic, Germanic, and Hittite see *ow. Greek, Latin, and 
Celtic preserve the original av/au (OIr. ua/o). PIE *aw is preserved in Armenian (aw) and 
Tocharian B (au, cf. Toch. A o). In Albanian, the reflex is a (p. 562). 

Long diphthongs 

Like *e and *5, long *ey, *oy and *ew, *ow (original and resulting from various contrac- 
tions) were rather rare and later merged with short diphthongs in most languages. The 
special reflexes are clearest in Indo-Iranian, where they yield *ay (OInd. ai, word-finally 
-a; Av. ai) and *aw (OInd. aw, A v. au) respectively. Cf. PIE *g w ows (p. 24, 71) and instr. 
pi. *-5ys (p. 65, 266). Greek has special reflexes for *-ey(-), *-5y(-) (p. 289-290, cf. dat. 
sg. *-5y, p. 65). 


TABLE 1.9 THE REFLEXES OF PIE DIPHTHONGS 


PIE 

OInd. 

Av. 

Gr. 

Lat. 

OCS 

Lith. 

Goth. 

OIr. 

Hitt. 

Arm. 

Alb. 

Toch. 

*ey 

e 

ae/oi 

£1 

1 

i 

ie/ei 

ei 

ia/e 

e 

e 

i 

i 

*oy 

e 

ae/di 

01 

oe/u 

e 

ie/ai 

ai 

oi ( oe ) 

e/ai 

e 

e 

Ae,B ei > ai 

*ay 

e 

ae/oi 

(XI 

ae 

e 

ie/ai 

ai 

ai (ae) 

e/ai 

ay 

e 

A e, B ai 

*ew 

0 

ao/du 

£1) 

u 

u 

iau 

iu 

ua/o 

u 

oy 

e 

u 

*ow 

0 

ao/du 

OD 

u 

u 

au 

au 

ua/o 

u/au 

oy 

a 

A o, B eu> au 

*aw 

0 

ao/du 

ao 

au 

u 

au 

au 

tia/o 

u/au 

aw 

a 

A o, B au 
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PHONOTACTICS 

Syllable structure and consonant clusters 

PIE had CVC, CV, and VC type syllables - with consonant clusters also, (CC)CVC(CC), 
etc. As usual, more types of clusters were permitted in medial (heterosyllabic) than in 
initial or final (homosyllabic) position - for instance, the cluster *-mb h - was possible 
medially in *gomb h os (p. 18), but not initially (where it would be **mb h -, p. 31) and 
finally (there was no final *-b h , p. 53). As in living languages, it is sometimes difficult to 
ascertain the exact syllabification in medial positions - e.g., is it *h,elt-wos or *h 1 e-Kwos 
(p. 22), *ne-b h os or *neb h -os (p. 31), or both. 

It seems that onsetless initial syllables (#VC-) were rare. It is common practice now 
to reconstruct initial laryngeals even when not strictly provable, though in some cases 
they can be proven (p. 44). Thus, the frequency of initial VC- depends on whether one 
prefers to reconstruct *ekwos or *h,ekwos (which in cases of *h 2 e- also depends on one’s 
position on the status of PIE *a - p. 42). It is possible that #VC- was usual in pronouns, 
e.g., in forms like *ey ‘that’ (p. 87), though here one can also reconstruct *h[ey. In any 
case, onsetless initial syllables were possible with syllabic resonants, like *n- ‘non-’ or 
*udn- ‘water’ (in oblique cases of *wodr). In cases with zero grade only, optional initial 
laryngeals are theoretically reconstructable, cf. *(H)ish 1/2 ros (Ved. isiras ‘strong, active’, 
Gr. iepoi; ‘holy; strong’). 

As for single initial consonants, it seems that only *r- was not permitted (p. 32), so 
*Hr- may be reconstructed instead of *r- even when not strictly provable (e.g., in *(H) 
rotHos - p. 32). Initial *TR- (like *klewos ‘fame’ > Ved. sravas ‘renown’, Gr. k7.so<;) 
was not possible in case of homorganic stops + nasals (e.g., **pm-, *tn-, etc.), though 
this was nonnal in medial heterosyllabic position (e.g., *potnih 2 ‘mistress’ > Ved. patnT, 
Gr. noivia). Initial *sC- was usual (cf. *steyg h - (p. 19), *sneyg wh - (p. 25)), and ini- 
tial *ks- was also apparently possible (cf. *ks-, p. 16). Only some cases of (labial + 
dental/alveolar) *RR- (with asyllabic first resonant!) were permitted: *wr-, *wl-, *ml- 
(cf. *mlewh 2 - > GAv. mraomT ‘I say’), *mr- (cf. comp. *mreg h wyds ‘shorter’ > Lat. 
breuis ‘short’), and *mn- (cf. *mneh 2 - ‘think’ (LIV)). Before other consonants, resonants 
were always syllabic: *RC- (p. 31). Initial *CHV- is suspicious, though some would 
reconstruct it in some cases of PIE *a (p. 42, 50). In cases like *ph,ter, laryngeals were 
“syllabic” (p. 46). In cases of expected *CHR-, a laryngeal metathesis occurs (*CHR- > 
*CRH-). Cf. *peh 3 - ‘drink’ (Ved. aor. apat ‘he drank’, Lat. potus ‘drunk’) but *ph 3 i- 
(a zero grade of *peh,-y-) > *pih 3 - (metathesis) > Ved. pitas ‘drunk’, OCS piti ‘to drink’. 
Initial *HC- (frequently with resonants, less with stops and *s) was permitted, as proven 
by “prothetic vowels” in some languages (p. 48), but was perhaps “syllabic” in those 
positions (cf. p. 46^-7). 

Initial clusters of two stops were infrequent in PIE, but some cases seem to have existed 
(in spite of the tendency to eliminate such clusters). The initial cluster *pk- is apparently 
seen in Av. fsumant- ‘wealthy; having cattle’ (cf. PIE *peku ‘cattle’ > Ved. pa.su, Goth. 
faihu). In some cases, a non-phonemic “secondary schwa” appears to break a cluster 
(which yields a in Latin and most languages but apparently i in Greek): cf. *k' v twor- 
[k^atwor-] 12 ‘four’ (in oblique cases) > Lat. quattuor, Gr. (Horn.) Jiicrups:; (the cluster 
is attested without a prop-vowel medially in YAv. a-xtu'nm ‘four times’ < *k w tury-, and 
initially in Ved. tuny a- ‘fourth’ with probably later cluster simplifications); *ptneh 2 - [pst- 
neh 2 ] > Lat. pando, Gr. nixvqpi ‘I spread out’ (cf. also Hitt, taknas below). In other cases, 
original initial clusters had probably been simplified already in PIE, cf. *kmtom (p. 34) < 
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*dkmtom ‘hundred’, obviously derived from *dekm(t) ‘ten’ (p. 34, 90). Some all-stop ini- 
tial clusters (like Gr. jixepov ‘feather’ from PIE *pet- ‘fly’) are likely post-PIE, and others 
(like *pteh,k- ‘to duck’ in LIV) are dubious. A number of original dental + velar clusters 
seem to have existed - e.g., *tken- ‘hit, injure’, *d h g h uH- ‘fish’, *d h g h (e)m- ‘earth’, etc. 
These clusters were rarely preserved (cf. Toch. A tkam ‘earth’) - either the first stop drops 
(Lith. zuvis ‘fish’; OCS zemlja ‘earth’); a metathesis occurs (*tken- > *kten- [kj>] > Ved. 
ksanoti ‘injures’, Gr. icrelvco ‘I kill’; *d h g h em- > *g h d h em- [g h d h -] > Ved. ksam-, Gr. $kov 
‘earth’); an epenthetic vowel emerges (Hitt. gen. sg. taknas ‘earth’) or, rarely, a prothetic 
vowel appears (Gr. i$K >5 ‘fish’). In both initial and medial dental + velar clusters, there 
is often a metathesis (as in *d h g h (e)m- > *g h d h (e)m-) already in PIE (probably not in all 
conditions), and the dental stop (*t/d/d h ) changes to an allophonic *J}/6/6 h , as traditionally 
reconstructed, with special reflexes in some languages (OInd. ks, Lat. 5 but Gr. - cf. 
*h 2 rk|}os, p. 35). In these cases (the so-called thorn-clusters), we are thus dealing not 
with all-stop clusters but with stop + fricative clusters (similar to *ks- or *THV- above) 
if the traditional reconstruction is correct. 

As for triconsonantal initial clusters, almost all of them had a fricative (*s/H) as the 
first or the second element. Those in *sTR- were the most usual (e.g., PIE *streyg-/stri(n) 
g- > Lat. stringo ‘I strip off’, OCS strigg ‘I cut’), the *CsC- ones were rare and likely 
onomatopoeic (cf. *pster- > Gr. 7tidpvupai ‘I sneeze’, Lat. sternud ‘I sneeze’; *ksweyb h - 
‘swing’ (LIV); *ksnew- (p. 16)). Possible clusters with initial laryngeal (*h,ster (p. 47), 
*h,syeh,t (p. 102), *h 3 b h ruHs (p. 48), etc.) perhaps had a “syllabic” laryngeal (i.e., pho- 
netic *HaCC- or similar). Triconsonantal clusters without *s/H seem to be “thorn-clus- 
ters” - cf. *d h g h yes (> *g h d h yes) ‘yesterday’ (> Ved. hyas, Gr. x&e?) and *pkten- ‘comb’ 
(Gr. Kielq, cf. Lat. pecten < *pekten-). An initial four-consonant onomatopoeic cluster can 
also possibly be reconstructed: *sptyewH- (LIV) > Gr. jttuco ‘I spit out’, Goth, speiwan 
‘to spit’. A number of additional clusters were permitted word-medially, but we cannot 
adduce them all here. Geminates were possible only in expressive or nursery words like 
*atta ‘daddy’ (Gr. ana ‘father’, Lat. atta, Goth, atta ‘father’, Hitt, attas ‘father’ - cf. also 
the two *a phonemes in the word). The *-ss- on morpheme boundary was probably sim- 
plified to *-s- already in PIE (cf. p. 78 for 5-stems and p. 93 for *h[e(s)si). 

Obviously, not all segments and clusters were allowed word-finally. A PIE word could 
end in any syllabic segment, except for *e, *5 (apparent final *-e/o always stem from *-e/ 
oH), marginal *a/ay/aw (p. 42), and apparently *-l. As for single consonants, all resonants 
and fricatives were permitted word-finally but of stops only *-t/-d (cf. aor. 3 sg. *-t (p. 
96), *tod ‘that’ > Goth .pat-a, Ved. tat/d , OCS to), and marginally *g (only in *hjeg T — 
p. 82). It is often claimed that *-t and *-d are not distinctive in final position. Indeed, in 
most languages there are no clear reflexes of such an opposition (and in languages like 
Greek or Old Church Slavic no final -t/-d is even possible). However, cf. the already 
adduced Goth, pat-a < PGmc *\>at (Eng. that. Germ, das) < PIE *tod, but Goth, milip 
‘honey’ < PIE *melit (p. 77). The resonants in *-CR were always syllabic, and the same 
goes for *-CH, cf. *g w enh, (p. 24, 66), *megh, [-gh,a] (p. 30) ‘big’ > Ved. mahi, Gr. peya. 
A number of final PIE clusters are morphologically reconstructable (all of them ending 
in *-s or *-t), though it is difficult to know whether some of them were phonetically 
simplified already in PIE. Considering *-RT type final clusters, *-nt was frequent (p. 97), 
and others appeared occasionally, cf. root aor. *g w emt ‘went’ (p. 96) or root noun *kerd 
‘heart’ (Gr. Horn, idjp, Hitt, ker), though these are usually structurally reconstructed (i.e., 
the clusters are not actually attested in daughter languages so it is not impossible that, for 
instance, *-rd# was simplified already in PIE). All *-Rs clusters were possible, though in 
pre-PIE *-VRs > *-VR innom. sg. (p. 70) (cf. also *-Vs/nh, > *-Vs/n innom. pi. -p. 70). 
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The following fricative-first clusters are attested: *-Hs (cf. *muHs - p. 38), *-Ht (cf. root 
aor. *deh 3 t ‘gave’ - p. 96), and *-st (cf. inj. *h,est ‘is’ - p. 93, 97). Clusters of *-Ts type 
were usual: *pods (p. 15), *h,regs ‘king’ (> Lat. rex), *snoyg wh s ‘snow’ (p. 25), etc. All- 
stop final clusters were exceptional, cf. *glagt [-kt] ‘milk’ (Gr. yaLa, also yLayo q, Lat. 
lac). As a parallel to *(s)TR- initial clusters, there were also some final triconsonantal 
clusters: *-CCs (*nog wh ts > *nok w ts (p. 32) > Lat. nox, Goth, nahts; ptcp. *-nts (p. 104)) 
and *-Cst (cf. sigm. aor. fonns like *yewgst ‘yoked’ - p. 96). 

Not all combinations of stops were permitted (or usual) in PIE roots (in the onset and 
coda). There were two root constraints (or near-constraints since there are some counter- 
examples): (a) no aspirated and voiceless stops together (i.e., no roots like **ped h - or 
**g h et-) and (b) no two voiced stops together (i.e., no roots like **ged-). Constraint (a) 
does not include *sT- initial roots like *steyg h - (p. 19) and constraint (b) is sometimes 
interpreted glottalically (p. 18). 

Phonological rules in PIE 

We have already mentioned Pinault’s Law (p. 30), laryngeal metathesis (p. 52), the 
post-velar allophonic *]Vd/6 h (p. 53), and here we shall briefly list some of the other pho- 
nological rules of PIE. Some rules will be discussed in chapter 2 “Proto-Indo-European 
Morphology” (p. 70) and for Bartholomae’s Law cf. p. 229. 

Dental clusters 

In PIE, when two dentals (*t/d/d h ) occurred next to each other (on the morpheme border), 
an *-s- was automatically inserted between them: *wid-tos ‘seen/known’ (cf. *weyd-, 
p. 49) > *wid-s-tos [witstos], *h,ed-te! ‘eat! (pi.)’ (cf. *h,ed-, p. 44) > *h[ed-s-te [h,etste]. 
This *tst was preserved in Anatolian as <zt> (cf. OHitt. eztenl) but changed in various 
ways in other languages: to -tt- in Indie (Ved. vittas ‘found’, atta!); to -st- in Iranian (GAv. 
vis to ‘found’), Greek (a-toioq ‘unseen’), and Balto-Slavic (OCS vestb ‘known’); and to 
-ss- in Italic (Lat. uTsus ‘seen’), Celtic (OIr. ro-fess ‘is known’), and Germanic (Goth. 
un-wiss ‘uncertain’). 

Sievers 'Law and Lindeman ’s Law 

Evidence in some IE languages (like Vedic and Gothic) points to a possible, but disputed, 
PIE rule (Sievers’ Law) that *-y- and *-w- changed to *-iy- and *-uw- after a “heavy” syl- 
lable (i.e., after *-VCC- and *-VC-), perhaps only in final position. Cf. Goth, asneis [-Is] 
‘day-laborer’ < *asn(i)jaz but nipjis ‘relative’ < *ni]}jaz. The effects of the law were later 
blurred by various analogical developments in separate languages. A variant of the same 
law occurred in monosyllabics (Lindeman’s Law), where the same distribution (*-y-/-iy-, 
*-w-/-uw-) would have originally depended on the ending of the preceding word, cf. 
*k(u)wo(n) (p. 41) or *d(u)woh, (p. 89). Again, later languages can generalize one or 
the other form - cf. both in Ved. s(u)va (only Vedic has clear traces of the law) but only 
*-uw- in Gr. kucov (though this can also be by analogy to gen. sg. kuvo<;). 


Monosyllabic lengthening 

There was a tendency in PIE for monosyllabic forms to lengthen phonetically. 13 Cf. *ne 
‘no(t)’ (Ved. na, Lat. ne-que ‘and not’, OCS ne, Lith. ne, Goth, ni) and *ne (Lat. ne, OCS 
ne-kako ‘somehow’, Lith. ne, Goth, ne, OIr. ni); *nu(n) ‘now’ (p. 81); pers. pronouns *tu 
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‘thou’, *me ‘me’, *nos ‘us’, etc. (p. 82-84); *wis (or oblique *wis-) ‘poison’ (Ved. visam) 
and *wis (YAv. vis, Lat. uirus => Eng. virus), etc. This lengthening, exhibited in numerous 
monosyllabic forms attested in IE languages both with brevity and length, appears in a 
few nouns with *i/u (though in *wls it may be from older *wis-s as well - p. 70, 79) and 
otherwise mostly in personal pronouns and particle-like words (particles, conjunctions, 
postpositions - cf. e.g., *pro and *-we ‘or’, p. 105), apparently in both accented and unac- 
cented (clitic) forms. Not all monosyllables exhibit both variants, cf. only short *h,eg ‘I’ 
(p. 82), *so ‘this’ (p. 85), or *yo ‘which’ (p. 88), which is in some cases probably due to 
chance preservance of just one variant and in others probably to various morphological 
analogies. Monosyllabic lengthening is the only non-laryngeal source of long *1 and *u 
(p. 38). It is not clear if it originally had anything to do with the stable (and morphono- 
logical) length in nouns like *pods (p. 15). 

ACCENT 

Some IE languages have a fixed accent place (like Latin - p. 328), while others have a 
free accent, i.e., an unpredictable accentual position. Since those that have a free accent 
(like Vedic and Greek) display certain correspondences, these free accent systems are 
taken to be inherited from PIE. The following branches/languages preserve (to differ- 
ent extents) the old PIE free accent or traces of it: Vedic (but not Classical Sanskrit - 
p. 222-223), Avestan (indirectly in marginal traces - p. 272-273), certain modem 
Iranian (Pashto, perhaps Yidgha) and Dardic languages (Shina, Dameli), Greek (p. 296- 
297), Balto-Slavic (Lithuanian, Latvian (p. 490-494); Proto-Slavic and certain Slavic 
languages - East Slavic, Slovene, Bosnian/Croatian/Montenegrin/Serbian, Bulgarian, 
etc. (p. 523-525)), Germanic (not directly but through Grimm’s and Verner’s Laws - 
p. 20, 393), and Anatolian (not directly but through vowel lengthening - p. 177). 

Traditionally, the reconstruction of the PIE accent is very simple - one compares the 
accent position of Vedic and Greek, plus indirect information that can be gathered from 
Germanic consonantism. The data from Anatolian is sparse, and that from Balto-Slavic is 
rarely used because it is considered to be innovative. When the accent position in Vedic, 
Greek, and Germanic corresponds (which is often the case), the same stress position is 
reconstructed for PIE (marked with * '): 

Ved. bhrSta, Gr. (ppaiqp, Goth, bropar < PIE *b h reh 2 ter ‘brother’ (p. 18) 

Ved. nabhas, Gr. vecpoq (cf. also Hitt, nepis with lengthening of the accented syllable 
and OLith. debesis) < PIE *neb h os ‘cloudy sky’ (p. 31) 

Ved. panca, Gr. nev xe < PIE *penk w e ‘five’ (p. 24, 90) 

Ved. pita, Gr. jraTip, Goth .fadar < PIE *ph 2 ter ‘father’ (p. 46) 

Ved. dhumas, Gr. Su|.i6<; ‘spirit’ < PIE *d h uh,m6s ‘smoke’ 

Ved. yugam, Gr. ijuyov < PIE *yugom ‘yoke’ (p. 36) 

However, there are also (often disregarded) disagreements between these languages: 

Gr. ‘forearm’ but Ved. bahus ‘arm’ 

Gr. 7 ie^eicu<; but Ved. parasus ‘axe’ 

Gr. noTepoq, Goth, hapar (< PGmc *xwa]jaraz) but Ved. kataras ‘which of two’ 

Gr. isTiape:; but Ved. catvdras, Goth, fidwor (< PGmc *fe6worez) ‘four’ 

Though PIE accent is usually thought to have been a “pitch-accent” (like in Vedic or 
Ancient Greek), it is also traditionally connected with ablaut (p. 61, 68), i.e., with the 
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(originally) accented full grade and unaccented zero grade. Cf. the traditionally recon- 
structed *dyews ‘day sky’ (Ved. dyaus ‘sky’, Gr. Zsrx;) - gen. sg. *diwos (Ved. divas, 
Gr. Aioq); *ponteHs ‘path’ (Ved. panthas - gen. sg. *pntHos (Ved. pathas); *ph,ter 
(p. 46) - acc. sg. *ph,term - dat. sg. *ph,trey (p. 74—75); *h,esmi - *h[Smes (p. 93), 
etc. However, the question that is rarely asked is, which was first, accent or ablaut? In 
some cases, like *h[Sines (Ved. smas) or *ph 2 trey (Ved. pitre), it is hardly possible for 
the accent to be on the zero grade *-sm- or *-tr-. Also, although the traditional ablaut/ 
accent scheme works well for some forms, like the athematic present (p. 93) or perfect 
(p. 98), it is not present in thematic presents like *b h erete (p. 93) or 5-stems like *neb h esos 
(p. 78-79). These are often explained away as secondary but without real proof. Like- 
wise, it is rather strange that direct correspondences such as Ved. gen. sg. sun as ‘dog’ 
and Hitt. gen. sg. kunas ‘hound-man’ (which would point to PIE gen. sg. *kunos from 
*kwo(n) (p. 41)) are usually interpreted as somehow secondary to Gr. KDVoq, 14 although 
the Greek form is irrelevant since all Greek monosyllabic consonant stems have an auto- 
matic end-stress in the gen. sg. (unlike Vedic). The ablaut/accent scheme also has to turn 
to various kinds of analogies when it comes to “unexpected” stressed zero-grade forms 
like *wlk w os (p. 35), *h,ftkos (p. 35), *(h,)wlh 1/2 neh, (p. 37), *septm (p. 29, 90), loc. pi. 
*ph,tr'su (Ved. pitrsu, Gr. dat. pi. jiaipdcn, - p. 75), etc. What is more, while *wlk w os is 
root-stressed, there are cases like *deywos (p. 49) with an unstressed full-grade root. To 
sum up, the case for the ablaut/accent scheme does not at all look as firm as it is usually 
portrayed. 

As noted, the traditional IE accentology holds that it is Vedic, and to a lesser extent 
Greek and Germanic, that is most archaic and that the PIE accent place is to be recon- 
structed by simple comparison of their accent positions. However, there are different 
ideas about IE accent. The Moscow Accentological School (p. 524) holds that it is not 
Vedic and Greek but Balto-Slavic (with its much more complex accentual systems, espe- 
cially in derivation, and with its full-scale accentual mobility in some word types, which 
cannot be derived from a rather trivial “Graeco-Aryan” PIE accentual model) that is most 
archaic, and that PIE, like Proto-Balto-Slavic, should be reconstructed as a tone language 
with every syllable/morpheme either low or high. 15 

Note: I would like to thank Mislav Benic, David Mandic, Mikhail Oslon, and Petra Sostaric for reading 
and commenting on the first draft of the chapter, as well as Vytautas Rinkevicius for his help with the 
Baltic data. Of course, all the mistakes are just mine. 


FURTHER READING 

There are a number of recent general handbooks on IE linguistics, all of them containing 
a section on comparative phonology and differing slightly in their perspectives, though 
often rather short on concrete examples of sound correspondences. We shall list those in 
English here: Szemerenyi 1996 (dated but still useful; has no laryngeals in reconstruc- 
tions), Meier-Brugger 2003 (a comprehensive treatment of phonology with numerous 
in-text references, but the translation is at times unfortunate), Tichy 2006 (the approach 
of the “Erlangen School” of IE, written as a series of university lectures), Clackson 2007 
(not a standard handbook but rather problem-based, discussing methodology and theoret- 
ical issues), Voyles & Barrack 2009 (an introduction focusing on the best-attested ancient 
IE languages), Fortson 20 10 2 (perhaps the best English handbook and introduction, with 
separate chapters on all the major branches, including a discussion of historical phonol- 
ogy and more detail than in other general surveys), Beekes 20 1 1 2 (the introduction to the 
“Leiden School” of IE, with glottalic theory and no PIE *a). Gamkrelidze & Ivanov 1995, 
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with their glottalic theory, is comprehensive but “heterodox”. Works like Sihler 1995 or 
Ringe 2006, while obviously focusing on specific languages/branches, also offer informa- 
tion on (P)IE in general. The most comprehensive treatment of PIE segmental phonology 
(without accent) is still Mayrhofer’s in Cowgill & Mayrhofer 20 12 2 . Concerning specific 
problems in (P)IE phonology, the number of publications (even just of more important 
works) is impossible to list, so we shall just mention some of the books. For laryngeal 
theory cf. Lindeman 1987 (though he is sometimes unorthodox) and Bammesberger 1988 
(Winter 1965 is rather old but still a classic, and Voyles & Barrack 2015 is, curiously, 
an attempt at the refutation of the now completely established and accepted laryngeal 
theory), for glottalic theory Salmons 1993, and for discussion on both Vennemann 1989. 
Byrd’s (2015) recent book deals with the PIE syllabification but also with the PIE phono- 
logical processes and phonology in general. Collinge 1985 (with supplementary Collinge 
1995 and 1999) provides an overview of sound laws and rules in PIE and IE languages, 
but is all too often chaotic and unclear. 


NOTES 

1 The Austronesian language Lifu, which has b only in loanwords (Lenonnand 1952), 
is often adduced as a counterexample (but cf. Gamkrelidze & Ivanov 1995: 10-1 1). 

2 For Kelabit cf. Blust 1974 and 2006. Compare also a similarly unstable system of p : 
p h : b h (but no voiced stops) in the Chinese Wu language (Mallory & Adams 2006: 52). 

3 There, one finds the opposition of k \k \k" andg : g : g"\ Cf. 3,gejn>MaH 1966: 15-18 
and 3,geni>MaH 1973. 

4 Cf. also Melchert 1987 and 2012 (though his claim that Anatolian is a centum branch 
does not seem warranted). 

5 Cf. the list in Melchert 2012: 216-217. 

6 Cf. a typological parallel in the q : k." : k system in the Mataco language of the Andes 
(Adelaar & Muysken 2004: 493). 

7 Cf. Kortlandt 1978. 

8 The fact that *h, (like *h,) causes aspiration in Old Indie (cf. Ved. asthi, Gr. ocrteov 
< PIE *h 3 esth, ‘bone’) would point to *hj having been an A-type sound as well (and 
not a glottal stop *? as some would have it). 

9 Cf. Pinault 1982. 

10 There are also rare quasi-ablaut non-systematic controversial correspondences like 
Goth. + tibr ‘sacrifice, offering’ < PIE *depr-, compared with *dapn- above with *r/n 
heteroclisis (p. 77-78), which is possibly a loanword, cf. Proto-Semitic *6-b-h ‘sac- 
rifice’; PIE *tawros ‘bull’ (Gr. tempo:;, Lith. tauras ‘aurochs’, OIr. tarb, etc.; proba- 
bly ultimately a loanword, cf. Proto-Semitic *j)awru(m)) and *(s)tewros (Goth, stiur 
‘young steer’ (Eng. steer), Av. staora- ‘big cattle’ - arguably unrelated to *tawros), 
etc. 

11 Cf. a somewhat similar analysis for a phonologically “vowelless” Modem Chinese 
(Pulleyblank 1984), though this only seems to prove that basically anything is possi- 
ble in a “deep” and very abstract phonological analysis. 

12 Sometimes written as *k w e twr- in the literature. 

13 Cf. Kapovic 2006: 151-153 for more examples and details. 

14 Cf., e.g., Kloekhorst 2008: 506. 

15 Cf., e.g., Dybo, Nikolayev & Starostin 1978, HmconaeB 1989, or flbi6o 2003. A com- 
pletely different, but to a point complementary (when discussing evidence for PIE as 
a tone language), approach to PIE as a tone language is Lubotsky 1988. 
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SYMBOLS USED IN THIS AND THE FOLLOWING CHAPTER 

C - any consonant 
V - any vowel 
R - any resonant 
R - any syllabic resonant 
K - any velar 
o - orthographic writing 
// - phonological writing 
[] - phonetic writing 

# - beginning/end of a word 

* reconstructed 

+ not directly attested (in a text, etc.) but inferred 
< (is) derived from 
> (has) become 

<= (is) a loanword from; non-phonetic development 
=> (is) a loanword into 
~ connected with, related to 
lit. - literally 



CHAPTER 2 


PROTO-INDO-EUROPEAN 

MORPHOLOGY 

Mate Kapovic 


INTRODUCTION 

Proto-Indo-European, as well as all early attested Indo-European languages, had a rich 
and complex synthetic morphology. This morphology was often simplified in later 
languages - e.g., no modem IE language has all the eight cases PIE had, and some have 
none. However, new morphological forms also appeared - e.g., most modem IE lan- 
guages have a separate future tense (though not always synthetic), which did not exist in 
PIE. Forms and endings are usually easier to reconstruct than their functions and exact 
meaning. The focus of this short overview will be on the reconstruction of PIE forms 
and paradigms and not on the exact (and sometimes disputable) function and meaning 
of those forms. Obviously, not all reflexes and developments in all descendant languages 
can be dealt with here. As a rule, only the most archaic attested forms (even if rare) 
are adduced, with the point of reconstructing PIE forms and paradigms. The focus is 
more on PIE itself, rather than on the development and situation in separate IE branches 
(dealt with in separate chapters of this volume). Non-inflected parts of speech, though not 
strictly part of morphology, are also dealt with in the end, while word formation will be 
tackled only marginally throughout the chapter. 


ABLAUT 

Ablaut is a regular morphonological alternation of vowels in PIE morphemes. Depending 
on its fonn (either as part of a morphological paradigm or derivational process), a PIE 
root, suffix (rarely a prefix), or ending could have had different shapes. The following 
vowels were in alternations: the full grade (*e, *o), the lengthened or long grade (*e, 
*5), and the zero grade (*0). In the case of *eR in the full grade (*R being any resonant/ 
glide), the zero grade was *R/R (i.e., *ey - *oy - *ey - *5y - *y/i; *em - *om - *em - 
*om - *m/m, etc.), depending on syllable structure (p. 31). Thus, we have PIE *h,es-ti 
‘it is’ but *h,s-enti ‘they are’ (with *h 1 es-/h,s- alternation). The mles determining the 
appearance of a certain ablaut form were quite complex in PIE. Some ablaut alternations 
are seen even in modem IE languages (e.g., sing - sang - sung in English), but they are 
best preserved in early attested languages. Cf. all five grades in the suffix in these Greek 
forms: 


na-xep\ ‘father!’ (*e-grade) 

a-Jia-iop! ‘orphan (fatherless)!’ (*o-grade) 

Jia-ip-oq ‘of the father’ (zero grade) 


jra-Tip ‘father’ (*<?-grade) 
d-Jid-icop ‘orphan’ (*o-grade) 
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In certain roots, resonants can change their place - cf. *dyew-/deyw- in *dyews (p. 40, 
56) and *deywos (p. 49) - this is called Schwebeablaut ‘floating ablaut’ (cf. Anttila 1 969). 
PIE verbal, nominal, and verbal/nominal roots had a CVC structure (with possible con- 
sonant clusters - p. 52-54), in most of which the ablaut scheme in the -V- part is attested. 

NOUNS 

Late (post-Anatolian) PIE nominals had three genders - masculine, feminine, and 
neuter- preserved in most early IE languages. The feminine gender is possibly a common 
PIE innovation (which is controversial) after the early split of the Anatolian branch, since 
Anatolian exhibits only a distinction of animate/common and inanimate/neuter nouns 
(p. 178). PIE had three numbers - singular, plural, and dual (the latter often considered a 
late addition to the PIE number system). Adjectives and pronouns agreed in number and 
gender with nouns, and verbs agreed with them in number (but cf. the curious case of the 
neuter nominative “plural” - p. 65, 166). Morphologically, adjectives behaved exactly 
like nouns as far as declension goes, and they can be treated together, while pronouns 
had some special case endings and traits. There were eight cases - nominative, genitive, 
dative, accusative, vocative, locative, instrumental, and ablative. 

PIE nouns had the following formation: (prefix +) root (+ suffix) (+ suffix) + ending. 
A noun had to have a root and an ending (even if it was a zero morpheme). Prefixes 
(like *n- ‘non-’), which were rare (p. 156-157), occurred in some prefix derivatives 
(p. 79-80) - these can be interpreted as the first part of a compound, in which case PIE 
would have had no prefixes. Words could have more than one suffix - cf. *pod-s ‘foot’ 
(root + ending), *pot-i-s ‘lord’ (root + suffix + ending), and *suH-n-u-s ‘son’ (root + suf- 
fix + suffix + ending). There were also some nouns with reduplication, e.g., *k w e-k w l-o-s 
‘wheel’. 

Many nouns have special declension suffixes directly preceding the case endings. Thus, 
we speak of different nominal stems: o-stems (*h,ekw-o-s ‘horse’); easterns (*(h 2 )wlh 1 n- 
eh 2 -0 ‘wool’); root nouns (*pod-s, no suffix); /-stems (*h,ew-i-s ‘sheep’); //-stems 
(*suHn-u-s); resonant stems - /-stems (*ph 2 t-er-0 ‘father’), //-stems (*h 2 ekm-d(n)-0 
‘stone’), ///-stems (*d h eg h -o(m)-0 ‘earth’), and /-stems (*h 2 eb-ol-0 ‘apple’); laryngeal 
stems (*pont-eH-s ‘path’); plosive stems (*(h,)nep-5t-s ‘nephew’); and 5-stems (*neb h - 
os-0 ‘cloudy sky’). These declension suffixes (*suHn-u-s) can be distinguished from 
the strictly derivational suffixes (*suH-n-us, cf. the verbal root *suH- ‘give birth’ > Ved. 
sute ‘gives birth’) that form the extended root. It goes without saying that sometimes it 
is impossible to ascertain the exact morphological/derivational shape of a word; e.g., is 
*ponteHs to be analyzed as *pon-teH-s or *pont-eH-s? The declension suffixes are not 
necessarily always the same, cf. the different ablaut shapes in suffixal *-e(n), *-o(n), *-n 
in the n-stems (p. 77). 

O- and e/? 2 -stems are called thematic stems (after the so-called thematic vowel *-e/o-), 
and the other stems athematic. All athematic stems can be called consonantal stems (since 
*i/u can be considered syllabic glides - p. 34). In thematic stems, there was no ablaut 
alternation in the root and endings - they had the same shape in all the cases (the only 
ablaut alternation in thematic stems was the very limited qualitative *-e-/-o- alternation 
of the thematic vowel in o-stems, not unlike athematic 5-stems). For instance, the root 
in *h,ekwos was in the e-grade in all the cases (gen. sg. *h,ekwosyo, dat. sg. *h[ekwoy, 
etc.), and the root in *wlk w os ‘wolf’ was always in the zero grade (gen. sg. *wlk w osyo, 
dat. sg. *wlk w oy, etc.). In athematic stems, all morphemes (root, suffix, ending) could 
potentially (according to their ablaut type) change their ablaut shape (i.e., they could have 
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one ablaut of the root in one case and another one in another case), though this did not 
occur in all forms and types (the s-stems were the most atypical). For instance, the gen. 
sg. of *pont-eH-s was *pnt-H-e/os with a different grade in both the root and (the pre- 
sumed) suffix. The same opposition of thematic/athematic occurs in verbs as well (p. 93). 

Case endings differed slightly depending on the type of declension and gender (o-stems 
differed the most from all other stems, and their specific endings are listed separately 


below): 

singular 

plural 

dual 

nom. 

*-s (m./f.), *-0 (n.) 

*-es (m./f.), *-h, (n.), *-0 (n.) 

*-h,(e) (m./f.), *-ih, (n.) 

voc. 

*-0 

= nom. 

= nom. 

ace. 

*-m (m./f.), *-0 (n.) 

*-ns (m./f.), = nom. (n.) 

= nom. 

gen. 

*-(e/o)s 

*-om 

(?) *-h,s, (?) *-u 

abl. 

= gen. 

= dat. 

= dat. 

dat. 

* 

l 

CD 

'C 

*-b h os/*-mos 

(?) *-b h (y)oh,/*-moh 1 

loc. 

*-i, *-0 

*-su 

(?) *-u 

instr. 

* 

1 

'o' 

J? 1 

*-b h i(s) 

= dat. 


O-stems differed in these endings: neuter nom./acc. sg. *-m; masculine gen. sg. *-syo; 
abl. sg. *-ot; masculine loc. pi. *-isu; masculine loc. pi. *-eys. For athematic endings cf. 
p. 69-70. 

In neuter nouns, nom./acc./voc. were always the same in all numbers. In the dual, only 
nom./acc./voc. can be reconstructed with certainty. The ablative had a special ending 
only in the sg. of o-stems; elsewhere, it was identical to the gen. (sg.) or dat. (pl./du.). In 
some cases, it is possible to speculate on the origin of the endings. The dat. sg. *-ey and 
loc. sg. *-i look like ablaut variants (and perhaps were just that originally in pre-PIE) 
but are distinct and invariant in the last stage of PIE (ablaut variants are found only in 
athematic gen./instr. sg. - p. 68). Ace. pi. *-ns surely derives from agglutinative *-m-s 
(ace. sg. + nom. pi.), and some reconstruct *-ms as the ending, 1 while the endings of the 
dat./loc./instr. pi. and dat./abl./instr. du. are of clear postpositional origin. In Celtic, Italic, 
Indo-Iranian, and Greek one finds endings with *-b h - in the dat./instr. pl./du. (with differ- 
ent variant forms), while Balto-Slavic and Germanic have *-m- here (the exact relation 
of these is unclear). 2 The reconstruction here is rather provisional. Indo-Iranian points 
to the instr. pi. *-b h i-s (parallel to the BS1. instr. pi. *-mi-H-s), while Greek (Mycenaean 
and Homeric) has the instr. pi. (used also in other functions in Greek) <-pi>/-(pi without 
the plural *-s (p. 302) - cf. also the Arm. instr. sg. ending -v < *-b h i (p. 434-435) and 
OCS instr. sg. -mb < *-mi (with *-m-). This could be originally related to the Germanic 
preposition *bi (> Eng. by), originally a postposition (case endings often appear out of 
former postpositions). Indo-Iranian points to proto-variants *-b h y-os for the dat./abl. pi. 
and *-b h y-oh[ for the dat./abl./instr. du., with *-b h i- where other languages have *-b h - (or 
*-m-). Different accounts are possible here (e.g., an original dat. *-mos but instr. pi. 
*-b h i(s), etc.) - provisionally, we reconstruct the dat. pi. as *-b h os/-mos and the dat. du. as 
*-b h (y)oh 1 /-moh 1 . The nom. du. perhaps has its origin in the instr. sg. (Schmalstieg 1998), 
with an ellipsis of the type *wlk w os wlk w oh, ‘wolf with a wolf’ > (*wlk w os) wlk w oh[ ‘two 
wolves’ - cf. the Vedic type Mitra (nom. du.) (. . .) Varuno (nom. sg.) ‘Mitra and Varuna’ 
and the ellipsis in Ved. dvava ‘heaven and earth’ (literary: ‘two heavens’) and the like 
cases in Greek and Slavic. The *-h, in the dual forms *-ih, and *-b h y/moh| is assumed by 
analogy to the nom. du. *-h, . In the gen./loc. du. the *-o-w from o-stems seems to have 
spread to other stems (otherwise, *-u should be reconstructed as an o-stem variant, and 
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*-ow for other stems). In this chapter, we will not reconstruct all dual forms everywhere 
(especially when there are no reflexes of the original forms). 


Symbols (used in tables and lists of forms) 

{form or ending not related to the original PIE one or completely transformed}, [form or 
ending reflecting the PIE one but with a different function or taken from another form in 
the paradigm], \an irregular phonological change, analogy, or a non-transparent addition 
in the form or ending\, |a secondary addition to a form or endingl 

Thematic stems 

O-stems 

O-stems were mostly masculine (like *wlk w os) or neuter (like *yugom ‘yoke’). There 
were also some feminine o-stems like *snusos ‘daughter-in-law’ (p. 36). Masculine and 
neuter nouns had different endings only in the nom./voc. sg. and nom./acc./voc. pl./du. 
All cases had thematic *-o- except the voc. sg., which had *-e and a zero ending. 


TABLE 1.10 IE O-STEMS (SINGULAR) 


sg. 

PIE 

Ved. 

Gr. 

OLat. 

OCS 

Lith. 

Goth. 

nom. 

*-o-s 

-as a 

-oq 

-OS 

-b 

-as 

-s 

voc. 

*-e 

-a 

-8 

-e 

-e 

-e 

-0 

acc. 

*-o-m 

-am 

-OV 

-om 

-b 

-q 

-0 

gen. 

*-o-syo 

-asya 

Horn, -oio 

-osio 


g 

{-is} 

abl. 

*-o-ot 

-ad 

Dor. 071(0 c 

-5d 

[gen. -a] 

[gen. -o] 


dat. 

*-o-ey 

-ay\a \ b 

-CD 

-oi 

-u 

-ui 

-a 

loc. 

instr. 

*-o-y 

*-o-h, 

-e 

RV -a 

OlKOl d 

domI c 

-e 

{-omb} { 

namie d 

-u 

OHG -u 


* The final -s in Old Indie (in this and other forms) is often written as -h (a different sandhi-variant); b cf. 
GAv. -ai ; c ‘whence’ (adv.); d ‘at home’ (adv.) (p. 302); e Class. Lat. ‘at home’ (synchronically gen.); f cf. 
adv. vbcer-a ‘yesterday’; g OPruss. -as (< *-os(y)o?). 


The gen. sg. ending *-osyo (cf. also Lep. -oiso. Arm. -ov, HLuw. -as(s)i), different 
from the usual *-(e/o)s found in other declensions, is usually compared to the pronominal 
ending *-eso (p. 81), though the exact connection is not clear. The Latin and Old Indie 
abl. ending (also CIb. -ud) can be derived from both *-t and *-d (p. 53), but the possible 
connection to Hitt. abl. -az < *-ot-i, Toch. A -s < *-ti (p. 462), and OCS preposition oh, 
‘of’ would point to the original PIE abl. *-ot (though *-od is often reconstructed for PIE). 


TABLE 1.11 IE O-STEMS (PLURAL) 


pi. 

PIE 

Ved. 

Gr. 

OLat. 

OCS 

Lith. 

Goth. 

nom. 

*-o-es 

-as 

{-oi} 

{-oe} 

{- i } 

{-at} 

-OS 

acc. 

*-o-ns 

-\a\n b 

dial, -ovq 

-OS 

-y/-f 

-US 

-ans 
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pl. 

PIE 

Ved. 

Gr. 

OLat. 

OCS 

Lith. 

Goth. 

gen. 

*-o-om 

RV -dm c 

-cov 

-om 

-•b d 


{-e} 

dat. 

*-o-b h os a 

-\e\bh\y\as 



-om-b 

OLith. -am\u\s 

-am 

loc. 

*-o-ysu 

-esu 

dial. -OI|2|\I\ 


-ex-b 

dial, -iesu e 


instr. 

*-o-eys 

-ais 

[dat. -ok;] 

[dat. -ois] 

-y 

-ais 



a BSl./Gmc variant *-mos. Hr. *-b h yos; b sandhi-variant: -ams, cf. GAv. sandhi-form -qs; c only four times 
in the fossilized collocation dev-afi janman- ‘race of gods’; d PS1. *-r>; ' (cf. p. 502). 


Two adjacent vowels contracted: dat. sg. (*-5y < *-o-ey), abl. sg. (*-5t < *-o-ot), 
nom. pi. (*-os < *-o-es), instr. pi. (*-5ys < *-o-eys), and gen. pi. (*-om < *-o-om). 
It is possible that these endings were still (at least as variants or dialectally) uncon- 
tracted in PIE, judging by occasional disyllabic scansion in Indo-Iranian, and perhaps 
by the development of Lith. gen. -o < *-a (p. 41) < BS1. *-a-at < PIE abl. *-o-ot 
(with contraction after PIE, *o > BS1. *a?). In many languages/branches, the original 
ending *-5s (cf. also Osc. nuvlan-us ‘inhabitants of Nola’) was replaced by the pro- 
nominal *-oy (*toy wlk w os ‘those wolves’ => *toy wlk w oy, p. 81). The *-b h -/-m- in 
the dat. (and elsewhere) is problematic (p. 63), but *-os is clear from BS1. *-mos 
(OCS -mb, OLatv. -ms), Lat. -bus (in athematic stems), Celt. *-b h os (CIb. -bos), and 
Ven. -bos (louderobos ‘to the children’), supported by Hr. *-b h -y-os. The loc. *-i-su 
probably has the additional *-i- from the singular (cf. the simple ending *-su in other 
declensions), and *-ey-s in the instr. pi. can tentatively be connected to the Slavic 
conjunction i ‘and’. 


TABLE 1.12 IE O-STEMS (NEUTER) 


neuter 

PIE 

Ved. 

Gr. 

OLat. 

OCS 

Goth. 

Hitt. 

nom./acc./voc. sg. 

*-o-m 

-am 

-ov 

-om 

-\o\ 

-0 

-an 

nom./acc./voc. pl. 

*-e-h 2 

-a 

-\a\ 

-\a\ 

-a 

-a 

-a 


The nom./acc. forms are always the same in neuter nouns. In o-stems, the neuter 
nom./acc. sg. has the same form as the masculine acc. sg. (*-om). This, and the fact 
that nom./acc. are always the same in neuter nouns, undoubtedly derives from the 
pragmatic use of neuter (inanimate) nouns, which are rarely the subjects of transitive 
verbs. For this reason (and others), pre-PIE is considered by some to have been an 
ergative language (cf. 162-164). The ending *-eh, of the nom./acc./voc. pi. is identi- 
cal to the nom. sg. of feminine *eh 2 -stems. That is due to the fact that the “plural” of 
neuter nouns is originally a collective, not a plural per se. That is why in IE languages 
one finds the *-eh 2 - plural forms for masculine nouns as well, cf. Av. regular masculine 
nom. pi. in -a (p. 275) or Lat. nouns like iocus ‘joke’ - pi. both iocT and ioca (sim- 
ilar cases exist in Iranian, Greek, Anatolian, Slavic, etc.). In Greek, Anatolian, and 
Gatha-Avestan, the plural of neuter nouns agrees with verbs in the 3 sg. This is known 
as the id C,8> a ipe/et ‘the animals (pi.) run (sg.)’ rule in Greek, and is an archaic PIE 
feature stemming from the said fact that later neuter nominative plurals were origi- 
nally collective singulars. 
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TABLE 1.13 IE 0-STEMS (DUAL) 


du. 

PIE 

Ved. 

GAv. 

Gr. 

OCS 

Lith. 

nom./acc./voc. (m.) 

*-o-h. 

-a 

-a 

-CO 

-a 

-u 

nom./acc./voc. (n.) 

*-o-yh. 

-e 


{-to} 

-e 


gen. 

(?) *-o-h,s 

[-ayos] 

-\aii\d 

{-ouv} b 

l-u] 


loc. 

*-o-w 

-\ay\o\s\ b 

-\aii\d b 


-u 

(?) pusiau 

dat./abl./instr. 

(?) *-o-b h (y)oh l “ 

-\d\bhya\m\ 

-\5i\biia 

{-ouv} b 

-oma 

-am! -am 


BS1. *-omoh,; b cf. pronominal forms p. 86; c dial, 'in two/half’. 


Old Church Slavic and Vedic point to *-ow in both gen. and loc. (the gen. *-s in Vedic 
is probably secondary), but Avestan has gen. -a < *-as (presumably from *-o-h r s with 
nom. *-h,- + gen. *-s) and loc. -6 < *-aw, which is possibly archaic (Beekes 2011: 217). 
The loc. (and gen.) ending was originally perhaps *-u, thus *-o-u in o-stems, but this 
*-ow was later, possibly already in PIE, generalized in other stems as well. 


Eh 2 -stems 

Eh 2 - stems (which were always feminine) seem to be a post-Anatolian innovation, 
since they do not appear there. In late PIE, these stems are correlated to o-stems 
in adjectives: *newos (m.)/newom (n.) and *neweh, (f.) ‘new’. The post- Anatolian 
emergence of the new feminine gender could have originated from the identification 
of the collective *-eh 2 (and possessive *-ih 2 , cf. Lat. o-stem gen. sg. -f) suffix with the 
accidental *-h, in the originally athematic word for ‘woman’: *g w enh 2 (Ved .jani, OIr. 
be, p. 77, 369). 4 


TABLE 1.14 IE £7/ 2 -STEMS (SINGULAR) 


sg. 

PIE 

Ved. 

Gr. 

Lat. 

OCS 

Lith. 

Goth. 

nom. 

*-eh. 

-a 

-d 

-\d\ 

-a 

-a 

-a 

voc. 

*-e(h 2 ) 

{-e} 

-a a 

= nom. 

-0 

-a 

= nom. 

acc. 

*-eh,-m 

-am 

-av 

-am 

-9 

-q 

-a 

gen. 

*-eh,-es 

-\dy\ds 

-aq 

OLat. -as 


-5s 

-os 

dat. 

*-eh,-ey 

-\dy\ai 

-a 

-ae 

-e 

-ai 

-ai 

loc. 

*-eh,-i 

{-ay am} 


-ae 

-e 

l-ai-W c 


instr. 

*-eh 2 -eh. 

-a 

Kpixpa b 


{~oj<?} 

-\d\ 



Horn. vuu(p-«!; b adv. ‘in secret’; c with a postposition -p in the new adessive. 


All languages except Latin point to *-a (cf. also Umbr./Osc. -o < *-a and Celtic - 
p. 369) in the nom., which is then interpreted as older *-e-h 2 (p. 43), though the laryngeal 
is nowhere attested (*-e- is a thematic vowel). However, the existence of other femi- 
nine endings (*-i-h 2 -(s) and *-u-h 2 -s, p. 67, 74) indicates a laryngeal. Other cases have 
this *-eh 2 - (which has a zero ending in the nom.) + general endings: acc. *-eh 2 m (not 
**-eh 2 m!) > *-am (Stang’s Law - p. 70); gen. *-eh,es > *-ah 2 as > *-aas (p. 43) > *-as; 
dat. *-eh 2 ey > *-ah 2 ay > *-aay > *-ay; loc. *-eh,i > *-ah 2 i > *-ay; instr. *-eh 2 eh, > *-ah 2 ah, 
> *-aa > *-a, etc. Some endings are formally reconstructed on a structural basis in their 
laryngealistic and pre-contraction shape. Greek and Balto-Slavic (cf. also Umbr. voc. 
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tursa! (name of a goddess)) point to *-a in the voc. In laryngeal terms, this is usually 
explained as *-e(h,), with the laryngeal coloring the preceding vowel (*-a(h 2 )) but then 
dropping before a pause without lengthening (cf. also Ved. devi! < *-i(h 2 ) from devT ‘god- 
dess’ < *-ih 2 , see below). 


TABLE 1.15 IE £7/ 2 -STEMS (PLURAL) 


PL 

PIE 

Ved. 

Gr. 

Lat. 

ocs 

Lith. 

Goth. 

nom. 

*-eh,-es 

-as 

{-“!} 

OLat. -as 

i-y-f] 

-OS 

-OS 

acc. 

*-eh,-(n)s 

-as 

dial. -av<; 

-as 

-yi-? 

-as 

-OS 

gen. 

*-eh,-om 

-\dn\dm 

dial, -\dco\v c 

{-arum} c 

-i J 

-H 

-0 

dat. 

*-eh,-b h os a 

-abhyas 

{-“!?} 

{-&} 

-amb 

OLith. -om\u\s 

-om 

loc. 

*-eh,-su 

-dsu 



-a\x\b 

dial, -osu 


instr. 

*-eh,-b h i(s) b 

-abhis 

Myc. -ap h i 


-ami 

-omis 



a BSl./Gmc *-mos, Hr. *-b h yos; b Gr. *-b b i, Ilr. *-b h is, BS1. c cf. pronominal *-som (p. 81); 

d PSlav. *-r>. 


Some IE languages (Vedic, Gothic) show no trace of the original *-n- in the ace. pi., 
so it is possible to assume that this was already a PIE change (at least in some dialects). 
The gen. ending *-eh 2 -om is formally reconstructed, since, if we disregard the secondary 
forms, all we have are a few reflexes pointing to a contracted long *-om (Goth, -o, Lith. 
-if, PS1. *-£). 


TABLE 1.16 IE £7/ 2 -STEMS (DUAL) 


du. 

PIE 

Ved. 

ocs 

Lith. 

nom./acc ./voc. 

*-eh,-ih. 

-e 

-e 

-i c 

gen./loc. 

*-eh 2 -u a 

{-ay os) 

-u 


dat./abl./instr. 

(?)*-eh,-b h yoh 1 b 

-dbhya\m\ 

-ama 

-oml -5m 

a Or analogical *■ 

-eh,-ow (p. 63); b BS1. *-eh,moh,; 

c cf. -ie-ji-dvi in definite adjectives with 

-ie- < *-ayH-. 


The nom./acc. endings point to *-ay (< *-ai < *-a(h 2 )ih,). The e/? 2 -stems have the dual 
*-ih[ (like the neuter - perhaps due to the connection of the feminine sg. and neuter pi., 
p. 65), not *-h,(e) like other feminine stems. 

Two feminine *-ih 2 -types can also be reconstructed (PIE *deyw-ih 2 ‘goddess’ > Ved. 
devi; cf. Lith. pati ‘wife’; PIE *wlk w -ih 2 -s ‘she -wolf’ > Ved. vrkis, p. 236-237); one of 
the types had the apophonic alternation *-ih 2 -/-yeh 2 - in the suffix (Ved. dat. sg. devvai < 
*deyw-yeh,-ey but vrkie < *wlk w -ih 2 -ey). 

Athematic stems 

PIE athematic ablaut declension types 

As already said, in athematic nouns all morphemes (roots, suffixes, endings) could 
exhibit ablaut alternations - for instance, the suffix was in the lengthened grade in *ph 2 ter 
‘father’, in the zero grade in *suHnus ‘son’ and the gen. sg. *ph 2 tre/os, in the full grade 



68 MATE KAPOVIC 


in the gen. sg. *suHnows, while the ending was in the full grade in the gen. sg. *ph 2 tre/os 
but in the zero grade in the gen. sg. *suHnows. PIE had quite a number of complex ablaut 
declension types (ADTs), which are difficult to reconstruct. 

In the case of root alternations, the original ADTs can sometimes, but very rarely, 
be reconstructed from direct reflexes in some language (e.g., *ponteHs - gen. sg. 
*pntHos ‘path’ can be reconstructed from Ved. panthas - pathas). Usually, they have 
to be reconstructed from different reflexes in various languages, where one of the alter- 
nate PIE ablaut grades had usually been generalized - e.g., if one compares Lat. aurora 
(< *ausos-) with Ved. usas (gen. sg. usas), we can conclude that PIE had ablaut variants 
*h 2 ewsds-, *h,us5s-, and *h,us(s)- (by additional analysis in the context of the whole 
PIE declension system, we come to the conclusion that the original pattern was probably 
nom. sg. *h,ewsos - gen. sg. *h 2 use/os ‘dawn’, p. 51). However, already in PIE, as far as 
can be reconstructed, many stem types (e.g., /-stems, most 7-stems, etc.) did not exhibit 
ablaut alternations in the root (which may or may not be an innovation in comparison to 
the pre-PIE period). The alternations in suffixes (cf. Lat. pater ‘father’ - gen. sg. patris), 
more common in PIE, were also more likely to be preserved, though here one of the types 
usually tends to be generalized in the daughter languages. 

The standard theory of ADTs connects the ablaut types with the supposed PIE accen- 
tual alternations, the usual supposition being that the full (especially *e) grade goes with 
accentedness, while the zero grade should be unaccented. However, there are many prob- 
lems with this standard account, as well as with the reconstruction of PIE accent in gen- 
eral (p. 55-56), and its relation to ablaut patterns. The problem with the standard theory is 
that it usually (at least unconsciously) tries to reconstruct the ablaut types not of the latest 
stage of PIE, but rather of some “original” supposed pre-PIE stage, and thus often oper- 
ates with the supposed, but completely speculative, forms, like **sewHnus or **mentis, 
instead of *suHnus (p. 38) and *mntis (p. 35) (with the accent marked as traditionally 
reconstructed) that are actually attested in the daughter languages but often considered as 
secondary without any real evidence. 

The standard theory usually operates with the following basic and neatly organized 
ablaut/accent types: static (acrostatic/acrodynamic) *nok w ts - gen. sg. *nek w ts ‘night’ 
(immobile root accent); proterodynamic (proterokinetic) *peh 2 wr - gen. sg. *ph,wens 
‘fire’ (accent shifts from root to suffix); amphidynamic (amphikinetic, holokinetic/holo- 
dynamic) *h,ews5s - gen. sg. *h 2 us(s)e/os ‘dawn’ (accent shifts from root to ending); 
hysterodynamic (hysterokinetic) *ph 2 ter- gen. sg. *ph 2 tre/os ‘father’ (accent shifts from 
suffix to ending). However, here we shall use an analysis that is somewhat heterodox 
in comparison to the standard one because it stresses morpheme alternations and not 
the supposed PIE accent shifts, with emphasis on the last stage of PIE (and not some 
supposed pre-PIE “original” stage). In this analysis, the reconstructed ablaut types are 
somewhat different than those in the standard analysis, but the usual terminology is more 
or less preserved. 

In our analysis here, we take it that all PIE morphemes (roots R, suffixes S, endings 
E) in a nominal paradigm were either strong (+), alternating (±), or weak (-). Strong 
morphemes always have the full (or lengthened) grade; alternating morphemes have the 
full (or lengthened) grade in some cases but the zero grade in others; weak morphemes 
are always in the zero grade. E.g., if we look at the roots of the already mentioned words, 
*ne/ok w t- is (+), *h 2 ews-/h 2 us- is (±), and *suHn- is (-). Most endings were invariant 
(whether they were weak like nom. sg. *-s or strong like nom. pi. *-es) - only gen. sg. 
*-es/-os/-s, instr. sg. *-eh 1 /-h 1 , and perhaps nom. du. *-h[/-h,e (cf. OLith. -e for the latter) 
had ablaut variants, i.e., were of the alternating type (thus, whether an ending in an ADT 
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is +, or ± is seen only in these two or three cases, while being irrelevant in others). 
The same morpheme did not have to be the same in all words - e.g., the suffix *-i- was 
weak in *potis and alternating in *mntis (p. 35), etc. Thus, the (strong/altemating/weak) 
nature/valency of a morpheme was paradigmatic, not universal. A similar analysis can be 
applied to athematic verbs as well. 

The following nominal ADTs can be reconstructed for the last stage of PIE: 

1) aerodynamic (fixpoc ‘outermost’, duvapic ‘power’) 

RSE + ± — *me-h,ter-0 - gen. sg. *me-h,tr-s ‘mother’ 

2) proterodynamic (upoTspoq ‘in front’) 

RSE ± ± — *h 3 neh 3 -mn-0 - gen. sg. *h 3 nh 3 -men-s ‘name’ 

3) mesodynamic (p:':ooc ‘middle’) 

RSE — ± — *suHn-u-s - gen. sg. *suHn-ow-s ‘son’ 

4) amphidynamic (aptpi ‘on both sides’) 

RSE +-+ *pot-i-s - gen. sg. *pot-y-e/os ‘master’ (cf. p. 77 for *melit) 

5) hysterodynamic (iioiiipoc ‘coming after’) 

RSE - ± + *p-h 2 ter-0 - gen. sg. *p-h 2 tr-e/os ‘father’ 

6) holodynamic (6Xo q ‘whole’) 

type a) RSE + ± + *h 2 ek-m5(n)-0 - gen. sg. *h,ek-mn-e/os ‘stone’ 
type b) RSE ± ± + *pont-eH-s - gen. sg. *pnt-H-e/os ‘path’ 

7) holostatic (only neuter 5-stems) (otoito:; ‘placed’) 

RSE +++ *neb h -os-0 - gen. sg. *neb h -es-e/os ‘sky’ 

What is relevant in an ADT are not only the types of morphemes but the way they pat- 
tern - e.g., two holodynamic types are put together because they differ only in the num- 
ber of morphemes that alternate (but both have + in the nom. sg. and - in the gen. sg. in 
alternating morphemes), while aerodynamic (+ ± -) and proterodynamic (± ± -) type are 
distinguished because they differ not only in what morphemes alternate, but also in how 
they alternate (cf. the zero suffix in the gen. sg. *me-h,tr-s, but a full suffix in the gen. 
sg. *h 3 nh 3 -men-s). Additional (sub)types may also be potentially added (cf. p. 77), and 
different naming/grouping of the types is always possible. In separate stems (e.g., /?-, 

etc.), one can usually find one to three types. Because of numerous analogies and diver- 
gent later developments, reconstructions are sometimes speculative, and different authors 
have different opinions. Some nouns may have had coexisting different ablaut variants. 
For slightly different ADTs in root nouns, cf. p. 71. 


Athematic case endings 

All athematic stems had mostly the same endings (differing in a number of cases from 
the o-stems), the differences being in the presence of *-i in the loc. sg. and neuter *-h 2 
in the nom./acc./voc. pi. (or synchronic *-0). In the gen. sg. aero-, protero-, and meso- 
dynamic stems had a zero-grade *-s, while other types had a full-grade *-es or *-os. The 
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original distribution of *-es and *-os is not clear (it might have originally depended on 
accent, ADT, stem type, dialect, etc.) since most languages have generalized one of the 
variants - some languages have only *-es (OCS - e ), some have only *-os (Gr. -oq), in 
some we cannot tell the original vowel (OInd. -as), and only rarely does one see both 
variants in one language (Lat. -is < OLat. -es, together with OLat. -us < *-os). The nom. 
pi. *-es is identical in form to the gen. sg. variant *-es but is invariant (as are most case 
endings), unlike the gen. sg. ending (however, cf. the pronominal plural *-s(-) - p. 83). 
In the gen. pi., one can assume an original short *-om on structural grounds (with possi- 
ble reflexes in Celtic, Italic, Anatolian, and Slavic), while most languages show reflexes 
of a long *-om (presumably by early, perhaps partially already PIE, analogy to o- and 
<?/z 2 -stems). 5 


Compensatory lengthening in athematic stems 

A number of compensatory lengthening (CL) processes occurred word-finally in athe- 
matic nouns. The masculine/feminine ending *-s dropped in pre-PIE after a root- or 
suffix-final resonant (*m/n/l/r), *y, and *s, with subsequent CL (Szemerenyi’s Law): 
*d h eg h o(m) < *-om-s (p. 77), *h,ekmo(n) < *-on-s (p. 76), *h,eb5l < *-ol-s (p. 77), *ph 2 ter 
< *-er-s (p. 75), *sek w Hoy < *-oy-s (p. 73), *h,ews5s < *-os-s (p. 79). The same kind of 
lengthening might have occurred in some other cases, like in *kerd > *ker (with possible 
subsequent restoration of *-d, p. 53). This lengthening was morphonological, not a live 
phonological process, in the last stage of PIE as seen from forms like the gen. sg. *dems 
(p. 71) and the acc. pi. ending *-ons (p. 64). This law explains many s-less long-grade 
forms in the nom. sg. of athematic stems. However, the length in forms with the preserved 
*-s, like *pods (p. 15), *g w ows (p. 24), *(h,)nep5ts (p. 39), *-5ws (p. 74), and ptcp. *-5nts 
(p. 104), is not completely clear. The original dropping of *-s, CL, and then analogical 
restoration of *-s is possible but perhaps not convincing (why would it be restored only 
in these forms and not after resonants, *y, and *s as well?). Perhaps it is a consequence of 
morphologization and analogical interplay of several earlier phonological processes like 
Szemerenyi’s Law, early monosyllabic lengthening (p. 54-55) in cases like *pods, and 
perhaps Stang’s Law (see below) in the acc. sg. like *g w om. In some cases, the length can 
be interpreted in more than one way, e.g. *wis ‘poison’ (p. 55) can be a result of either 
monosyllabic lengthening or CL from *wis-s. 

CL occurs with *-h 2 after *n, *r, and *s as well, cf. nom. pi. *h 3 nh,mdn < *-on-h 2 
(p. 77), *k w etwor< *-or-h 2 (p. 89), and nom. pi. *neb h os < *-os-h 2 (p. 79); and with *-i 
after glides, cf. loc. sg. *mntey < *-ey-i (p. 72) and loc. sg. *suHnow < *-ow-i (p. 73). 
CL also occurred when glides and laryngeals dropped before the final *-m (Stang’s 
Law), cf. *dyem < *-ewm (p. 40, 72), *g w om < *-owm (p. 71), and *-am < *-eh 2 -m in 
thematic stems (p. 66). 


Root nouns 

Most athematic nouns have a suffix of some kind between the root and endings and are 
usually disyllabic in the nom. sg. Those that do not have a suffix are monosyllabic in the 
nom. sg. and are called root nouns (the ending is added directly to the root), cf. *pod-s 
‘foot’, gen. sg. *ped-e/os. The root in root nouns can end in a stop (*pod-s), resonant 
(*do(m)-0 ‘home’), diphthong (*dyew-s ‘sky’), cluster (*kerd-0 ‘heart’), etc. Accord- 
ing to the final segment of the root, root nouns can also be considered plosive stems 
(*pod-s), m-stems (*do(m)-0), etc. Root nouns had a tendency to become less frequent 
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or disappear altogether in many IE languages, being replaced with various suffixal deriv- 
atives. In some cases, it is not clear whether certain nouns are indeed root or suffixal 
stems; e.g., is *kwo(n) ‘dog’ to be analyzed as a root noun or as an n-stem *kw-b(n) with 
a vowelless root (p. 41)? Root nouns had the following ablaut types (although they have 
no suffixes, root nouns can tentatively be fitted into the above ADT scheme): 

a) RE + - (aerodynamic) *do(m)-0 - gen. sg. *dem-s ‘home’ 

b) RE ++ (amphidynamic) *pod-s - gen. sg. *ped-e/os ‘foot’ 

c) RE - + (hysterodynamic) *muHs-(s) - gen. sg. *muHs-e/os ‘mouse’ 

d) RE ±+ (holodynamic) *dyew-s - gen. sg. *diw-e/os ‘day sky’ 

Aerodynamic nouns had *o in the nom./acc. and *e in other cases, cf. *do(m) (Gr. Horn. 
8c&, Arm. tun) - gen. sg. *dems (Ved. dan , GAv. dang, Gr. Sso-noiriq ‘master’ < *dems 
potis ‘lord of the house’), or *g w ows ‘cow’ (Ved. gaus, Gr. Pohq) - gen. sg. *g w ews (Ved. 
gos, Av. gaus). For the length in *do(m) or *kwo(n) cf. p. 70. Stang’s Law operates in the 
ace. sg. *g w owm > *g w om (Ved. gam, Dor. ptov, Umbr. bum) - cf. p. 70. The amphidy- 
namic stems had a more complex ablaut pattern: 


TABLE 1.17 IE ROOT NOUNS (SINGULAR) 


sg. 

PIE 

Ved. 

Gr. 

Lat. 

nom. 

*pod-s 

pat 

rtVnAi; 

p\e\s 

ace. 

*pod-m 

pada\m\ 

jto5a 

p\e\dem 

gen. 

*ped-e/os 

padas 

ji\o\5o<; 

pedis 

dat. 

*ped-ey 

pade 

Myc. po-de 

pedd 

loc. 

*ped-i 

padi 

[dat. Jt\o\5i] 

[abl. pede\ 

instr. 

*ped-eh! 

pada 



Cf. p. 70 for the nom. sg. length. 


TABLE 1.18 IE ROOT NOUNS (PLURAL) 

pi. 

PIE 

Ved. 

Gr. 

Lat. 

nom. 

*pod-es 

padas 

7i65ei; 

p\e\d\e\s 

ace. 

*pod-ns 

padas < *padas a 

JtoSai; 

p\e\des 

gen. 

*ped-om 

pad\a\m 

7i\o\6\co\v 

pedum 

dat. 

*ped-b h os 

*padbh|y|as 


ped\i\bus 

loc. 

*ped-su 

patsu 

[dat. 7t\o\CTCT\I\] 


instr. 

*ped-b h i(s) 

padbhis 

Myc. p\o\-pi 



Cf. RV apas ‘waters’ (6x), together with the secondary apas. 


Dual: nom./acc./voc. *pod-h[(e) (Gr. node), dat./instr./abl. *ped-b h yoh, (Ved. pad- 
bhyam). The nom./acc. *pod- (accented in Vedic/Greek) and oblique *ped- stem are indi- 
cated by Ved. pad- (by Brugmann’s Law - p. 40, 206) and pad- respectively (Greek has 
generalized *pod- and Latin *ped-). The usual supposition is that the oblique *ped- is 
secondary for the supposedly original holodynamic *pd- (cf. Gr. snt-pS-a ‘the day after 
the festival’ < *‘following the trace’, Ved. upa-bd-as ‘trampling, rattle’, Av. fra-bd-a- 
‘ front foot’). 
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For holodynamic stems cf. *dyews (Ved. dvaus, Gr. Zeu^ ‘Zeus’), gen. sg. *diwe/os 
(Ved. divas, Gr. Aioq), ace. sg. *dyem with Stang’s Law (Ved. dyam, Gr. Zr)v, Lat. diem 
‘day’), voc. sg. *dyew! (Gr. Zev) or *Kwo(n) (Ved. sva, Gr. tcucov, Lith. suo) - gen. sg. 
* kune/os (p. 41, 56). Holodynamic stems show an *e or *o full grade in the nom./acc. and 
the zero grade in the oblique cases. 


I- stems 

In the /-stems (as in the w-stems), the root was unchangeable (either full as in *h 3 ewis 
‘sheep’ or zero as in *mntis ‘mind’), while the suffix could be in various ablaut grades 
(*-i-/-ey-/-oy-/-ey-). The /-stems had two ablaut types, mesodynamic and amphidynamic, 
of which the former generally prevailed in later IE languages. The mesodynamic /-stem 
(m./f.) paradigm in IE: 


TABLE 1.19 IE /-STEMS (SINGULAR) 


sg. 

PIE 

Ved. 

Gr. 

Lat. 

OCS 

Lith. 

Goth. 

nom. 

*-i-s 

-is 

-t? 

-is 

-b 

-is 

-s 

acc. 

*-i-m 

-im 

-tv 

-em 

-b 

-/ 

-0 

voc. 

*-ey 

-e 

{-1} 

[-M] 

-i 

-ie 


gen. 

*-ey-s 

-es 

Horn. {-r( 0 (g} 

{-«}* 

-i 

-ies 

AaMs b 

dat. 

*-ey-ey 

-dye 


-l 

-i 

{-iai} 

{-ai} 

loc. 

*-ey 

-a 

dial, -pi 


-i 

{-yje} 


instr. 

*-i-h, 

-T 



{-y'p} 

{-imi} 



Cf. Osc. original (and generalized) -eis; b cf. original OHG (f.) -i < PGmc *-iz < PIE *-eys. 


For the invariant zero grade of the root cf. also *ng <w) nis ‘fire’, *k lv rmis/*wnnis ‘worm’ 
(p. 35), *mrtis ‘death’ (p. 35), and other nouns in *-tis, etc. (presumably, only the /-stems 
with the zero-grade root originally had this type of endings). In the gen. sg., the zero- 
grade suffix was followed by a full-grade ending and vice versa (gen. sg. *-y-e/os but 
*-ey-s). Goth. gen. sg. (f.) -ais and ONor. -ar are probably secondary (by analogy to 
M-stem *-auz < *-ows), and there is no need to reconstruct a PIE gen. sg. variant *-oys 
(OCS and OHG point to *-eys). The loc. sg. *-ey is to be explained through a (pre-)PIE 
process of *-ey-i > *-ey (p. 70). 


TABLE 1.20 IE /-STEMS (PLURAL) 


PL 

PIE 

Ved. 

Gr. 

Lat. 

OCS 

Lith. 

Goth. 

nom. 

*-ey-es " 

-ayas 

-Ei <; 

-es 

-bje 

{->»} 

-eis 

acc. 

*-i-ns 

-\is\ 

dial, -xv q 

-Is 

-i 

-is 

-ins 

gen. 

*-ey-om 

{-mam} 

-e\co\v 

-\i\um 

-bi c 


-e 

dat. 

*-i-b h /mos 

-ibh\y\as 


-ibus 

-bmb 

-ims 

-im 

loc. 

*-i-su 

-isu 

[dial. dat. -i o|\i\] 


-bXb 

-\y\s{e} 


instr. 

*-i-b h i(s) b 

-ibhis 



-bmi 

-imis 



Cf. also the Hitt, generalized nom. pi. -es < *-eyes; b BS1. *-miHs; c *-tjb. 
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The full grade (*-ey-) of the suffix is found in the gen./dat./loc./voc. sg. and nom./gen. 
pi. - other cases have the zero grade (*-i-). Dual forms are nom./acc./voc. *-i-h, (Ved. 
-T, OCS -/), dat./instr./abl. *-i-b h y/ m ohi (Ved. -ibhya\m\, OCS -bma), gen./loc. *-ey-ow 
< *-ey-u? (p. 63) (OCS kost-bju is likely archaic, since it is different from the other dual 
forms, while Ved. gen./loc. du. hcuyos ‘tawny’ < *-y-ow-s is probably analogical to them, 
though OCS form could also theoretically be due to analogy to the gen. pi.). The neuter 
forms differed in the nom./acc./voc. only: sg. *-i (Ved. vari ‘water’, Lat. mare ‘sea’, Hitt. 
tuppi ‘clay tablet’) and pi. *-ih, (Ved. adj. sucT ‘bright’, Hitt, armizzi ‘bridges’). 

The scarcely attested amphidynamic paradigm had a weak suffix (*-i-/-y-) in all cases, 
cf. gen. *pot-y-e/os ‘master’ (Gr. noaioq ‘husband’), *h,ew-y-e/os ‘sheep’ (Ved. a\yas, 
Gr. Ion. oioq), dat. *-y-ey (Ved. patve ‘husband’), loc. *-y-i (Gr. dat. oil), voc. sg. *-i 
(Gr. ;t6gi!), nom. *-y-es (Ved. adj. atyas ‘loyal’, Gr. Horn, oieq), gen. pi. *-y-om (Gr. 
oicov). One could assume that originally all /-stems with a full-grade root - like *g h ostis 
‘guest’ (p. 40), *klownis ‘hip’ (p. 51), *h 3 eg wh is ‘snake’ (Ved. ahis, Gr. otptq ‘serpent’), 
etc. - were amphidynamic (i.e., that the ablaut of the suffix was connected to the ablaut 
of the root), even when we have no direct attestation for that (unlike the case in *h 3 ewis, 
*potis), though aerodynamic + ± — stems like **g h ost-ey-s (perhaps later and analogical) 
are not unimaginable. A holodynamic + ± + type can also be reconstructed, cf. *-5y < 
*-oy-s (Ved. sakha ‘friend’, Gr. ij/cb ‘echo’, Hitt, zahhais ‘battle’), ace. sg. *-oy-m (Ved. 
sakhayam, Hitt, zahhais), voc. sg. *-oy (Gr. Saropot!), gen. sg. *-y-e/os (Ved. sakhyur 
with secondary -ur, Hitt, zahhiyas), etc. For *-ih 2 (s) f. stems see p. 67. 


U-stems 

The u-stems were mostly parallel to the /-stems concerning ADTs. The root was invariant 
in masculine/feminine stems but not in all neuter stems (cf. *dor-u - gen. sg. *dr-ews 
below). The mesodynamic (m./f.) paradigm prevailed in later languages: 


TABLE 1.21 IE {/-STEMS (SINGULAR) 


sg. 

PIE 

Ved. 

Gr. 

Lat. 

OCS 

Lith. 

Goth. 

nom. 

*-u-s 

-US 


-US 

-b 

-US 

-US 

acc. 

*-u-m 

-um 

-t)V 

-um 

-b 

-H 

-u 

voc. 

*-ow 

-0 


{-us} 

-U 

-au 

-au 

gen. 

*-ow-s 

-os a 

Horn. {-E05} 

-US c 

-U 

-aus 

-aus 

dat. 

*-cw-cy 

-ave 

{-El} 

-\u\T 

-ovi 

{-»/} 

-\a\u d 

loc. 

*-ow 

-au b 



-U 

{-We} 


instr. 

*-u-h, 

GAv. -u 




{-uml} 



a GAv. -3us \ b mdn-au 'man’; c OLat. senat-ovs ‘of the Senate’, Osc. castr-ous (cf. Lat. castrum ‘fort’); d 
early runic/early OHG -iu < PGmc *-iwi < PIE *-ewey. 


For the invariant zero grade of the root cf. also *prtus (Av. pai^tu- ‘crossing’, Lat. 
portus ‘harbor’, OEng. ford), *tnh,us ‘thin’ (p. 16), *g w rh 2 us ‘heavy’ (p. 39) and other 
adjectives, etc. Loc. *-5w derives from pre-PIE *-ow-i (like /-stem loc. sg. *-ey above). 
Loc. *-ew is also reconstructable (cf. *-ey in /-stems), but the non-palatal -u (i.e., not -ju) 
in OCS svnu is easier to explain from *-5w (cf. the *-eys : *-ows difference between /- 
and M-stems in the gen. sg. as well). 
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TABLE 1.22 IE {/-STEMS (PLURAL) 


PL 

PIE 

Ved. 

Gr. 

Lat. 

OCS 

Lith. 

Goth. 

nom. 

*-ew-es 

-avas 

dial, -esq 

{-ils} 

-ove 

{-“•?} c 

-jus 

acc. 

*-u-ns 

-\u\n 

dial, -nvq 

-us 

-y 

-US 

-uns 

gen. 

*-ew-om 

{-unam} 

-e\co\v 

-uum 

-OVb 

H] 

-iw{e} 

dat. 

*-u-b h /mos 

-ubh\y\as 


-ibus 

--bmb 

-urns 

-um 

loc. 

*-u-su 

-usu 

b 


-bXb 

dial. -us\e\ 


instr. 

*-u-b h i(s) a 

-ubhis 



-bmi 

-umls 



BS1. *-miHs; b dat. pi. 8dKp-u|c|\i\ ‘tears’; c dial, -aus < *-awes < PIE *-ewes. 


The full grade (*-ew-/-ow-) of the suffix is found in the gen./dat./loc./voc. sg. and 
nom./gen. pi. (the *e-/o-grade distribution is different from in /-stems) - other cases have 
the zero grade (*-u-). Dual forms are nom./acc./voc. *-u-h, (Ved. -u, OCS -y, Lith. -u), 
dat./instr./abl. *-u-b h y/ m ohi (Ved. -ubhyd\m\, OCS -bina), gen./loc. *-ew-ow < *-ew-u? 
(p. 63) (cf. OCS synovu < *-ew-ow but probably secondary Ved. bahvos ‘arms’ < *-w-ow-s, 
like in /-stems, p. 73). Neuters had a nom./acc./voc. sg. *-u (Ved. madhu ‘sweetness’, Gr. 
pefiu ‘wine’, Goth .faihu ‘cattle’, Hitt, genu ‘knee’), nom./acc./voc. pi. *-u-h 2 (Ved. vd.su 
‘wealths’, Hitt, assii ‘goods’), and nom./acc./voc. du. *-w-ih[ (Ved. adj. urvi ‘wide’). 

The vestiges of the amphidynamic type (with weak suffix *-u-/-w- in all cases) are 
often attested in neuters (e.g., *med h u ‘honey’, *peku ‘cattle’); gen. *-w-e/os (Ved. 
madhvas, kraU’as (m.) ‘ability’, Gr. Myc. me-tu-wo, older Lat. senatuos ‘of the Sen- 
ate’), dat. *-w-ey ‘cattle’ (Ved. pasve, Goth, maim (m.) < *manw- ‘man’), loc. sg. 
*-w-i (Gr. Horn. dat. yor>vi < *yovpi ‘knee’), nom. *-w-es (YAv. pasuuo), and gen. pi. 
*-w-om (YAv. pasuuqm). It can be perhaps be assumed that all u- stems with full-grade 
roots were originally amphidynamic (even in cases where there is no direct evidence 
for that), e.g. *genus ‘jaw’ (Ved. hanus, Gr. yevvq) - gen. sg. *genwe/os (Goth, kinnus 
‘cheek’ with generalized -nn- < *-nw-), adj. *h,oh,kus ‘fast’ (Ved. asus, Gr. ducrx;), etc. 
Cf. also the proterodynamic neuter *dor-u-0 ‘tree’ (Ved. daru , Gr. 5opn, Hitt, taru) - 
gen. sg. *dr-ew-s (Ved. dms, YAv. draos), though this might not be the oldest/only 
ablaut type; cf. the ablaut variant *derw- in OCS drevo < *dervo ‘tree’, Lith. derva 
‘resin, tar’. The original ablaut pattern of *gen-u- (Lat. genii, Hitt, genu)/* gon-u- (Ved. 
janu, Gr. yovu)/*gn-ew- (Hitt, ganu-, Goth, thematized kniu) ‘knee’ is controversial 
(see above for Gr. dat. yonvi). There is some evidence of a holodynamic type with 
a nom. sg. in *-e/ows (m./f.): Av. hiOaus ‘associate’, Gr. unrenq ‘horseman’, jraxpox; 
‘paternal uncle’, Hitt, harnaus ‘birthing chair’. 

The suffix *-u- is found in a feminine motion ending *-u-h,-s as well, cf. *swekuros 
‘father-in-law’ (Ved. svasuras, Gr. eicupoq, Lat. socer, Lith. sesuras) and *swekruh,s 
‘mother-in-law’ (Ved. svasrus, OCS svekrv), declined like a nonnal consonant amphidy- 
namic stem (gen. sg. *swekruh,e/os > OCS sveknve, cf. Ved. tanuas ‘body’) and root 
nouns with roots ending in *-H- like *h,b h ruHs ‘eye-brow’ (p. 48) or *suHs ‘swine’ (p. 38). 


R-s terns 

The ; -stems are dominantly masculine/feminine nouns with an invariant root. Many 
of them belong to nouns denoting family relations - *ph 2 ter ‘father’ (p. 46), *meh 2 ter 
‘mother’ (p. 45), *d h ugh 2 ter ‘daughter’ (p. 47), *b h reh 2 ter ‘brother’ (p. 18), *swesor ‘sis- 
ter’ (p. 32), *deh 2 iwer ‘husband’s brother’ (Ved. devar-, Lith. dievens, OCS deverb). 
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*genh 1 tor ‘begetter’ (p. 46), etc. Nomen agentis *-ter/*-tor nouns were also r-s terns (cf. 
PIE *dh 3 ter > Gr. 5oxr)p but *deh,tor > Gr. hcoxcop, Ved. data ; Lat. dator, gen. sg. -oris is a 
mix of the two types - all ‘giver’). Some monosyllabic words ending in *-r# (like *h 2 ster 
‘star’, *h,ner ‘man’) can be treated as root nouns - in such cases it is sometimes impos- 
sible to tell if a segment was originally part of the suffix or the root. One can reconstruct 
two basic ablaut types for most r-s terns - hysterodynamic (H) and aerodynamic (A) - that 
differed only in a few cases: 


TABLE 1.23 IER-STEMS (SINGULAR) 


sg. 

PIE 

Ved. 

Gr. 

Lat. 

ocs 

Lith. 

Goth. 

nom. 

*-er 

-a 

-r|p/-f|p 

-er 

-i 

-e/-e 

-ar 

acc. 

*-er-m 

-ara\m\ 

-spa 

-\r\em 

-erb 

-erf 

-ar 

voc. 

*-er 

-ar 

-Ep 

-er 

N] 

{-erie} 

[-ar] 

(A) gen. 

*-r-s 

-ur a 





d 

(H) gen. 

*-r-e/os 

b 

-p6? 

-ris 

-|e|re 

-\e\fs c 

-rs 

dat. 

*-r-ey 

-re 


-ri 

-\e\ri 

{-eriai} 

-r 

(A) loc. 

*-r-i 


[dat. -pi] 


{-ere} 



(H) loc. 

*-er-i 

-ari 




{-eryje} 


(A) instr. 

(?) *-r-h, 




{ -erbjg} 

{-erimi} 


(H) instr. 

*-r-eh, 

-ra 






a Av. n-dr’s ‘man’; 

1 b Av. braQ-ri 5 < * 

-rah ‘brother’ 

; c OLith./dial. -eres; 

d ONor. fgd-ur, OEng. fad-ur. 


fead-ur < PGmc *-urz. 

The length in the nom. sg. is due to Szemerenyi’s Law (p. 70). The vowel *e is more 
frequent in the nom. sg. than *o. 


TABLE 1.24 IE /{-STEMS (PLURAL) 


pi. 

PIE 

Ved. 

Gr. 

Lat. 

ocs 

Lith. 

nom. 

*-er-es 

-aras 

-epE? 

{-res} 

-ere 

-ers c 

(A) acc. 

*-r-ns 

-\rn \ a 





(H) acc. 

*-er-ns 


-spaq 

-V\es 

-eri 

-eris 

gen. 

*-r-om 

{-rnam] h 

-p\to\v 

-rum 

-\e\n> 

{-eriq} 

dat. 

*-r-b h /mos 

-rbh\y\as 


-r\i\bus 

{-erbmb} 

{-erims} 

loc. 

*-r-su 

-rsu 

[dat. -pao\t\] 


{-erbXb} 

{-eryse} 

instr. 

*-r-b h i(s) 

-rbhis 



{-erbmi} 

{-erimis} 

a GAv. mat-drqs < 

*-r-ns (p. 31); 

b cf. Ved. svas-ram 

‘sisters’, na-ram 

‘men’; c 

OLith./dial. - 

eres. 


The PIE ablaut pattern of the suffix is reconstructed mostly from the overlap of Vedic 
and Greek patterns, which look original (the ablaut is mostly generalized in other lan- 
guages). The acro-/hysterodynamic opposition is seen in different attested endings in 
the daughter languages in the gen. sg., in the difference between Vedic and Greek in 
the loc. sg., in structural analogy to the gen. sg. in the loc. sg., and in the difference 
between Vedic and Greek in the ace. pi. From the reflexes, it is impossible to deduce 
which nouns had which type. For structural reasons, one can assume that the derivatives 
with an invariant full-grade root (*meh,ter, *b h reh,ter, *swesor, *deh,iwer, *genh,tor, 
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etc.) were aerodynamic (full-grade root + zero-grade ending in gen./instr. sg.), while the 
nouns with an invariant zero-grade root (*ph,ter, *d h ugh,ter, etc.) were hysterodynamic 
(zero-grade root + full-grade ending). An exceptional proterodynamic type with ablaut- 
ing root can tentatively be reconstructed for ‘husband’s brother’s wife’: *yen-h 2 ter (Gr. 
£vcuT|p, OlAtYi. jente) - gen. sg. *yn-h 2 ter-s (Skr. yatar-). 


N-stems 

Masculine/feminine n-stems point to a holodynamic + ± + type with a complex suffixal 
ablaut pattern: 


TABLE 1.25 IE TV-STEMS (SINGULAR) 


sg- 

PIE 

Ved. 

Gr. 

Lat. 

OCS 

Lith. 

Goth. 

Hitt. 

nom. 

*-o(n) 

-a 

-COV 

-0 

-\y\ 

-uo 

-\<A f 

-a\s\ 

acc. 

*-en-m * 

-\d\na\m\ 

-\o\va 

-inem 

-enb 

-enj 

-\a\n 

-\a\nan 

voc. 

*-on 

-an 

-ov c 

l-d] 

[-V] 

{-enie} 



gen. 

*-n-e/os 

-nas 

-|°|VO? 

-\i\nis 

-\e\ne 

-\e\ns e 

-\i\ns 

-\a\nas 

dat. 

*-n-ey 

-ne b 


-\i\nT 

-\e\ni 

{-eniui} 

{-in} 


loc. 

*-en 

-an 

d 


-en\e\ 

{-enyje} 


g 

instr. 

*-n-eh. 

-nd 



{-enbmb} 

{-enimi} 




a With a younger variant *-on-m; b mahim-ne ‘greatness’; c Scug-ov! ‘god’; d adv. aiEV ‘always’ < *h,eywen 
(cf. Gr. aitbv ‘age’ < *h,eywo(n)); ' older -enes ; f cf. -o (f./n.); 8 loc. sg. (n.) laman ‘name’. 


The length in *-o(n) < *-on-s is due to Szemerenyi’s Law (p. 70). Some believe that 
the final *-n had dropped already in PIE (i.e., *h 2 ekmo, *kwo ‘dog’ unlike *swesor 
‘sister’), cf. Lat. -or, OIr. -ur < *-or in r-stems but Lat. -o, OIr. -u < *-5(n) in n-stems 
(unlike Gr. -cov, Arm. -un < *-on, and OCS -y < *-on-|s|, which would then have to 
have reintroduced *-n-(s) by analogy later). Perhaps it is best to assume that both 
variants (*-on and *-o) occurred in PIE, just as in *-o(m) in m-stems (p. 77). Besides 
the usual *-o(n), there might have also existed a less frequent *-e(n), cf. the Greek 
jroipfjv ‘herdsman’ (but Lith. piemuo) - see below. As in 5-stems (p. 78), the loc. sg. 
points to a zero-ending (*-en-0). This is different from the i- and n-stems, where *-ey 
and *-ow can be derived from older *-eyi and *-owi and thus do not have an original 
zero-ending. 


TABLE 1.26 IE TV-STEMS (PLURAL) 


pi. 

PIE 

Ved. 

Gr. 

Lat. 

OCS 

Lith. 

Goth. 

Hitt. 

nom. 

*-on-es 

-anas 

-OVEq 

-in\e\s 

-\e\ne 

-\e\ns 

-ans 

-anis 

acc. 

*-en-ns 

-anas 

-\o\vaq 

-ines 

-eni 

-enis 

{-ans} 

-\a\n\u\s h 

gen. 

*-n-om 

-n\a\m a 

-|o|v\co\v 

-\i\num 

-\e\nb 

{-enij} 

-\a\n\e\ 


dat. 

*-n-b h /mos 

-abh\y\as 


{-inibus} 

{-enbmb} 

{-enims} 

{-am} 


loc. 

*-n-su 

-asu 

[dat. -\o\o\t\] 


{-enbxb} 

{-enyse} 



instr. 

*-n-b h i(s) 

-abhis 



{-enbmi} 

{-enimis} 




dur-namnam ‘ill-named’; b argamanus ‘tributes’. 
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The original ablaut of the suffix (the interchanging *-on-/-en-/-n-) is best preserved in 
Vedic, with the other languages usually generalizing one ablaut grade in most forms (e.g., 
*-on- in Greek). Generalized BS1. oblique *-en- (which can hardly stem from the loc. sg. 
*-en alone) and Ved. nom. pi. -anas but ace. pi. -anas could point to the same original 
ablaut variation in the sg. as well (cf. Lith. -uo but -enf). It seems that some nouns had 
an *-en- in all (full-grade) cases, e.g., (hysterodynamic) *uk w s-en ‘ox’ - nom. pi. *uk w s- 
en-es (Ved. uksanas, OEng. exen ‘oxen’). Some possible root nouns (like *kwo(n) ‘dog’) 
can be treated as n-stems as well (‘dog’ can originally be *pkw-5n from *peku ‘cattle’ - 
p. 52). In the neuter n-stems, the word ‘name’ and other derivatives in *-men- point to a 
proterodynamic declension: *h 3 neh 3 -mn (Ved. nama, Lat. nomen, Gr. ov\o\pa) - gen. sg. 
*h 3 nh 3 -men-s (OIr. gen. sg. anmae; GAv. haxmSng < *-anh < *-ens, Goth, -ins < *-en-es 
<= *-en-s; OCS im-, OPrass. emm-<BSl. *inHm-)-nom. pi. *h 3 n(e)h,-mdn (Ved. nama). 
In the neuter nom. pi., one has to suppose pre-PIE *-on-h, > *-on with a compensatory 
lengthening (p. 70). For the *-mn (gen. sg. *-men-s) derivatives cf. also Ved. kar-ma ‘act’ 
(=> Eng. karma), Gr. Os-pa (=> Eng. theme), Lat. se-men ‘seed’ (=> Eng. semen), etc. 

M-, laryngeal, and plosive stems 

Unlike the frequent r- and n-stems, other resonant stems were less frequent. For w-stems, 
cf. the root noun *dd(m) (p. 7 1) and the holodynamic (type ±±+) *d h eg h -5(m) ‘earth’ (Hitt. 
tekan, cf. p. 53) - gen. sg. *d h g h -m-e/os (Hitt, taknas with an analogical -n-; Gr. %S-cbv has the 
gen. root and the nom. suffix) and *g h ey-o(m) ‘winter’ (Gr. jprov ‘snow’, Arm .jiwn ‘snow’, 
YAv. ziia all with secondary *g h i-, cf. OCS zim-a, Gr. %sip-(bv for *g h ey-m-) - ace. sg. 
*g h y-em-m (Lat. Idem-) - gen. sg. *g h i-m-e/os (GAv. gen. sg. zimo) (p. 22). Like in n-stems 
(p. 76), the final *-m was unstable after a long vowel in the nom. sg. (cf. Horn. Sco ‘house’ 
< PIE *dd(m) but otherwise -cov; Lat. homo ‘man’ but Hitt, tekan; Arm. jiwn, etc.). 6 In 
/-stems, one finds just a (potentially root noun) *seh,l-s ‘salt’ with a controversial vocalism/ 
ablaut pattern (p. 42) and (dialectal?) *h,ebdl ‘apple’ (Lith. obuol-as) (p. 16). 

Of the rare laryngeal stems, the holodynamic (±±+) word for ‘path’ is easily reconstruc- 
table (with Vedic preserving the original ablaut pattern): *pont-e/oH-s (Ved. panthas, cf. 
generalized *pont- in Gr. novToq ‘sea’, Lat. pont- ‘bridge’, OCS pQtb) - gen. sg. *pnt- 
H-e/os (Ved. pathas, cf. generalized *pnt- in Gr. hoito:;, OPrass. pintis). An early protero- 
dynamic laryngeal stem, transformed in most descendant languages to an e/? 2 -stem (p. 24, 
66), was *g w en-h, ‘woman’ (OIr. be, Ved .jani-) - gen. sg. *g w n-eh 2 -s (OIr. gen. sg. mnci, 
Ved. nom. sg. gnas ‘divine woman’, cf. gnas-pati- ‘husband of a divine woman’ for the 
gen. sg.). Cf. also the adjective *meg-h 2 -s (p. 53, 79-80). 

Stems ending in a stop were rare (cf. the root noun *pod-s - p. 15, 60, 70-71). The 
masculine word *(h 2 )nepots ‘grandson, nephew’ (Ved. napat, Lat. nepos - p. 39) is widely 
attested, though it is unclear whether it was aerodynamic (gen. sg. *(h,)nept-s > Ved. nap- 
tur with secondary *-rs from r-stems) or +±+ holodynamic (gen. sg. *(h,)nep-t-e/os > 
YAv. napto). The neuter word *mel-it ‘honey’ (Hitt, milit, Gr. pe/a, Goth. ace. sg. milip) 
with the *-it- suffix (cf. Hitt, seppitt- ‘a kind of grain’) perhaps had a special ±— + amphi- 
dynamic subtype (Hitt, dat./loc. sg. malitti < *ml-it-ey, cf. CLuw. generalized mallit-). 

Heteroclitic stems 

Heteroclites were an ancient class of PIE neuter nouns with suppletive suffixes (one vari- 
ant in the nom./acc., the other in the oblique cases). Though sometimes difficult to recon- 
struct exactly (with many languages often showing only isolated relics), their number is 
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not so small in total. The most common suppletion type was *-r-/-n-: cf. Ved. as-r-k- gen. 
sg. as-n-as, Hitt, esh-ar - gen. sg. isha-n-as ‘blood’, Hitt, wat-ar - gen. sg. wit-en-as 
‘water’, etc. The suppletion *-l-/-n- is attested in the word ‘sun’ (GAv. huuarS - gen. sg. 
x'Sng < *sh,wens), and there may have been some other, less frequent and more contro- 
versial, types of suppletion (cf. p. 429, 437). Some languages in some cases preserve the 
actual suppletion as in PIE: Hittite abundantly (cf. Hitt, pahhu-r - gen. sg. pahhu-en-as 
‘fire’, p. 15), often Indo-Iranian (cf. Ved. yak-r-t - gen. sg. yak-n-as ‘liver’), sometimes 
Greek (rjn-ap - gen. sg. fjn-a-xoq ‘liver’ with -a- < *-n-), and Latin ( iec-ur - gen. sg. ioc- 
in-eris ‘liver’, p. 33). Most descendant languages usually generalize one of the variants 
(e.g., ONor. vatn but OEng. wceter ‘water’; Lat. sol, Goth, sauil but Goth, also sunno 
‘sun’; Goth./ow but OEng.^r ‘fire’; CS ikra ‘roe’ but OLith . jeknos ‘liver’). PIE nom. 
sg. forms are usually reconstructable (*h,esh 2 r ‘blood’, *wodr and coll. *wedor ‘water’, 
*peh,wr ‘fire’, *yek w r ‘liver’), though not always (cf. *seh,-w-l- ‘sun’ with a problematic 
nom. sg. ablaut pattern, often reconstructed as *seh,w5l - p. 32), while the reconstruction 
of oblique cases and ADTs is often more speculative. 


S-stems 

Numerous neuter 5-stems (with the *-e/os- suffix) had a unique holostatic ADT with all 
full grades (+++), like the thematic stems (p. 62) and unlike other athematic stems. Neu- 
ter 5-stems are considered athematic only because they have no thematic vowel, but their 
characteristics, such as the invariant root (though this occurs in other athematic stems as 
well) and only qualitative (*-e-/-o-) ablaut alternation of the suffix, remind one more of 
the thematic o-stems than of the other athematic stems: 


TABLE 1.27 IE 5-STEMS (SINGULAR) 


Sg- 

PIE 

Ved. 

Gr. (Horn.) 

Lat. 

OCS 

Hitt. 

nom./acc./voc. 

*-os 

-as 

-oq 

-US 

-VA 

-\As 

gen. 

*-es-e/os 

-asas 

-so<; 

-eris 

-ese 

-isas 

dat. 

*-es-ey 

-ase 

b 

-en 

-esi 


loc. 

*-es a 

-as\i\ 

[dat. -s ii|] 

[abl. -er\e ] 

-es\e\ 

-is 

instr. 

*-es-eh, 

-as a 



{-esbmb} 



Younger analogical form: *-es-i; b Myc. we-te-e ‘year’. 


The original loc. sg. has *-0 (cf. p. 76 for n-stems). 


TABLE 1.28 IE 5-STEMS (PLURAL) 


pi. 

PIE 

Ved. 

Gr. (Horn.) 

Lat. 

OCS 

nom./acc./voc. 

*-os 

-a\m\s\i\ 

{-£Ct} 

{-era} 

{-esa} 

gen. 

*-es-om a 

-as\a\m 

-eVd\v 

-erum 

-esb 

dat. 

*-es-b b /mos 

-obh\y\as b 


-er\i\bus 

{-esbmb} 

loc. 

*-e(s)-su 

-a(s)su 

[dat. -£a(o)\t\] 


{-esbxb} 

instr. 

*-es-b h i(s) 

-obhis c 

d 


{-esy} 


Cf. Hitt, nepisan; b < *-asbhyas; c < *-asbhis; d Myc. pa-we-pi ‘pieces of cloth'. 
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The original nom. pi. *-5s (from older *-os-h 2 , p. 70), which cannot be explained as 
secondary, can be seen regularly in Avestan (cf. raoca Tights’ < *-ah), contaminated in 
Ved. -amsi, and indirectly perhaps in OEng. lombor ‘lambs’ (< PGmc *-oz-a). Reflexes 
of the nom. pi. *-es-eh 2 etc. are secondary. The distribution of suffixal *-os-/-es- reminds 
of the *pod-/ped- distribution in the root nouns (p. 71). 

Masculine/feminine (holostatic) derivatives in *-es (derived from neuter 5-stems) 
also existed - cf. PIE *h,su-men-es ‘well-disposed’ (Ved. sumanas, Gr. eupevrn;) from 
*menos - gen. sg. *menese/os (Ved. manas ‘mind’, Gr. pevoq ‘spirit’), Greek names 
ending in -KArjq (like Sophocles), or Lat. Ceres (goddess). A holodynamic (±±+) feminine 
noun can be reconstructed: *h 2 ews-5s-0 (< *-os-s, p. 70) ‘dawn’ (Gr. Aeol. aura q, Lat. 
generalized auror-; Ved. usds and GAv. usa with analogical u-) - gen. sg. *h,us-s-e/os 
(Ved. usas); cf. also Lat. bonds ‘honor’ - gen. sg. honoris with generalized -os- for the 
type. Note also the perf. ptcp. in *-wos- (p. 104). Other, shorter and infrequent, 5-stem 
types can perhaps be reconstructed as well, cf. Ved. aksi < *h 2 ek w -s-0 (neuter) with OCS 
oko - gen. sg. ocese ‘eye’ (like nebo). There are also root nouns ending in (originally 
suffixal?) *-s, e.g., *muHs (p. 38, 71). 

ADJECTIVES 

PIE adjectives were morphologically identical to nouns. However, the number of possible 
stems in adjectives was much smaller. For the most numerous o-stems cf. *new-o-s (m.) - 
*new-e-h 2 (f.) - *new-o-m (n.) ‘new’ (Ved. ndvas - nava - navam, Gr. veoq - ved - veov, 
Lat. nouns - noua - nouum, OCS novb - nova - novo, Lith. nau\j\as - nau\j\a, p. 32). For 
the frequent «-stems cf. *sweh,dus (m.)- *sweh 2 dwih 2 (f.) - *sweh 2 du (n.) ‘sweet’ (Ved. 
svadus - svadvT - svadu, Gr. f]8u<; - f)8|e|ia - f|8u; cf. Lith. saldiis - saAdh ‘sweet’; Goth. 
kaurus ‘heavy’). Note that the *-us form is used for the masculine only, unlike in nouns 
(p. 73). For the apparently not-so-frequent /-stems cf. *semh,l-is (m./f.) - *semh,l-i (n.) 
‘similar’ > Lat. similis (m./f.) - simile (n.), OIr. samail ‘likeness’ (f.) (cf. also Hitt, palh-i-s 
‘wide’). Original root adjectives may perhaps be reconstructed when dealing with differ- 
ent suffixes in later languages, i.e., *nog w -s ‘naked’ (cf. Ved. nagnas, Gr. yupvoq, Lith. 
nuogas, OCS nagu, Goth, naqaps, Hitt, nekumanza). A ±±+ holodynamic laryngeal stem 
can be reconstructed for *meg-h,-s (m./f.) - *meg-h 2 (n.) ‘big’ (Gr. psyaq (m.) - peya (n.), 
Ved. mdhi (n.), Hitt, mek (n.)), gen. sg. *mg-h,-e/os (Gr. Horn, aya- ‘greatly-’), ace. sg. 
*mg-eh,-m (Aeol./Dor. adv. ayav ‘very much’). Cf. p. 79 for 5-stem adjectives in *-es. 
A heteroclitic adjective can be reconstructed as well: *piHwo(n) ‘fat’ (Ved. pivan-, Gr. 
;urav) - f. *piHwerih 2 (Ved. pTvarT, Gr. jusipa). As can be seen, some adjectives had sepa- 
rate masculine/feminine/neuter forms (e.g., all thematic forms), while others had identical 
masculine/feminine forms (as in some nominal stems). There were a number of adjectival 
suffixes, cf. *med h -y-os (p. 31), *h,rud h -r-os (p. 19), *-went- ‘having . . .’ (Ved . putra- 
vant- ‘having a son’ from putras ‘son’), etc. The Caland System is a name for a pattern 
of adjectives in *-ro-, *-mo-, *-u-, *-nt- (often occurring with the same root, cf. Gr. tAacp- 
p-oc, Tight (in weight), nimble, small’, £ka%-v-q ‘small, short’) that have *-i- instead as 
the first element in compounds (cf. Gr. icuS-p-oq ‘glorious’ but Ku5-i-aveipa ‘bringing men 
glory’) and that often go together with neuter 5-stem nouns (Gr. ku5o<; ‘glory’ => Eng. 
kudos) and stative *-eh r verbs (cf. Gr. spufi-p-oq, OCS nd-r-r, ‘red’, and Lat. rub-e-re ‘to 
be red’) formed from the same root. 

A couple of adjectival/adverbial prefixes appearing in compounds can be recon- 
structed: *n- ‘un-’ (Ved. a-, Gr. a-, Lat. in-, Goth, un-, OIr. an-; zero grade of the 
particle *ne ‘no’ - p. 54; cf. the famous PIE collocation *klewos n-d h g wh itom ‘imper- 
ishable fame’ > Ved. sravo . . . aksitam, Gr. Horn. Kktoq acpfinov), *h,su- ‘good’ (p. 47, 
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cf. *h,sumenes - p. 79), *dus- ‘bad’ (Ved. dus-, Gr. 5uo-, Goth, tuz-, cf. Gr. 5uo-|.i£vf|<; 
‘hostile’, OCS dbzdb ‘rain’ < *‘bad day’), *semi- ‘half’ (Gr. r||u.-, Lat. semi-, OEng. 
sam-), etc. 


Comparison of adjectives 

PIE did not have the full-fledged paradigmatic comparison that later and modem IE languages 
have. It apparently had a number of derivation suffixes (similar to Eng. -ish in smallish, 
etc.) that modified the meaning of adjectives, some of which became regular comparative/ 
superlative suffixes in later languages. The most widespread comparative(-like) suffix was 
*-yos (m./f.) (Lat. senio\r\, arch. Lat. -d\r\, OIr. siniu ‘older’) - *-yos (n) (Lat. melius, OCS 
bolje ‘better’, perhaps also *-is, cf. adverbial Lat. magis ‘more’, Goth, mins ‘less’, OPruss. 
talis ‘farther’) - oblique *-is- (OCS oblique drazb\s\- ‘dearer’, Goth. batiz\a\ ‘better’), and 
probably *-yes- in some cases (Ved. oblique -vas-), with +±+ holodynamic paradigm and 
a full root even from adjectives with a zero root in the positive, cf. Gr. (Ion.) icp£OGCo\v\ 
‘stronger’ < *kret-y5s from Kpaxxx; ‘strong’ < *krt-u-s; Ved. dragh\i\yas- ‘very long; lon- 
ger’ < *dleHg h -yos- from diighas ‘long’ < *dlHg h -os. The zero-grade *-is- also appeared 
in complex superlative suffixes *-is-t(H)o- (Ved. navisthas ‘newest’, Gr. p£kxicrco<;, Goth. 
batists ‘best’; made from a full-grade root and not from the positive, cf. Ved. tej-isth-a- from 
tig-m-a- ‘sharp’), pairing with the comp. *-yos-, and dialectal/innovative *-is-m-Ho- (Lat. 
maximus ‘biggest’, OIr. nessam ‘nearest’). The suffix *-ter-o-, originally used in contrastive 
(positive) meaning in adjectives (Lat. dexter, Gr. 8£^ix£po<; ‘right’, Ved. uttaras ‘upper’) and 
pronouns (*k w oteros ‘which (of two)’ > Ved. kataras, Gr. JioxEpoq, OCS kot\o\ryi ‘which’, 
Goth, fvapar), later yielded the comparative in Greek (ooqxnxEpoq ‘wiser’) and Indo-Iranian 
(Skr. mahattaras ‘greater’). The complex suffix *-tm-H-o- often yielded a superlative (Ved. 
uttamas ‘highest’, Lat. optimus ‘best’, Gaul, name Uertamocorii ‘highest warriors’; in Greek 
*-tm-t-o-, cf. ooqxbxaxoq ‘wisest’), pairing with the comp. *-tero-. It is possible that in some 
cases an unmarked PIE adjective might have functioned as a comparative/superlative, cf. 
Hitt, sallavas siunas sallis ‘the greatest (lit. great) of the great gods’, and p. 438 for the 
Armenian comparative. An interesting but not completely clear phenomenon in IE languages 
is that for frequent/basic adjectives (‘good’, ‘bad’, ‘small’, ‘big’) one often finds suppletive 
forms, which usually differ from language to language, cf. for ‘good’ - ‘better’ - ‘best’ Gr. 
dyaSoq - Pekxicov - Pfikxioxoi; (with other variants), Lat. bonus - melior - optimus, OCS 
dobn - bold ( lucii ), Goth, gops - batiza - batists, OIr. maith —ferr - deg. Cognates among 
the mentioned suppletive adjectives are just *bel-/mel- (cf. p. 17) ‘better’ and *min- ‘smaller’ 
(Lat. minor, OCS munii, Goth, minniza). 


ADVERBS 

PIE did not have general, special adverbial suffixes for derivation from nouns and adjec- 
tives (cf. Eng. man-ly, nice-ly). Instead, various (usually singular) nominal and adjectival 
cases (all except the vocative) were used adverbially. However, not a lot can be recon- 
structed directly for PIE since this type of adverb formation was very productive, and 
the forms are widely diverse in IE languages; cf., e.g., the adverbial ‘by night’ expressed 
by the gen. sg. in Gr. vuKxoq, Goth, nahts, the ace. sg. in Ved. naktam, and the instr. 
sg. in OCS nostbjQ ‘by night’. Neuter forms (in the nom./acc. sg.) are generally often 
used adverbially, cf. the neuter adjectival *megh, (p. 79) used as an adverb (Ved. mdhi 
‘greatly’, Gr. piya ‘very much’, ONor. mjgk ‘much, very’, Hitt. mekk\i\ ‘very, greatly’), or 
Ved .puru ‘abundantly’, Gr. jioAu ‘much’, Goth, filu ‘much, very’; OCS nibnogo ‘much’, 
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etc. For the adverbial use of the locative, cf. Lat. temere ‘blindly’ (< *‘in the dark’), OCS 
dole ‘below’, dobre ‘good’, etc.; for the instrumental, cf. Ved. diva ‘by day’, madhya ‘in 
the middle’, Gr. Kpixpfj ‘in secret’ (p. 66), OCS vbcera ‘yesterday’ (p. 64), etc. 

Some apparently non-derived (or not easily analyzable) adverbs can also be reconstructed, 
e.g., *d h g h yes (p. 53) ‘yesterday’ (Ved. hvas, Gr. x&sq, Lat. her\i\, OIr. \in\-de, Alb. dje), orig- 
inally perhaps a loc. sg.; *nu(n) ‘now’ (Ved. nu, Gr. vu/vuv, Lat. nun\c\, OCS ny\ne\ and CS 
nb\ne\, Goth./OIr. nu, Hitt, nu ‘and, but’, \ki\nun ‘now’, etc. -p. 54—55), usually connected 
to the adj. *new-o-s ‘new’, etc. The later verbal augment *h[e (p. 6, 95, 161) was originally 
probably an adverb. A few postpositions (p. 104—105) were originally probably adverbs or 
both postpositions and adverbs, cf. *(s)upo ‘below; under’ (Ved. upa ‘up to, upon, above, 
near’, Gr. imo/uno ‘(from) under’, Lat. sub ‘under, below, up to’, Goth, (//‘under’, Olr.fo 
‘under’) and *(s)uper(i) ‘above; over’ (Ved. upari ‘above’, Gr. rmep/u;rsp ‘over, above’, 
Lat. super ‘over, above’, Goth, ufar ‘over, above’, OIr. for ‘over’). Some adverbial suffixes 
did exist; for *-d h e, *-r, and pronominal adverbs generally, cf. p. 89. For numerical adverbs, 
cf. p. 91; for adverbial compound prefixes like *h,su- ‘good’, cf. p. 79-80. 


PRONOUNS 

Unlike nouns/adjectives and verbs, which always had CVC type roots, pronominal roots 
were usually CV, VC, or V (of course, *HV- is always theoretically possible for initial 
*V-, cf. p. 52). The pronominal declension had mostly the same endings as the nominal 
one, except for five special different endings (apparently, not all appeared in all pronomi- 
nal declensions): nom. sg. m. *-0 (cf. nominal *-s), nom. sg. n. *-d (cf. nominal *-0/m), 
gen. sg. m./n. *-(e)so (cf. nominal o-stem *-osyo, otherwise *-es/-os), nom. pi. m. *-i (cf. 
nominal *-es), and gen. pi. *-som (cf. nominal *-om, p. 63). The oblique m./n. sg. cases 
(gen./abl./dat./loc.) of non-personal pronouns also feature the element *-sm- before the 
endings, usually thought to be the zero grade of *som- ‘one’ (p. 89); cf. the English use 
of this one, which one, etc. In the plural (and dual) masculine/neuter forms, the nom. *-i- 
often spreads already in PIE to the oblique cases as part of the enlarged stem (e.g., gen. 
pi. *to-y-som, p. 86). Many basic pronouns had suppletive forms (cf. heteroclitic nouns - 
p. 77-78). Personal pronouns, unlike other pronouns and nouns, did not have all seven/ 
eight cases (this is opposite to the situation in many present-day IE languages - like 
English, the Romance languages, and Macedonian/Bulgarian - where personal pronouns 
preserve the declension, while the nouns do not). In the history of the IE languages, pro- 
nominal and nominal forms often mixed, cf. *-o-i instead of original *-o-es in the nom. 
pi. o-stems in Latin/Greek/Balto-Slavic (p. 65), or -rum instead of the older -um in the 
gen. pi. in Latin (p. 67). Non-personal pronouns, which have different gender forms, like 
adjectives, sometimes had two (m. = fi, n.), sometimes three forms (m., fi, n.). 


Personal pronouns 

The personal pronouns are rather difficult to reconstruct since the attested IE systems are in 
many cases rather different (due to copious innovation). 7 They were characterized by partial 
(nom. sg. *tu ‘you’ - gen. sg. *teb h e) or complete suppletivity (nom. sg. *h,eg ‘I’ - ace. sg. 
*me); by the existence of clitic forms - sometimes as the only existing, or at least recon- 
structable, fonn and often syncretic (like gen./dat./acc. pi. *nos ‘us’); and apparently by the 
lack of reconstructable loc./instr. cases (all languages have different, innovative-looking 
forms), which were either non-existent or, less likely, innovative in all languages - the 
lack of separate vocative fonns is not strange, and the ablative is the same as the genitive 
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in most nominal declensions anyway. Like many older IE languages, PIE had no separate 
3rd person forms of personal pronouns - demonstrative/anaphoric pronouns (p. 85) were 
used instead. Many monosyllabic fonns exhibited facultative lengthening (p. 54-55). The 
reconstructed fonns are not always completely morphologically transparent (i.e., the root, 
suffixes, and endings are not always easily distinguished - e.g., in gen. *mene ‘of me’). 


1 sg. T 

nom. *h,eg > GAv. as-cit (lx), Slav. *ja (e.g., BCMS ja), OLith. e\s\, Hitt. \u\k 

*h,eg-Hom > Ved. aham, GAv. azS m, Gr. sy\(b\v, Slav. *jazi (e.g., OCS azd), 
Goth, ik, Ann. elsl 

*h,eg-oh, > Gr. syco, Lat. ego, Ven. ego 

ace. *me (accented/clitic) > Ved. ma, GAv. ma, Gr. |s|pe, clitic ps, Lat. me, OCS m\$\, 
OPruss. mie\n\, Goth. mi\k\, OIr. me 
gen. *mene > Ved. ma\m\a, YAv. mana, OCS mene 
*moy (clitic) > Ved. me, GAv. moi, Lat. mf|s 
dat. *me-g h -i > Ved. mahy\a(m)\, Lat. mih{T} (Umbr. meh{e}). Arm. \i\nj 
*moy (clitic) > Ved. me, GAv. moi, Gr. pot, OCS mi, OLith. mi 

The 1 sg. pronoun showed the suppletivity of *h,eg- (nom.) and *m- (oblique). The 
oblique cases had both stressed and clitic fonns. Three nom. forms are reconstructable, 
one non-derived (*h,eg) and two with suffixes, the *-oh 2 one perhaps being dialectal. The 
existence of two (even three) parallel forms may not be as strange as it might seem (even 
without any specific different function/meaning, which may have existed in PIE), cf. 
Hitt, uk and ukel, ukila ‘I, myself’. Hitt, oblique amm- (cf. ace. ammuk ‘me’), Ann. i- (cf. 
ace. is ‘me’), and Gr. e- in stressed fonns (cf. ace. sps but clitic ps) would point to *h,m- 
(p. 48), not *m-. However, this is not in accord with the glottogonic, but nonetheless 
persuasive, connection of pronominal *me- with verbal eventive 1 sg. *-m, 1 pi. *-me 
(p. 93). Thus, it is perhaps better to assume that this *h,m- is innovative - an analogy to 
the initial *h,- in the nom. (where *h[eg, not *eg, has to be reconstructed in that case) or 
later to the initial *e- in the nom. However, *h,m- would formally also work for PIE. Like 
in the 2nd person accented dat. sg., the ending is difficult to reconstruct (all languages 
show different endings). Here, we tentatively reconstruct PIE *-i, on the basis of Ved. -y- 
am, OLith. dat. mani, and perhaps clitic *mo-i, but that is admittedly highly speculative. 
The element *-g h - in the dat. is usually thought to be some kind of particle originally. It is 
possible that *moy had a variant *mey (which would be a more ready pre-form for Latin, 
Lithuanian, and Old Church Slavic). In the ablative, Ved. mad and OLat. med are formal 
cognates (PIE *me-t?) but are more likely separate innovations. 


2 sg. ‘you ’ 

nom. *tu > Ved. t(u)\vam\, GAv. tu, Gr. (Dor./Aeol.) in, Lat. tii, OCS tv, Lith. til (OPruss. 

tii), Goth .pu, OIr. tu (also tu-ssu). Arm. \d\u. Alb. ti, Toch. B t(u)\we\ 
ace. *twe > Ved. tva\m\, GAv. dfla, Gr. as 

*te (clitic) > Gr. (Dor.) is, Lat. t\e\, OCS t\§\, OPruss. tie\n\, Goth. pi\k\ 
gen. *tewe > Ved. tava, OCS te\b\e, Lith. tav-, Toch. B ci 
*toy (clitic) > Ved. te, GAv. toi, Lat. ti]s\ 

dat. *tub h -i > Ved. tubhy\a(m)\; *teb h -i > Av. ta‘bii\d\, Lat. tib{T} (Umbr. tef{e}), OCS 
teb{e}, OPruss. tebb{e(i)} 

*toy (clitic) > Ved. te, Gr. (Dor./Lesb./Ion.) xoi, OCS ti, OLith. ti 
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The forms are in many ways parallel to those of the 1 sg. The stems *tew-/tu-/tw- and *t- 
are marginally suppletive (these are probably ultimately somehow related to the verbal 2 
pi. *-te). Anatolian languages show *ti- in the nom. (CLuw. tT) and *tu- in oblique cases 
(Hitt, tu-), which is difficult to explain as innovative so some think that PIE originally 
had a nom. *tf (cf. Kloekhorst 2008: 112-115), though the supposed original Anat. nom. 
*tu- is often thought to be the origin of the vocalism of Hitt, iik T. The existence of 
both short and long reflexes in the nom. points to monosyllabic lengthening (p. 54-55), 
as in other similar personal pronouns forms (cf. ace. *me and *t(w)e), not to a laryngeal 
length. Ved. tubhya(m) with tu- looks like an archaism (cf. OCS rm>ne ‘to me’, Latv. dial. 
mm- by possible early analogy), while other languages point to a more regular *teb h - (cf. 
gen. *tewe and dat. *meg h i). The usual dat. ending *-ey in Lat. tibT, OPruss. tebbe(i) is 
perhaps an innovation (by analogy to nouns). Unlike the 1 sg. pronoun, the stressed and 
unstressed forms were distinguished in the ace. 


1 pi. ‘we’ 

nom. *we-y(-es) > Ved. vcty\am\, GAv. va\em\, Goth. wei\s\, Hitt. we\s\, Toch. B we{s} 
(also archaic *mes?, see below) 

ace. *n-s-me > Ved. asm\a\\n\, YAv. ahma (GAv. Shma), Gr. (Aeol.) apps, Goth, uns, 
Hitt. anz\a\\s\ 

*no-s (clitic) > Ved. nas, GAv. na (< *nas), Lat. nos, OCS n\y\ Hitt, -nas, Alb. 
nelna 

gen. *nd-s (clitic) > Ved. nas, GAv. ns (< *nas), Lat. nos\trum\, OCS nas \b\, OPruss. 
n\oii\s\an\. Alb. na 

dat. *no-s (clitic) > Ved. nas, GAv. ns (< *nas), OCS n\y\, Hitt, -nas, Alb. na 

The suppletive fonns were *we- and *no-/n- with plural endings (the pronominal *-i in 
the nom. and *-s(-), presumably a variant of the nominal pi. *-es, in other cases). The form 
*wey is not really attested as such, since Indo-Iranian has the usual additional *-om and 
Germanic/ Anatolian point to *wey-es with the additional plural *-es. For *we- cf. the dual 
pronominal and verbal fonns (p. 94), while *ns- is often derived from *ms- (cf. the nominal 
ace. pi. - p. 63), i.e., from *me- as in the 1 sg. pronoun (cf. also verbal 1 sg. *-m) and *-me- 
in 1 pi. verbal forms. Because of these considerations, one is prone to consider *mes ‘we’ 
(Lith. mes, Arm. me¥) an archaic dialectal by-form (with *we- often considered originally 
dual), though it could also be an innovation due to analogy to the verbal ending *-mes 
(p. 93). The ace. *(ns)me- is an unlikely source for *mes. For the gen./dat., only clitic forms 
seem to be reconstructable. In the stressed ace., a suffix (or reduplication of the original 
root?) *-me can be reconstructed on the basis of Greek and Avestan. It is not impossible 
that the dat. had a non-clitic *ns-m- with an unreconstructable ending (cf. Ved. asm-e, Av. 
ahm-a’biia, Gr. Aeol. app-i(v), Goth, uns-(is)), but this can also be a later analogy to the ace. 
(apparently, this *-sm- had nothing to do with *-sm- in non-personal pronouns, see below). 


2 pi. ‘you ' 

nom. *yu-s > Ved. yu\vam\, GAv. vus, OCS \v\v, Lith.y'us, Goth, jus 
ace. *u-s-we > Goth. \Azwi|s|, MW \chwi\; us-me > Ved. \y\usm\a\\n\, Gr. (Aeol.) uppe, 
Hitt. summ\a\\s\ (< *usmas) 

*wo-s (clitic) > Ved. vas, GAv. vci (< *vas), Lat. uos, OCS v\v\, OPruss. wa\n\s 

gen. *wo-s (clitic) > Ved. vas, GAv. vs (< *vas), Lat. uos\trum\, OCS vas|7j| 

dat. *wo-s (clitic) > Ved. vas, GAv. vs (< *vas), OCS v\v\ 
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The forms are in many ways parallel to those of the 1 pi. The suppletive forms are *yu- 
(nom.) and *wo-/u- (oblique). A frequent ad hoc assumption is that *yu- stems from the 
older *u-. The nom. (which, unlike the 1 pi., has nominal *-s) is attested with a long *u 
only, so a laryngeal *yuHs is theoretically possible instead of the monosyllabic length- 
ening (p. 54—55), but not likely (cf. the dual form). The stressed ace. again shows *-me, 
but Goth, izwis looks archaic if we consider this *-we (also seen in the dual) an original 
reduplication of the root. Cf. Katz 1998 for *-me/-we. 


I du. ‘us two ’ 

nom. *we > Ved. vd\m\ (lx), GAv. vd, OCS ve, OLith. ve\du\, Goth. wi\t\ 
ace. *nh r we > Ved. av{3m}, GAv. Soauua, Goth. ug{kis} 

*no-h[ (clitic) > Ved. na\u\, Av. na, Gr. (nom./acc.) vco, OCS na 
gen./dat. *no-h[ (clitic) 

The suppletive forms are much the same as in the 1 pi. Cf. also the verbal 1 du. *-we- 
(p. 94). We reconstruct the nom. form with monosyllabic lengthening because of the short 
and long reflexes. If one were to reconstruct *weh[, OLith. ve-du and Goth, wi-t would be 
difficult to account for. The nom. form is the same as the plural one, except for the miss- 
ing plural *-i. The laryngeal length of the gen./dat./acc. clitic (with the dual ending *-h,) 
is proven by consistent long reflexes, parallelism to the plural forms (which are the same 
except for having the plural ending *-s(-)), and the zero-grade length in the stressed ace. 
form (*nh,-). The clitic was probably *noh[ (like pi. *nos), but no language distinguishes 
*oH from *oH. The reconstruction of the gen./dat. clitic is due to the supposed structural 
parallelism to the plural forms. In dual forms, the stressed ace. has only *-we (unlike the 
prevalent *-me in the plural), according to the reflexes. Some think this *-we is found 
also in the ace. sg. *t-we (but this may also be *tu-e, cf. nom. *tu). 


2 du. ‘you two ’ 

nom. *yu > Ved. yu\vam\, OCS \v\y, lA\h.ju\du\ 
ace. *uh r we > Ved. y\u\v{am} 

*wo-h[ (clitic) > Ved. vd\m\, OCS va, Toch. B nom. we\ne\ 
gen./dat. *wo-h[ (clitic) 

The forms are parallel to the 1 du., and the suppletive stems to ones in the 2 pi. forms. 

Reflexive pronouns 

The reflexive pronoun ‘self’ was identical to the 2 sg. personal pronoun, except that it 
did not have a nom. sg. and had an initial *s- instead of *t-. Cf. gen. *sewe (OCS se\b\e), 
dat. *seb h i (YAv. h\uu\auudii\a\ Ihwawyal, OLat. sib{ei}, OCS seb{e), OPruss. sebb{ei}), 
clitic gen. *soy (GAv. hoi), clitic dat. *soy (GAv. hoi, Gr. oi, OCS si), ace. *swe (Ved. 
sva[vam\, Gr. e, Pamph. phe), clitic *se (Lat. se, OCS s\$\, OPruss. sie\n\, Goth. si\k\). The 
pronoun was originally used not only for the 3 sg. but for the 1 and 2 sg. as well. 

Possessive pronouns 

The singular possessive pronouns, derived from personal and reflexive pronouns, can be 
formally reconstructed, though their antiquity is not beyond doubt (the forms often differ 
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in specific languages): *m-o-s (Gr. |e|(xo<g, GAv. ma) and *me/oy-o-s ‘my’ (Lat. meus, 
OCS mojb ), *tw-o-s ‘thy’ (Ved. t\’ds, GAv. &fia, Gr. adq, Arm. to), *sw-o-s ‘own’ (Ved. 
svas, Gr. oq). 


Demonstrative pronouns 

A number of demonstrative pronominal stems can be reconstructed - some better attested 
than others. Considering the number of the attested stems, it is probable that PIE had 
a three-way system of demonstrative pronouns (like Lat. hiclistelille and most IE lan- 
guages, or Japanese koret sore/ are), correlated with three persons, and not a two-way 
one (like English this/that), cf. p. 160. The exact reconstruction of the semantics of the 
stems is a bit speculative, since these change easily in languages. The three best-attested 
(and probably main) demonstratives are *ki- ‘this (near me/speaker)’, *s-/t- ‘that (near 
you/the addressed)’, and *(h,)ey-/(h 1 )e- ‘that (near him/over there)’. The somewhat less 
well-attested stems are *(h[)e/ow-o-s ‘this’ (Ved. avas, OCS ovi >) and *(h, /3 )e/on- ‘that 
(near him)’ (OCS om>, Lith. anas. Arm. na). It is not clear why there were five different 
reconstructable demonstrative stems (a number of scenarios are possible), though some 
languages preserve all of them, cf. OCS sb, ovb, tb, om (suppletive ace. i). 

For the pronoun *ki- ‘this’ (also *ko-?), cf. OCS Sb (m.) - si (f.) - se (n.), Lith. sis - si 
(fi), OIr. ce, Arm. sa, Hitt, kas (CLuw. zas) - kT (n.), PGmc *xi (OEng. he ‘he’, Goth. 
himma daga ‘today’, etc.), also Lat. (hi)-c (cf. cis ‘on this side’), Gr. (£)K£i-vo<; ‘that one 
there’ (*key-?), Alb. si-vjet ‘this year’. Owing to differences in languages and partial 
attestations, the PIE declension is not clear. 

The suppletive forms are well attested in *s-o (m.) - *s-eh, (f.) - *t-o-d (n.) ‘that’ (cf. 
Av. ho — ha — tat, Gr. 6 - f| - to ‘the’, CIb. so (?) - sa - sod, Toch. B se - sa - te, OLat. 
f. sa-psa ‘herself’). The *s- appears only in the nom. sg. m./fi; all other forms have a *t-. 
Here is the declension (n. = m., except in the nom./acc. and du.): 


TABLE 1.29 IE DEMONSTRATIVE PRONOUN *SO - *SEH 2 - *TOD (SINGULAR) 


sg- 

PIE 

Ved. 

OCS 

Lith. 

Goth. 

nom. 

*s-o - *s-eh, - 
*t-o-d a 

sa (|s|) — sa- tad 

\tb\ - \t\a - to 

\Ad|s] - \Ao b 

sa-so— pat\a\ 

acc. 

*t-o-m - *t-e-h 2 -m 
*t-o-d 

tdm - tarn - tad 

tb - tg — to 

tq 

pan\a\ -po- pat\a\ 

gen. 

abl. 

*t-e-so — *t-e-sy- 
e-h 2 -es 

*t-o-sm-ot -*t-e-sy- 
e-h 2 -es 

tas\y\a - tasyas 

tasmat - tasyas 

Ao\{g}o - Ao j?\ 

t{d} - 1 {<5i} 

pis -pi\z\os 

dat. 

*t-o-sm-oy — *t-o-sy- 
e-h,-ey 

tasmai - tasyai 

tomu - to\i\ 

tamui c t{ai\ 

pamma - p\iz\ai 

loc. 

*t-o-sm-i -*t-o-sy- 
e-h,-i 

tasmi\n\ - 
tasy{am} 

tomb - to\i\ 

tam{e} - to\j\{e} 


instr. 

*t-o-y-? - *t-o-y- 
eh, (?) 

tend - taya 

temb - toj\g\ 

t{ud} - t\q\ 



a In sg./pl. always m. - f. - n. and m./n. - f. (when just two forms); b Lithuanian has no neuter, cf. OPruss. 
sta < *s-t-od; c Old Lithuanian. 


The *-o-/-e- that appears after *s-/t- in the sg./pl. is apparently a thematic vowel. The 
non-pronominal endings are from o-/e/? 2 -stems (except for masculine loc. *-i, not *-oy). 
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Ved. sa, Gr. 6, and Goth, sa, with no ending, are archaic (cf. also *ey below), though the 
Greek declension of the pronoun is otherwise innovative, with all nominal endings. Goth. 
pat-a probably proves the neuter ending was *-d (not *-t). Germanic (and Slav, ceso, Gr. 
Ion. leo) probably points to a gen. in *-e-so, replaced by thematic *-o- and other endings 
elsewhere. In the masculine abl./dat./loc., the element *-sm- (Ved. -sm-, OCS/Lith. -m- 
with a special development of *-sm-, OPruss. -sm-, Goth, -mm-) appears after the root, cf. 
also OPruss. dat. stesmu (with *s- + *t-), CIb. dat. somui (with generalized *s-), and Arm. 
dat. dma < *doma (with d- as in du ‘thou’, p. 438). The f. gen./abl./dat./loc. sg. forms 
appear, on the basis of Indo-Iranian, to have *-sy- instead (connected to *si- ‘one’ > Hitt. 
sT-, Gr. Horn. ia, besides pia, cf. Beekes 1988: 81; or perhaps from older *-smy- from 
*smih, ‘one’ - p. 89). In Old Church Slavic, *tosy- is replaced with *toy- (as in the pi. 
and instr. sg.); in Gothic with *tes- (as in the masculine gen.); in Lithuanian (and Greek) 
with a simple *t- plus endings (e.g., the gen. *t-eh 2 es instead of *tesy-eh 2 es). The instr. 
forms are unclear, though both Vedic and Old Church Slavic point to *toy- (otherwise 
the pi. stem). 


TABLE 1.30 IE DEMONSTRATIVE PRONOUN *SO - *SEH, - *TOD (PLURAL) 


PL 

PIE 

Ved. 

OCS 

Lith. 

Goth. 

nom. 

*t-o-y - *t-e-h 2 - 
es - *t-e-h. 

te - tas — ta 

ti - t\y\ — ta 

tie - tos 

pai - pos - po 

acc. 

*t-o-ns - *t-e-h 2 - 
(n)s - *t-e-h. 

t\a\n - tas - ta 

ty (m./f.) - ta 

tubs - tqs d 

pans - pos - po 

gen. 

*t-o-y-som - *t-e- 
h,-som 

tes\a\m - tas\a\m 

text 

t\i }\ e 

p\i\z{e } -p\i\zo { 

dat. 

*t-o-y-b h / 
mos - *t-e-h 2 - 
b h /mos 

tebh\y\as - 
tabh\y\as 

temb 

Hems - toms f 

paim 

loc. 

*t-o-y-su - *t-e- 
h 2 -su 

tesu — tasu 

texb 

tuos{e} - tos{e } 


instr. 

*t-o-y-b h i(s) - 
*t-e-h,-b h i(s) " 

tebhis b_ tabhis 

temi c 

t{ais} - tomis 



a BS1. *-miHs; b AV tais (cf. Av. tais and Lith. tais) is innovative; c Old Church Slavic (and Gothic in the 
dat.) have generalized *toy- in both m./f. in the gen./dat./loc./instr. (cf. the Lat. f. gen. \is\tarum); d cf. 
OPruss. stans ', e cf. OPruss. steison ; 1 OLith. -mus \ 8 cf. adjectival -a/re, -aizo. 


The nom. and gen. have pronominal endings. From forms like *toy, many later lan- 
guages generalized the originally pronominal ending *-oy in the nominal o-stems as well 
(p. 64-65). For gen. *-som (often changed to *-sdm, as in the nominal *-om/-om, p. 70), 
cf. also CIb. soisum and Hitt, kinzan (from *ki-). This *toy- seems to have spread from 
the nom. to the gen./dat./loc./instr. pi. (and instr. sg.). 


TABLE 1.31 IE DEMONSTRATIVE PRONOUN *SO - *SKIL - *TOD (DUAL) 


du. 

PIE 

Ved. 

Gr. 

OCS 

nom./acc./voc. 

*t-o-h! (m.) - *t-o-yh, (f./n.) 

ta (m.) - te (f./n.) 

Ttb 

ta (m.) — te (f./n.) 

gen./loc. 

*t-o-y-ow 


rot{v} 

toju 

dat./abl./instr. 

*t-o-y-b h y/moh l 

t\a\bhya\m\ 

rot {v } 

tema 
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The dual forms have the nominal endings, with the oblique *-oy- as in the plural. 

The pronoun for ‘that (over there)’ (used anaphorically as well - referring to some- 
thing already mentioned - which was perhaps its original function) has suppletive 
*(h[)ey- (zero-grade (*h[)i-) and *(h[)e- (n.=m., except in nom./acc.): 


TABLE 1.32 IE DEMONSTRATIVE PRONOUN *EY (*IS) - *IH 2 - II) (SINGULAR) 


sg- 

PIE 

Ved. 

Lat. 

OCS 

Goth. 

nom. 

*ey (*i-s) - 

ay\am\ - i\yam\ - 

is h ~ {ea} - id c 

{om> - ona - ono 

is - |s|/ - it\a \ h 


*i-h,_ *i-d 

id\am \ a 


f i 


acc. 

*i-m - *i-h,-m - 

im\dm\ - {imam} - 

OLat. im - 

* {j<? ~je} 

in\a\ - i\ja\ - it\a\ 


*i-d 

id\am\ 

{earn} - id 



gen. 

*e-so - 
*e-sy-e-h,-es 

as\y\a — asyas 

e\i\u\s \ d 

je{g}o -jey?\ 

is ' ~ i\z\os 

abl. 

*e-sm-ot - 
*e-sy-e-h,-es 

as mat - asyas 

e{o}-e{d} 



dat. 

*e-sm-oy - 
*e-sy-e-h,-ey 

asmai - asyai 

asmi\n\ - asy{am } 

e{T} 

jemu -je\i\ 

imma - i\z\ai 

loc. 

*e-sm-i - 
*e-sy-e-h,-i 


jemb - je\i\ 



instr. 

*ey-? - (?) 

end - aya 


imb -jejg 



a ‘this here’; b OLat. ei\s\ (lx); c 'that one; he/she/it’; d from *e-syo-s; c cf. South Picene loc. esme/in; * 'he, 
she, it’, cf. above for on-\ the original forms are seen in the determinate adjective endings -i < *-jt, {-ja}, 
{-je } ; B < *jb; b ‘he/she/it’ (cf. Germ, er - sie - es ); 1 OHG es (n.). 


Many forms are parallel to those of *s-/t- (cf. above), but the nature of the suppletion 
pattern is different and the reconstruction more difficult and less secure in some cases. The 
variant *ey-/i- appears in the nom./acc. sg. and *e- elsewhere. For the nom. cf. also GAv. m. 
aii\Stn\ ‘this’ - f. i\m\ ‘this’, n. Tt ‘it’; Lith. |/| is ‘he’ — ]/|i ‘she’ (this pronoun mixes with the rel- 
ative *yo- in Balto-Slavic), and OIr. e ‘he’ - |.sji ‘she’ - ed ‘it’. The archaic nom. m. fonn was 
perhaps, though a bit speculatively, *ey-0 with a pronominal zero-ending (Ved. ay-, perhaps 
OIr. <?), occurring together with a younger *i-s (with nominal *-s and zero-grade *i- as in the 
f./n. and ace.). For the ace. m. cf. also Gr. Cypr. iiv; the ace. f. is structurally reconstructed. 


TABLE 1.33 IE DEMONSTRATIVE PRONOUN *EY CHS) - *IH 2 - II) (PLURAL) 


pi. 

PIE 

Ved. 

Lat. 

OCS 

Goth. 

nom. 

*e-y - *e-h 2 -es - 
*e-h. 

\im\e - \im\ds - 
\im\a 

\eT\ c ~\eae\- 

\ea\ 

{oni - ony - 
ona} f 

<3 

1 

0 

1 

acc. 

*e-o-ns a_ *e-h,-(n) 
s — *e-h. 

\im\\a\n - \im\ds - 
\im\d 

\eo\s - \ea\s - 
\ea\ 

{)? ~j?} ~ja 

\i\ns - \ij\os - \ij\a 

gen. 

*e-y-som - es\a\m - as\a\m 

*e-h 2 -som 

{eum} d ~ \edrum\ 

ixb 

\Az{e} -V\zo 

dat. 

loc. 

*e-y-b h /mos - 
*e-h 2 -b h /mos 
*e-y-su - *e-h 2 -su 

ebh\y\as - 
abh\y\as 
esu - asu 


imb 

ixb 

\Am 

instr. 

*e-y-b h i(s) - ebhis - abhis 

*e-h 2 -b h i(s) b 

[dat. { //.s' } ] e 

imi 



a Contracted to *ons; b BS1. *-miHs; c Plautus (also \eA|.s|), OLat. WA[s|; d OLat., cf. Osc. eisun\k \; e OLat. 
{eieis }; 1 original forms in determinate adjective -ji, -je. -ja. 
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The *ey- appearing in the plural is not the *ey of the nom. sg. but *e-y- (with pronom- 
inal *-i), parallel to *to-y- (see above), though synchronically it made no difference. The 
ace. pi. is difficult to reconstruct - Vedic seems to have a secondary im- (that later spread 
to the nom. pi. as well) by analogy to the ace. sg., the Latin forms are “de-contracted”, 
while Gothic has a secondary ins (with i- as in most forms). 


Interrogative pronouns 

The interrogative pronoun for ‘who, what, which?’ has the root *k w -, which shows up in two 
forms - as *k w -o- and *k w -i-. Cf. *k w -o-s (m.) > Ved. kas ‘who, which?’ (GAv. sandhi-fonn 
kas), OCS kb-to (oblique ko-), Lith. kas, Goth, has (all ‘who?’); *k w -o-d (n.) > Ved. kad 
‘what, which?’ (GAv. kat), Lat. quod ‘which?’, OPruss. ka ‘what’, Goth, ha ‘what?’ (cf. 
*k w -e-h, (f.) > Ved. ka, GAv. ka, Goth. ho). Cf. also *k w -i-s (m./f.) > Ved. na-\k\is ‘no one’, 
GAv. cis, Gr. it q, Lat. quis, Hitt, kuis (all ‘who?’); *k w -i-d (n.) > Ved. cid ‘any’ (see below), 
GAv. -cTt (pronominal particle), Gr. it, Lat. quid, Slav. *& (Croat. Cak. ca, cf. OCS cb-to), 
Hitt, kuit (all ‘what?’). The relation of *k w i- and *k w o- is not clear; perhaps the first was 
originally ‘who/what?’, the second (adjectival) ‘which?’, cf. the opposition of Lat. quis 
‘who?’ - quid ‘what?’ and quod ‘which?’, and Slav. *& ‘what’ and *kr> ‘which’ (Croat. 
North Cak. ka, OCS kvi < *kb-ji>). Alternatively, it can be supposed that the thematic *k w o- 
is innovative/secondary altogether (perhaps already in PIE). The declension is not easy to 
reconstruct, mainly because the relation of the two stems is unclear (*k w o- and *k w i-/k w ey-, 
cf., e.g., the ace. sg. Ved. kam < *k w o-m but Lat. quern, Hitt, kuin < *k w i-m ‘whom?’; nom. 
pi. Ved. ke, GAv. koi < *k w o-y but GAv. caiias, OLat. ques < *k w ey-es), but the reconstruc- 
table forms apparently parallel those of the demonstrative *so, cf. the gen. *k w -e-so ‘of 
whom/what?’ (GAv. cah\ii\a, Gr. Ion. teo, early Lat. qiAo\\i\u\s\, OCS ceso ‘of what?’, Goth. 
his), dat. *k w -o-sm-oy ‘to whom?’ (Ved. kasmai, GAv. kahmai, Umbr. pusme, OCS komu, 
Goth, hamma), etc. It is unclear whether separate *k w -o-so (Ved. kas[v\a, GAv. kah\ii\a ‘of 
whom/what?’, OCS ko{g}o ‘of whom?’), *k w -e-sm-oy (OCS cemu ‘to what?’), etc. existed 
(with the oblique suppletive *k w e- in alternation with the nom./acc. *k w i-, and the oblique 
*k w o- from the nom./acc. *k w o-), though they are formally easily reconstructable. For inter- 
rogative pronominal adverbs, cf. p. 89. 


Indefinite pronouns 

The indefinite pronouns were derived from the interrogative ones. One pattern was 
doubling, cf. *k w is-k w is ‘whoever’ (Lat. quisquis, Hitt, kuis kuis), *k w os-k w id (Ved. kas 
cid ‘anyone’, GAv. kascTt ‘every, any’), etc. The other one was adding the particle *k w e 
(p. 105), cf. *k w os-k w e (Ved. kas ca ‘anyone’, Ann. ok' ‘someone’ - p. 440), *k w is-k w e 
(Lat. quisque ‘whoever’), *k w od-k w e (Lat. quodque ‘whatever’, Goth, hah ‘each, every’ 
(n.)), etc. Sometimes simple interrogative pronouns were probably used as indefinite pro- 
nouns as well. 


Relative pronouns 

The relative pronoun is reconstructed as *y-o (m.) - *y-e-h, (f.) - *y-o-d (n.) ‘who/what/ 
which’ (sometimes thought to be a thematization of the demonstrative *i-, p. 87), with 
the same declension as *so - *seh, - *tod (p. 85-86). Cf. Ved. yds - vd - yad, Gr. oc, - 
f) - 6, OCS i\ze\ (< *ji>) - ja\ze\ -je\ze\ (for ze cf. p. 535), CIb. io\s\ (ace. sg. iom, dat. sg. 
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iomui), Gaul, dugiionti-io ‘who serve’, OPhryg. ios. In many languages, the interrogative 
*k w - has taken over the function of the relative *yo- (e.g., in Anatolian, Tocharian, Italic), 
sometimes (as in part of Slavic) after the first attested texts. However, some doubt the 
antiquity/originality of the relative *yo (cf. also p. 166). 


Pronominal adjectives 

There are a number of pronominal adjectives, cf. *h 2 elyos ‘other’ (Gr. aXkoq, Lat. alius, 
Goth, aljis, OIr. aile, Toch. B alyek ), *k w oteros ‘which of the two’ (OCS kot\o\rvi, Lith. 
dial. kat\a\ras ‘which’, and cf. p. 55, 80), etc. 


Pronominal adverbs 

The pronominal adverbs are rather diverse in the IE languages, and not a lot of clear- 
cut correspondences exist. Cf. *k w u-d h e ‘where?’ (Ved. kuha, OCS kbde; cf. Gr. 7i6Sev 
‘whence?’ < *k w o-d h e-n) and *k w e/o-ti ‘how many?’ (Ved. kati, YAv. ca‘ti, Lat. quot, Bret. 
pet, cf. Gr. adj. noaoq < *k w otyos) from *k w - (p. 88); Ved. i-tas ‘from here’ from *i- 
(p. 87), cf. Gr. ev-Toq, Lat. in-tus ‘inside’ for the suffix *-tos; similar forms from pronomi- 
nal roots with the suffix *-r: Lat. quo-r ‘why?’, Lith. ku-r ‘where?’, Goth. Iva-r ‘where?’, 
Alb. ku-r ‘when?’, Arm. u-r ‘where?’, etc. 


NUMERALS 

PIE clearly had a decimal system. The reconstructed main cardinal numbers (with more 
or less direct reflexes), leaving out the many problematic issues, are: 

1. *h,oy-n-o-s ‘one’ (m.) (o-stem), *-eh, (f.), *-o-m (n.) > Gr. ow] ‘the ace on dice’, 
OLat. oinos, Lith. |v| ienas (cf. Slav. *im> ‘other’ < *h[eyn-, OCS in-o-rogu ‘uni- 
corn’), OPruss. ace. sg. ainan, Goth, ains, OIr. den (with different suffixes Ved. 
ekas; OPers. aiva ‘one’, Gr. Cypr. oipoq ‘only’, p. 49); *som (m.) ‘one’ (OCS sam- 
■b ‘alone’) - *sm-ih 2 (f.) (Gr. pia, Arm. mi, cf. p. 86 for *si-) - *sem (n.) (Gr. £v, 
cf. Lat. sem-el ‘once’), cf. Ved. sa-kft ‘once’ (*sm-), Gr. opoq ‘one and the same’, 
Goth, sama ‘same’ (*som-) 

2. *d(u)w-o-h, (m.) ‘two’ (cf. p. 54 for Lindemann’s Law; o-stem du.) > Ved. d(u)vd 
(-au), Gr. Horn. 5uco, Lat. duo, OCS dwa, Lith. du, OIr. dou, Ann. erku (p. 433^-34), 
Toch. A wu; *d(u)w-eh 2 -ih[ (f.) (eh 2 - stem du.), *d(u)w-o-yh[ (n.) (o-stem du.) > Ved. 
d(u)ve (f./n.), Lat. duae (f.), OCS dwe (f./n.), Lith. dvi (f.), Toch. A we (f.); cf. like- 
wise *-b h -o-h 1 ‘both’ (Goth. b{ai}), *h 2 ent-b h oh, (Toch. B antapi, Gr. apxpco, Lat. 
ambo), also OCS oba, Lith. abu (*h 2 e-b h "oh 1 ?), Ved. ubha (*Hu-b h oh,?) 

3. *tr-ey-es ‘three’ (m.) (/-stem, p. 32) > Ved. trayas, Gr. ipo;, Lat. tres, OCS trije, 
Lith. tr\y\s, Goth, preis, OIr. trl. Arm. eret, Toch. B tr\a\i, cf. loc. *tr-i-su (Ved. 
trisu), etc.; *tisres (f.) (< *tri-sr-es, 7-stem) > Ved. tisras, OIr. teoir; *tr-ih, (n.) 
(/-stem pi.) > Ved. trl, Gr. ipia, OCS tri 

4. *k' v etwor-es ‘four’ (m.) ( 7 -stem, p. 23) > Ved. catvaras, Gr. T£Tx\a\peq, OCS 
cet\y\re, OIr. ceth(a)ir. Arm. cforkf, Toch. B stwer; oblique/compound *k w twrC-/- 
urV- (p. 52) >Ann. tar-’, *k w ete-sr-es (f.) (< *k w etwer-sr-es?) > Ved. catasras, OIr. 
cetheoir; *k w etwor (n.) < *-orh 2 (p. 70) > Lat. qu\a\ttuor, Goth, \f\idwor (reshaped 
*k w etwor-h, (n.) > Ved. cat\’5ri, Gr. T£Ti\a\pa) 
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5. *penk w e ‘five’ (p. 24) > Ved. pahca, Gr. nevxe, Lat. qu\t\nque, Lith. penk{i}, Goth. 
fimf, OIr. coic, Ann. hing. Alb. pese , Toch. A pan, B pis; cf. *pnk w -stis ‘fist’ > PS1. 
*p§stb (BCMS pest), OEng .fyst 

6. *sw/s/weks ‘six’ > Ved. sat, Gr. Dor. pel;, Lat. sex, Goth, saihs, OIr. se (W chwech, 
Gaul, suexos), Ann. vec c (p. 440), Toch. B skas 

7. *septm ‘seven’ (loanword from Semitic *sab < I-at-um, p. 29) > Ved. sapta, Gr. ejxxd, Lat. 
septem, Lith. sept{vni}, Goth. si\b\u\n\, OIr. secht ", Ann. ewt c n, Toch. A spat 

8. *h 3 ekto(w) ‘eight’ (p. 45) > Ved. asta (-au), Gr. oktcd, Lat. octo, Lith. astuo{ni}, 
Goth, ahtau, OIr. ocht M , Ann. u\t c \ (p. 440), Toch. B okt 

9. *h,newn ‘nine’ (p. 35) > Ved. nava, Gr. e|v|vsa, Lat. noue\m\, Goth, niun, OIr. not", 
Arm. inn, Toch. AB nu 

10. *dekm ‘ten’ (p. 34) > Ved. dasa, Gr. Sekci, Lat. decent, Goth, taihun, OIr. deich". 
Arm. t\a\sn, Toch. B \s\ak 

20. *wlkntih, < *dw-i(h,)-dkmt-ih 1 ‘twenty’ (du.) > Ved. vi\m\sati\s\, YAv. visa’d, Gr. 
Dor. piKax\i\, Lat. uiginti, OW uceint, Ann. k'san, Toch. A wiki 

100. *kintom < *dkmt-o-m ‘hundred’ (p. 34) > Ved. satam, Gr. |e|koix6v, Lat. centum, 
OCS sbto, Lith. simta\s\, Goth, hunda, CIb. kantom, Toch. B kante 

1000. *g h esl- ‘thousand’ > Ved. \sa\-hdsram, Gr. Aeol. ^e^Atot (Lat. tni-lle < *smih 2 - 
g h esl- ‘one thousand’); dial. *tuHs-(d)kmt-ih 2 (literary *‘strong hundred’) > OCS 
tysf/g\st\i, OPruss. ace. pi. tusimt{ons}, Goth .pusundi (Eng. thousand) 

Only numbers 1^1 had declension and distinguished gender. The two forms for ‘one’ 
(disregarding the different suffixes with the root *hjOy-) perhaps had slightly different 
semantics or usage. In 3 and 4, Indo-Iranian and Celtic point to feminine forms with 
*-sr-, often interpreted as ‘woman’ (cf. *swe-sr- ‘sister’, etc.). The word for ‘twenty’ is 
originally ‘two tens’, and ‘hundred’ is derived from ‘ten’ (‘ten’ itself is usually thought to 
be ‘two-hand’, cf. Goth, handus ‘hand’ < *komt-). The numbers from 1 1 to 19 were prob- 
ably made by the pattern ‘one-ten’, ‘two-ten’, etc., cf. *dwoh,-dekm(t) ‘twelve’ > Ved. 
duvadasa, Gr. ScoSeica, Lat. duodecim. The decades 30-90 were made with *-(d)komth 2 
(originally *-(d)komt with the loss of *-h 2 ?), the old collective of *dekin(t), cf. *tri-h 2 - 
(d)komth, ‘thirty’ > Gr. ipi\d\Kovia, Lat. tngint\a\, OIr. tricho. Arm. eresun (p. 440^441), 
Toch. B t\dry\ak\a\ (Toch. B -ka could be from *-(d)komt as well, cf. p. 467), etc. The 
numbers 21-29, 31-39, etc. were perhaps made the same way as 11-19, cf. *treyes- 
trih 2 (d)komth 2 ‘thirty-three’ (lit. ‘three-thirty’) > Ved. trayas-tri\m\s\at\, etc. The hundreds 
from 200 to 900 are difficult to reconstruct but perhaps were (at least in one variant) 
compounds with *-(d)kmt- (cf. Gr. dial. -Kax-ioi, Lat. -cent-T). 

The ordinal numbers were derived from the cardinal ones (except ‘one’) with various 
thematic suffixes. The original pattern is sometimes difficult to discern. The provisory 
reconstructions are: 

1. *prh 2 -w/m/t-o-s ‘first’ (< ‘foremost’) > Ved .pumas ‘fore, first’, OCS prbVb, Toch. 
B parwe(-sse); Lith .pirmas, Goth .f\ru\m\a\; Gr. Dor. jrpaioq (cf. adverbs *preh 2 -i 
> Lat. prae ‘in front of’, *prh, > Gr. jrapa/wapa ‘beside’) 

2. ‘second’ cannot be reconstructed (all languages have different roots) 

3. *tr-/tri-t-y-o-s ‘third’ > Ved. trt\T\vas, YAv. dritiia-, Lat. tertius, OCS tr\e\tii, Lith. 
tr\e\cias; Gr. ipfiAot;, Goth. pridj\a\ 

4. *k lv twr-y/t-o-s ‘fourth’ > Ved. tur\T[vas, YAv. tu’riia-, Gr. x|e|TapToq, Lat. quartus, 
OCS c\e\tvrbtb, Lith. k\e\tvirtas, OEng. f\e\orp\a\, Toch. B starte 

5. *penk w -t-o-s ‘fifth’ > Gr. neymoc,, older Lat. quinctus, OCS pgtb, Lith. pehktas, 
Goth, fimfta-, Toch. B pihkte 



PROTO-INDO-EUROPEAN MORPHOLOGY 91 


6. *(s)(w)ek(s)-t-o-s ‘sixth’ > Ved. sast\h\as, Gr. 8kto<;, Lat. sextus, OCS \s\estb, Lith. 
Xs\estas, OPruss. uschts (< *ukstos), Goth. saihst\a\, Toch. B skaste 

7. *septm-o-s ‘seventh’ > Ved. sapt\am\as, Gr. e|35|o|(.io<;, Lat. sept\i\mus, OCS sedrrvb, 
OPruss. septmas ; *septm-t-o-s > Ved. saptat\h\as, Lith. septintas, OHG sibunt\o\, 
Toch. A saptant 

8. *h 3 ekt-ow-o-s > Gr. o\y8\ooq, Lat. oct\a\uus 

9. *h,newn-t-o-s ‘ninth’ > Gr. evaxoq, OCS \d\ev£t-b, Lith. devintas, Goth. niund\a\, 
Toch. B fiunte 

10. *dekmt-o-s ‘tenth’ > Gr. SeKaioq, OCS desgth, Lith. desimtas, Goth. taihund\a\, 
Toch. B skante; *dekm-Fl-o-s > Ved. dasamas, Lat. decimus 

Multiplicative adverbial numerals *dwis ‘twice’ (Ved. dvis, Gr. Slq, Lat. bis, Ole. tvis- 
var ), *tris ‘thrice’ (Ved. tris, Gr. xpiq, Lat. ter. Ole. pris-var), *k w eturs ‘four times’ (Ved. 
catur, Lat. qiiSdster) can also be reconstructed (cf. also Ann. - p. 441). 


VERBS 

The reconstruction of the PIE verbal system is more complex than the reconstruction 
of the nominal system. There is some concord when it comes to the reconstruction of 
specific fonns and paradigms (which is the primary task here - detennining the exact 
function of specific fonns is more difficult), although it is not always clear what catego- 
ries were present in what phase of PIE. There are a number of reasons for disagreement 
in the reconstruction of the PIE verbal system. First of all, the verbal systems of the 
early attested IE languages are much less congruent than the earliest nominal systems - 
e.g., while Vedic has three voices (active, middle, passive), five tenses (present, aorist, 
imperfect, perfect, future), and five moods (indicative, injunctive, imperative, subjunc- 
tive, optative), Hittite has only two of each (active, middle; present, preterite; indicative, 
imperative). Thus, the PIE system is more difficult to reconstruct, and there is more room 
for different possible reconstructions. Generally speaking, analyzing verbal systems is 
often hard and controversial even when it comes to living languages, so it is no wonder 
that this is even more difficult when it comes to dead languages (like Vedic) and even 
more so for reconstructed languages (like PIE). 

The earliest, traditional reconstructions of the PIE verbal system were based on the 
so-called Greco-Aryan model and the rich verbal morphologies of (mainly) Greek and 
Old Indie. Flowever, as became apparent after the discovery of Anatolian (and Toch- 
arian, but to a smaller extent), this reconstructed system was very different from the 
less complex systems found in Anatolian languages - for instance, Anatolian shows no 
traces of the subjunctive and optative moods that have to be reconstructed for the rest 
of IE. The basic problem is how to interpret these discrepancies - to consider them as 
the result of post- Anatolian common innovations in late PIE, to consider Anatolian as 
innovative (e.g., losing the subjunctive and optative), or some kind of combination 
of both (p. 194). This then results in the fluid nature of what exactly is reconstructed 
for PIE - sometimes it is Early PIE (with Anatolian) and sometimes it is Late PIE 
(post- Anatolian). 

The following PIE verbal categories can be reconstructed (we will mention possible 
others later in the text): 

number: singular, dual, plural 

person: 1st, 2nd, 3rd 
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aspect: present, aorist, perfect 
voice: active, middle 

mood: indicative, imperative, optative, subjunctive 

nominal verb forms: participles (and verbal adjectives, verbal nouns) 

Things are mostly straightforward when it comes to number, person, voice, and nominal 
formations. What were traditionally regarded as tenses are now most often interpreted 
as aspects (the present being imperfective, the aorist perfective, and the perfect stative 
aspect), 8 with the system best preserved in (early) Greek. Thus, for PIE we can recon- 
struct the imperfective (“present”) *stisteh 2 mi ‘I am getting up’ - the perfective (“aorist”) 
*steh 2 m ‘I stood up’ - the stative (“perfect”) *stestoh,e ‘I am standing’ (not all verbs 
could form all three). Present/imperfective and aorist/perfective are grouped together as 
eventive (as opposed to perfect, i.e., stative). The categories of voice and mood existed in 
the eventive system only (thus, the perfect had no middle and no moods). Both the pres- 
ent and aorist had separate middle forms. The (active or middle) indicative is unmarked, 
and the rest of the moods functioned as separate simple formations (there was only one 
optative, subjunctive, and imperative - though in the case of the latter also with special 
future endings); i.e., there were no separate forms for the subjunctive present, subjunctive 
aorist, etc. as in Vedic or Greek. The present/aorist opposition, optative, subjunctive, and 
dual (and special formations like thematic presents or reduplicated perfects) did not exist 
in Anatolian and were possibly post- Anatolian. 

All verbal roots had a CVC structure (with possible clusters - p. 52-53), with *e usu- 
ally thought to be the “basic” ablaut grade (p. 43), cf. *b h er- (p. 40), *d h eh r (p. 45), 
though there were exceptions like *kan- (p. 42). PIE verbal forms had the following 
structure: root (+ affix) (+ thematic vowel) + ending (+ particle). A verb had to have a 
root and an ending (even if a zero morpheme). Some forms also had derivational suffixes 
(a thematic vowel was a particular kind of them) and end-particles. Cf. verbal forms like 
*h,s-te ‘you (pi.) are’ (root + ending), *b h er-e-te ‘you (pi.) are carrying’ (root + thematic 
vowel + ending), *b h owd h -ey-e-me-s ‘we are waking (somebody)’ (root + suffix + the- 
matic vowel + ending + particle). Reduplication (cf. *d h e-d h eh 1 -mi ‘I am putting’) was 
more frequent than in nouns, and there was also an infix (cf. *yu-ne-g-mi ‘I am yoking’ 
from the root *yug-/yewg-). 

PIE had a special class of particles that were attached after endings in some finite 
verbal forms. Some were a sort of secondary markers for a certain category, others had 
no clear function, some were probably facultative, and some forms could have had more 
than one particle. The major verbal particles were *-i (the present, hie & nunc, particle), 
*-u (the imperative particle), and *-o and *-r (the middle particles). Minor ones were the 
1 (and 2) pi. and 1/2/3 du. present *-s and *-n, and the 1 pi. middle *-d h h 2 (others also 
occur in specific languages or cases). 

Present 

What is traditionally called the “present” in world languages often does not only stand 
for events in real present time (which is its primary and prototypical role) but can also 
be used for general statements, future, past, etc. This was undoubtedly the case in PIE as 
well - for instance, having no separate future tense, PIE surely had to “replace” it with 
present forms (at least in some instances). However, we cannot get into the specifics of 
the usage of the present in PIE here. 

There were two basic present conjugations (similar to noun declensions): thematic (with 
the thematic vowel *-e/o- between stems and endings and with no ablaut alternation) and 
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athematic (with endings right after the stems, and with ablaut alternations - most often 
*e in sg. and *0 in pi.), like *b h er-e-te ‘you are bearing (pi.)’ but *h[d-te ‘you are eating 
(pi.)’. Thematic and athematic verbs had the same/similar endings in the plural (*-me-, 
*-te, *-nt-), while the singular forms originally differed. Athematic (and partly thematic) 
endings *-m-, *-s-, *-t- (sg.), *-me-, *-te, *-nt- (pi.) are shared with the aorist (p. 95-97). 
Some endings (especially athematic ones) had a hie & nunc particle *-i, while the 1 pi. had 
also *-s or *-n (also sometimes in the 2 pi.) - the endings with these present particles are 
traditionally called “primary” (though they are not really primary). In later IE languages, 
the non-ablauting thematic forms usually win, but usually with incorporation of at least 
some originally athematic endings. Here is the PIE athematic paradigm with reflexes: 


TABLE 1.34 IF. ATHEMATIC PRESENT 



PIE 

Ved. 

Gr. 

Lat. 

OCS 

OLith. 

Goth. 

Hitt. 

1 sg. 

*h 1 es-m-i 

as mi 

SIJil 

{sum} 

jesmb 

esm{iy 

im 

esmi 

2 sg. 

*h,e(s)-s-i 

asi 

T 

81 

es 

jes{i] ' 

es{iy 

is 

essi 

3 sg. 

^es-t-i 

asti 

8GTI 

est 

jest\b\ 

esti 

ist 

eszi 

1 Pi 

* 11 ! s- me/o(-s/n-(i)) 

smasi b 

Horn. 81 JJ. 8 V 

{sumus} 

\je\snrb 

\e\sm\e\ 

{sijum} 

\e\s\uw\ani 

2 pi. 

^s-te 

st\h\a c 

S(JT8 

\e\sti\s\ 

\je\ste 
s\g\t\b \ f 

|e|^Ae\ 

{ sijup } 

h 

3 pi. 

*h,s-ent-i a 

santi 

Dor. svti 

s\u\nt d 

{esti} 

sind 

as\a\nzi 


a With a younger variant *-onti; b also smas ; c also stha-na ; d Umbr. sent ; e with middle *-oy (p. 99) instead 
of *-i; f Slav, also *s$tb (> Mac. ce), OCS dadqtv , 8 accented -weni ; h -lent. 


Athematic presents had a full grade (*h 1 es-) in the sg. and a zero grade (*h[S-) in the 
pi. Most endings had the present particle *-i. In the 1 pi. (like in thematic stems), numer- 
ous variants seem to have existed (the ending had ablaut variants *-me/o, together with 
particles *-s or *-n, and the present particle *-i in some languages): *-mes (Gr. Dor. -pa;, 
Ved. -mas, Cz. -me), *-mesi (Ved. -masi), *-men (Gr. Att. -pev), *-mo (Slav, -mo), *-mos 
(Lat. -mus, OCS -nrb, Goth, -m), *-meni (Hitt, -\w\eni), etc. (in some cases, different 
developments are possible). Their original distribution, exact formation, and number of 
existing variants is unclear. The athematic 3 pi. *-enti was replaced early on with thematic 
*-onti in many languages (perhaps facultatively/dialectally already in PIE). The thematic 
endings were: 


TABLE 1.35 IE THEMATIC PRESENT 



PIE 

Ved. 

Gr. 

Lat. 

OCS 

Lith. 

Goth. 

1 sg. 

*-o-h. 

-a\mi\ 

-(0 

-d 

-\p\ 

-u 

-a 

2 sg. 

*-e-h,-i 


-eilsl 



-i e 



*-e-s-i 

-asi 


-is 

-e\s\{i} 8 


-is 

3 sg. 

*-e 


-e|v| 


d 

-Vi\ 



*-e-t-i 

-ati 


-it 

-et\b\ 


-ip 

1 Pi 

*-o-me/o 

-amas(i) 

-OpEV b 

-imus 

-\e\mo 

-\a\m\e \ f 

-am 


(-s/n-(i)) 







2 pi. 

*-e-te ■ 

-at\h\a 

-8T8 

-iti\s\ 

-ete 

-\a\t\e \ f 

-ip 

3 pi. 

*-0 





-a 



*-o-nt-i 

-anti 

Dor. -ovxi 

-unt 

-gt\b\ 


-and 


‘ Cf. Toch. A -dc; b Dor. -ogee; c Slav, also *-esb; d -e in West and (modem) South Slavic; e reflexive -ie-si; 
• reflexive -me-si, -te-si < *-me, *-te (cf. perhaps the rare Ved. scansion -md in the aor. 1 pi.). 
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The 1 sg., 1 pi., and 3 pi. had thematic *-o-, and the other forms had *-e-. In the 1 sg., 
the reflexes point to *-o, which is interpreted as *-oH (on the assumption that there were 
no *-V# type endings 9 ) and further as *-o-h 2 (on the basis of the supposed relation to the 
perfect ending *-h 2 e and middle *-h 2 (e) - p. 98-99), which is frequently reconstructed, 
though in fact speculative. The usual reconstructions for the 2/3 sg. and 3 pi. are *-esi, 
*-eti, *-onti (attested in most languages), with the endings as in athematic stems. In con- 
trast to what is usually reconstructed, one can suppose that thematic stems originally had 
different endings, perhaps coexisting with younger athematic-influenced ones already in 
the last stage of PIE. The heterodox thematic reconstruction *-eh[i in the 2 sg. is based 
on Greek (if not from *-esi + *-s) and Lithuanian (the accent points to a laryngeal); *-e 
in the 3 sg. again on Greek/Balto-Slavic (with analogical thematic *-o in Lithuanian) and 
Toch. B -a-m (< *-e-nu); 10 *-o in the 3 pi. on Lithuanian (where sg. = pi.) and Toch. A -e 
< *-o (together with -enc < *-onti)- cf. Beekes 2011: 260. The original thematic endings 
in the 3 sg./pl. would thus be bare thematic vowels in different ablaut. Anatolian had no 
thematic present. 

The endings of the dual (athematic/thematic) are not easily reconstructable: 


TABLE 1.36 IE PRESENT DUAL ENDINGS 



PIE 

Ved. 

Gr. 

OCS 

Lith. 

Goth. 

1 du. 

*-(o)-we-(s) 

~(a)vas b 


-(e)v\e\ 

-(a)v\a\ 

-(o> 

2 du. 

*-(e)-tHV-(s/n) 

-(a)thas 

-(e)tov 

-( e)ta 

-(a)ta c 

-ts 

3 du. 

*-(e)-te/o-s/n a 

-(a)tas 

-(e)tov 

-{e)te 




Cf. Toch. B -tem < *-ton-V; b cf. Av. -uuahi ; c cf. perhaps Umbr. 2 pi. imp. eta-to < *-ta. 


Cf. the pronoun *we ‘us two’ (p. 84) for 1 du. Ved. 2 pi. -tha is perhaps influenced by 
the 2 du. 

The formation of the present stem is much more diverse than in other verbal forms - 
both thematic and athematic present conjugations were formed in numerous ways 
(usually with different affixes - some with a specific derivational meaning, others with- 
out it or with a meaning difficult to reconstruct). The list here is non-exhaustive and 
simplified: 

thematic - (a) root-stems (cf. PIE *weg h -e-, *b h er-e- above; with zero-grade PIE 
*g w rh 3 -e- ‘devour’ > Ved. girati, OCS zbrg); (b) reduplicated stems (PIE *pi-ph,-e- 
from *peh 3 - ‘drink’ > Ved. pibami, Lat. biboy, (c) *-y- suffix (PIE *leh 2 -y-e- ‘bark’ 
> Ved. rdvati, OCS lajg, Lith. lojw, also in denominatives like Ved. apasyati 
‘works’ from apas- ‘work’ and rare causatives like Lat. sopio ‘I put to sleep’ < 
*swop-y-e from *swep- ‘sleep’); (d) *-ey- causatives/iteratives 11 (PIE *mon- 
ey-e- ‘make somebody think’ from *men- ‘think’ > Lat. moneo ‘I remind’, Ved. 
manayati ‘honors’); (e) *-sk- (PIE *prk-sk-e- ‘ask’ > Ved .prcchati, Lat. posed)', (f) 
*-eh,- statives (PIE *sed-eh,-e- ‘to be seated’ from *sed- ‘sit’ > Lat. sedere, OCS 
sedeti, Lith. sedeti ‘to sit’); (g) *-eh 2 - factitives (PIE *new-eh,-e- ‘to make new’ > 
Hitt, newahh- from *newos ‘new’); (h) *-(e)s- (Lat. gem ‘I carry’ < *h,g-es-e- from 
*h 2 eg- > Lat. ago ‘I drive’; also desiderative *-s-, which became the new future in 
many IE languages) 
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athematic - (a) root-stems (cf. PIE *h,es- above or *h,ey- ‘go’ > Ved. emi, Gr. eipi; 
“Narten-presents” with long root, cf. Ved. dasti ‘waits’ < *dek-ti); (b) reduplicated 
stems (PIE *de/i-deh 3 - ‘give’ > Ved. dadami, Gr. SlScopi); (c) *-n(e)- infix (PIE 
*yu-n(e)-g- ‘yoke’ > Ved. vunakti, pi. yunjanti, Lat. iungo ); (d) *-new-/-nu- suffix 
(Ved. krnomi ‘I make’, krnutha ‘you (pi.) make’) 

One can distinguish primary (derived directly from a verbal root without a suffix or 
with a suffix with no exact derivational meaning) and secondary formations (derived 
with a suffix that changed the meaning of the verbal root - e.g., causative, iterative, 
etc. - or from a noun or adjective root). The primary ones are all athematic and the- 
matic (a)-(c). One verb could have had more than one coexisting primary present 
forms (with or without different connotations) - cf. from *g w em- ‘go, come’ both 
*g w m-sk-e- (Ved. gacchati ‘goes, comes’, Gr. panics! ‘come!’) and *g w m-y-e- (Gr. 
Paivco ‘I walk’, Lat. uenid ‘I come’). It is possible, even likely, that some assumed 
thematic formations were originally athematic. What the exact formation of a specific 
PIE verb was has to be reconstructed word by word (cf. LIV). For the middle present 
see p. 99-100. 

Aorist 

The PIE aorist denoted perfective aspect. Its endings were basically the same as those 
in the athematic present (p. 93), pointing probably to a common pre-PIE origin of the 
present and aorist. However, the aorist did not have particles attached to endings as in 
the present (particularly not the hie & nunc *-i), except in the dual (but different from 
the present dual). The endings in the aorist (traditionally called “secondary”) are thus 
actually unmarked (unlike the present ones). Vedic preserves the complete opposition 
of the present and aorist endings (cf. Ved. pres, -mas(i) but aor. -ma), while many lan- 
guages later generalized the original present endings in the aorist as well (cf. Gr. -pev in 
pres./aor.). Greek and Indo-Iranian are the only branches to preserve both the aorist and 
perfect. In Italic and Celtic the aorist merges with the perfect into new preterite forma- 
tions; in Germanic it is lost; in Slavic, Armenian, and Albanian it is preserved (while the 
perfect is generally lost there except for traces in Balto-Slavic and Armenian). Anatolian 
has no aorist (which is perhaps not an innovation). Greek, Indo-Iranian, Phrygian, and 
(limitedly) Armenian have a so-called augment in the past tenses (including the aorist), 
attached to the beginning of the form (Ved. a-, Gr. £-, Phryg./Ann. e-), which stems from 
the PIE adverb *h[e ‘at that time’. Some linguists reconstruct the augment as part of 
the aorist for PIE as well (where it may have been at the most a facultative or dialectal 
feature). 

Unlike the numerous present formations, the aorist had only these basic formations: 

1) root-aorist - root [sg. - full grade, pi. - zero grade] + endings *deh 3 -m - pi. *dh,-me 

2) sigmatic aorist - root [long grade] + *-s- + endings *weg h -s-m 

3) thematic aorist - root [zero grade] + thematic vowel + endings *wid-o-m 

The root-aorist is attested in Indo-Iranian, Greek, and Armenian (perhaps partially in Old 
Church Slavic) only but is often considered the most archaic (while sigmatic/thematic 
aorists are often considered secondary). It is formed from the endings attached directly 
to the root (often of those verbs that have a reduplicated athematic present), with the *e 
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grade in the sg. and the zero grade in the pi. (the original ablaut is preserved in some 
verbs in Greek): 


TABLE 1.37 IE ROOT-AORIST 



PIE 

Ved. 

Gr. 

I sg. 

*deh 3 -m a 

dam b 

|£5a>{ic<x} d 

2 sg. 

*deh 3 -s 

\d\dds 

|s|8to {Kaq} 

3 sg. 

*deh 3 -t 

\d\ddt 

|s|8to {ke} 

lpl. 

*dh,-me 

\a\d\d\ma 

|E|8opsv| 

2 pi. 

*dh 3 -te 

sth\d\ta 

e|8ote 

3 pi. 

*dh 3 -nt 

\a\d{ur} a 

|£j8o{aav} ' 


a Asyllabic because of Stang’s Law (cf. p. 70); b cf. apam ‘has drunk’; c cf. the original a-bhuv-an < *b h uh 2 - 
nt, Av. -at < *-nt; d cf. the original s-oxr|-v ‘stood’ < *steh,-m; B cf. the more original (Homeric) sax-av 
< *sth,-nt 


Cf. also *steh 2 m ‘stood’ (Ved. astham, Gr. eaxr|v), *d h eh,m ‘put’ (Ved. adham, Gr. 
£&r|Ka with secondary -Ka, 1 pi. eSepev, Arm. 3 sg. ed), etc. The root was perhaps excep- 
tionally zero grade in *b h uh 2 - ‘be’: 3 sg. *b h uh 2 t (Ved. abhut, Gr. etpu, (?) OCS by). In 
Vedic and Homeric Greek, one can also find augmentless aorists like Ved. bhut, Horn, tpu 
‘became’, etc. 

The sigmatic aorist, very productive in late PIE and later languages, was formed from 
a long-grade root, the suffix *-s-, and endings: 


TABLE 1.38 IE SIGMATIC AORIST 



PIE 

Ved. 

Gr. 

OCS 

I sg. 

*weg h -s-m 

\d\praksa\m\ 

Isl^EUsa 

ves\b\ 

2 sg. 

*weg h -s-s 3 

|a[ya\s\ b 

|g|^{a<;} 


3 sg. 

*weg h -s-t 

\a\prat 

|eKeu^{e} 

f 

lpl. 

*weg h -s-me 

\d\jaisma c 

|EiJsul;|ajps|v| 

ves\o\m{b} 

2 pi. 

*weg h -s-te 

\d\cchdnta d 

|sKEU^|a|xE 

veste 

3 pi. 

*weg h -s-nt 

\d\vaks{ur } e 

|sK£Usa|v| 

ves§ 


a Simplified to *weg h s probably already in PIE (p. 53); b ‘has sacrificed’; from yaj- < *Hyag-; 0 ‘have 
conquered’; from ji-', (l < *acchantsta ‘have seemed’, from chand-; e cf. Av. -at < *-nt; f cf. perhaps OCS 
2/3 by-st-b ‘were/was’. 


The long grade in the sg./pl. is well attested in Vedic, Old Church Slavic (vast) ‘con- 
veyed’ from vez- < *weg h -, bast, ‘jabbed’ from bod- < *b h od h -, grest, ‘rowed’ from greb- 
< *g h reb h -, etc.), and Latin, where some perfect forms preserve the ablaut and *-s- but 
not the original endings (uexi ‘I conveyed’ from ueh- < *weg h -, etc.). Some believe that 
the pl./du. originally had the *e-grade (which would then be generalized in Greek and 
attested in the Vedic middle). 

The thematic aorist has a zero-grade root (preserved in Vedic and Greek), thematic 
vowel (like in the present), and endings: 
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TABLE 1.39 IE THEMATIC AORIST 


PIE 

Ved. 

Gr. 

OCS 

1 sg. 

*wid-o-m 

\d\vidam b 

|e|15ov 

dvigb 

2 sg. 

*wid-e-s 

\d\vidas 

|e|I5s<; 

dvize 

3 sg. 

*wid-e-t a 

\d\vidat 

|e|!5s 

dvize 

lpl. 

*wid-o-me 

\d\viddma 

|e|i5ojj,e|v| 

dvigom{b} 

2 pi. 

*wid-e-te 

*|a|vidata 

e|i5ete 

dvizete 

3 pi. 

*wid-o-nt 

\d\vidan 

|e|!5ov 

dvigQ 


Cf. Arm. egit ‘found’; b cf. also augmentless vidam. 


Cf. also Gr. rjku&E, OIr. luid < PIE *h,lud h et ‘went’, which can hardly be secondary. For 
the zero grade, cf. also Gr. aor. stpuyov, Exponiov from pres, (psuyo ‘I run’, xp£;tco ‘I turn’, 
etc. Thematic aorists are often secondary, and exact correspondences are few - some 
researchers consider all (or most) of them as secondary/innovative. PIE may also have 
had a reduplicated aorist, cf. Ved. avocam, Gr. euxov < *(h,e)-we-wk w -o-m ‘I said’. The 
dual endings in Vedic/Greek differed from those of the present (the thematic vowel is in 
brackets): 1 *-(o)-we > Ved. -{d)va, OCS -(o)v\e\; 2 *-(e)-to-m > Ved. -(a)tam, Gr. -(e) 
tov; 3 *-(e)-te-h,m > Ved. -(a)tam, Gr. -(£)xr|v (< -(£)xdv). 

Some also reconstruct imperfect for PIE, based on Greek and Vedic, cf. Ved. asam, Gr. 
(Ion.) rja < (?) PIE *(h 1 e)-h 1 es-m; Ved. abharam, Gr. stpepov < (?) PIE *(h 1 e)-b h er-o-m. 
It is identical to the present except that it has aorist endings. This would then be the past 
tense of the imperfective aspect stem (the present being the present tense of imperfec- 
tives). Augmentless aorists and imperfects (sometimes used in a “timeless” meaning) 
found in Vedic, Gatha-Avestan, and Homeric/early Greek are often called injunctives. 
It seems that they were originally verbs unmarked for tense/mood when following pre- 
ceding fully marked verbs or after the prohibitive imperative particle *meh, ‘not’ (> Ved. 
md, Gr. pi), Alb. mo ‘do not! ’, Ann. mi). In both cases, there was no need for redundant 
marking of tense/mood so it was left out, which is perhaps a structure of PIE origin - e.g., 
PIE may have used the injunctive forms like *b h eront ‘they are carrying’ without the 
pres. ptcp. *-i, not the pres. *b h eronti, after the present was already marked on a previ- 
ous verb. 12 This is typo logically very similar to the Bantu consecutive (cf. Nurse 2003: 
101-102). For the middle aorist see p. 100. 


Perfect 

The “perfect” originally (whether in the last phase of PIE or in pre-PIE) most likely indi- 
cated the stative aspect. The frequent, and plausible, assumption is that the original sta- 
tive (expressing states like ‘I am dead’) at one stage, perhaps partially already in late PIE, 
became resultative (‘I have died [and I am dead now]’), and finally in post-PIE preterite 
(‘I died’). In most later IE languages, it has developed into a regular past tense. However, 
in Homeric Greek (with relics in Classical Greek) the perfect expresses the meaning of 
the state of the subject (this occurs less commonly also in Rgveda), cf. Homeric XE&vqKE 
‘is dead’, eox^ke ‘is standing’ (cf. p. 309). Some original perfect forms have a present 
tense (or stative) meaning in other languages as well - like *woydh 2 e ‘I know’ almost 
everywhere, Latin verbs like mem ini ‘ I remember’ (cf. Goth, man ‘thinks’ andGr. pspova 
‘I yearn for’), or the Germanic praeteritopraesentia class (p. 400). The PIE perfect is 
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preserved in Indo-Iranian, Greek, Italic/Celtic (mixed with aorist), Germanic, and Anato- 
lian (albeit in different form than in Indo-Iranian/Greek) - elsewhere, it was lost, usually 
with some remaining traces. 

The usual perfect formation had partial reduplication of the root (originally perhaps only 
*Ce- but also *CV-, cf. Gr. Xe-Xoma but Ved. ri-reca < *le/i-loyk w -h 2 e from *leyk w - ‘leave’), 
the ablaut alternation in the root (*o-grade in the sg., zero grade in the pi. - preserved regu- 
larly in Vedic, sometimes in Germanic), and special (athematic) perfect endings (*-h 2 e, *-th,e, 
*-e, *-me, *-e, *-r/-er). For reduplication cf. Ved .ja-gama ‘I went’ (from gacchati ‘goes’), 
Gr. ;i8-;ioi9a ‘I am persuaded’ (from 7idSco ‘I persuade’), Lat. ce-cim ‘I sang’ (from cand 
‘I sing’), Goth, hai-hait ‘I called’ (from haitan ‘to call’), OIr. ce-chan ‘I sang’ (from canid 
‘sings’), etc. Reduplication was regular in Indo-Iranian and Greek, and sporadic in Germanic, 
Italic, and Celtic. In PIE it may have been present in almost all forms (though we cannot be 
sure). The ablaut alternation is attested regularly in Indo-Iranian, in Germanic “strong” verbs, 
and some (early) Greek forms. The ablaut alternation includes the accent shift in Vedic (jujosa 
‘he has enjoyed’ - jujusur ‘they have enjoyed’) and Gennanic “strong” verbs, cf. OEng. ceas 
‘I/he chose’ - curon ‘we chose’ (with Vemer’s Law - p. 20, 393) < PIE *(gu)gowse - *(gu) 
gusr. Anatolian shows no special perfect stem (cf. p. 1 87 for possible reduplication), just dif- 
ferent endings. The exact relation of the PIE perfect to the Anatolian /?/-present, in spite of the 
formally close similarity of endings, is controversial (p. 187). 

The best-attested perfect fonn is the one that is rather aberrant - the perfect form 
*woyd-/wid- has no reduplication (it is the only such plausibly reconstructable form) 
and has a stative (in later languages present) meaning ‘know’, derived from *weyd- ‘see’ 
(p. 49). The apparent (already PIE) development was ‘I have seen’ > ‘I know’ (or, more 
precisely, ‘I am in the state of having seen/knowing’): 


TABLE 1.40 IE PERFECT 



PIE 

Ved. 

Gr. 

Goth. 

1 sg. 

*woyd-h 2 e a 

veda 

olSa 

wait 

2 sg. 

*woyd-th 2 e b 

vettha 

oia\D\a 

waist 

3 sg. 

*woyd-e 

veda 

oi5e 

wait 

1 Pi 

*wid-me 

vidma 

i'5pEv| 

wit\u\m 

2 pi. 

*wid-e 

vida 

\'a {te} 

wit {up} 

3 pi. 

*wid-r/-er 

vidxur\ 

Ta'uaij 

wit{un } 

1 du. 

*wid-we 

vidva 


witu 

2 du. 

*wid-tH-(?) c 

vid\a\th[ur] 


wit\u\ts 

3 du. 

*wid-t-(?) c 

vid\a\t[ur] 




a Cf. OCS isolated vecle ‘I know’ < *woydh,e-y (with hie & nunc *-i); b phonetically *woytsth,e (p. 54), 
cf. Hitt. pret. -tta; c phonetically *witst(H)- (p. 54). 


The PIE perfect endings were close to both the pres./aor. active and middle endings, 
though the latter is often overly emphasized. The 1 sg. *-h,e is attested directly as -hha in 
Anatolian (p. 1 87), and indirectly in the lack of Brugmann’s Law (p. 40, 206) in Ved. cakara 
‘I have made’ < *k w e-k w or-h 2 e (a closed syllable because of the laryngeal) but cakara ‘he 
has made’ < *k w e-k w or-e (an open syllable). The opposition of the 1 sg. *-a < *-h 2 e and 
the 3 sg. *-e is seen in the OIr. pret. 1 sg. gad ‘I asked’ but 3 sg. gaid ‘he asked’ (with 
palatalization fonn *-e). The 1 pi. *-me and the 1 du. *-we is the same as in the pres./aor.; 
the 2 pi. *-e (*-o is theoretically also possible) is attested only in Vedic but looks archaic 
(elsewhere we find the pres./aor. *-te). The variants in the 3 pi. are not clear - cf. Hitt. pret. 
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-er, older Lat. -er-e (with a secondary particle *-i) for *-er (from original *-er-s?, cf. p. 70); 
Av. -af, probably Ved. -ur, and Hitt, rare -ar for *-r (cf. also NPhryg. 5aKap(sv) ‘they have 
made’); (?) GAv. cikditar’s (and perhaps Ved. -ur and Lyd. -rs) for *-r-s (cf. also Ven. -ers 
(?)). The particle (?) *-s in some 3 pi. forms is perhaps the same as in the present *-me/o-s 
(and possibly identical to the nominal nom. pi. *-s?). The original perfect endings have 
been influenced by the aorist ones in a number of languages. The intrusion of the present *-i 
(p. 92-93) into Italic perfect endings (cf. Lat. 1 sg. -f < early -ei < *-h,e-y, 2 sg. -is-ti 
< *-th 2 e-y) points to the older stative (“present”) meaning of the perfect. 

Middle (mediopassive) 

The middle (medium) voice stands in opposition to the active voice (the present/aorist forms 
we have seen were all active - the perfect had no active/middle distinction but can be con- 
sidered active of a sort). The reconstruction of the concrete meaning of the definitely poly- 
functional PIE middle is rather speculative. The active typically expressed a simple action 
(or state) by the subject (cf. Gr. Afreet ‘I unbind (something)’), while the middle indicated 
that the subject is doing an action for his or her benefit or that it is affected by the action 
(cf. Gr. Afropon ‘I unbind (for) myself’). The meaning was thus reflexive (Gr. Aofropai ‘I 
wash myself’) and sometimes reciprocal (cf. Ved. vadete ‘two of them talk to each other’). 
Since the PIE middle yielded a passive voice (cf. Goth, gibada ‘is given’) in some families 
(Italic, Germanic), it seems that the PIE middle also had that function in some cases (hence 
the other name, mediopassive). It also may have had an impersonal function (cf. Lat. itur 
‘one goes’). Some verbs (called deponent or media tantum ) had exclusively middle endings 
but non-middle meaning (cf. Ved. save, CLuw. zTvari < PIE *key-or Ties’; Ved. vaste, Hitt. 
westa < PIE *wes-tor ‘wears’). Most families preserved the PIE middle (the opposition 
active : middle is best preserved in Anatolian, Indo-Iranian, and Greek), with the notable 
exceptions of Balto-Slavic (which preserves traces in active endings, cf. OPruss. assai ‘you 
are’ < *-so-y) and Armenian (with innovative mediopassive forms). 

The middle had special endings - for the middle present and middle aorist - similar to 
both the present/aorist and perfect endings (though the latter is usually overemphasized), 
the reconstruction of which is rather difficult. It was typified by two particles: *-o (in 
both the present and aorist) and *-r (only in the present). Instead of *-r (attested in Anato- 
lian, Italian, Celtic, Armenian, Tocharian, Phrygian, Venetic), posited as original by many 
researchers, some dialects have the hie & nunc *-y (Indo-Iranian, Greek, Germanic, Bal- 
to-Slavic, Albanian), which is likely secondary but possibly a dialectal feature already in 
PIE. In the 2/3 sg. and 3 pi., variant endings can be reconstructed (present/aorist-like and 
perfect-like) - their original function/distribution is not clear. Here is a tentative recon- 
struction of the present (primary) middle endings (both thematic and athematic): 


TABLE 1.41 IE MIDDLE PRESENT 



PIE 

Ved. 

Gr. 

Lat. 

Goth. 

OIr. 

Toch. B 

Hitt. 

1 sg. 

*-h,e-(r) 

-\e\ 

-|p|a|i| 

{-or} 

{-da} * 

I-'" - } 

{-mar} 

-(Wpm 

2 sg. 

*-s-o 

-s\e\ 

-aoji| 

-re 

-z\a\ 





*-th,e-r 


(dial.) d 



-th\e\r 

-ta\r\ 

-ta(r\i\) 

3 sg. 

*-t-o-r “ 

-t\e\ 

-T0\l\ 

-tur 

-d\a\ 

-thar 

{-tar} 

- Hr\i \ ) 


*-o-r 

-\e\ 

(dial.) e 



-ar 


-am 

lpl. 

*-me-(s)-d h h 2 

-mah\e\ 

-p£(a)9a 

{-mur} 

-nd\a\ 

{-mmar} 

-mt\a\\r\ 

-\w\asta 

2 pi. 

*-(s)-d h we 

-dhv\e\ 

-a9e 

{-mini} 

{-nda} 

-id 

-\Aa|r| 

-tum\a \ 8 

3 pi. 

*-nt-o-r 

-nt\e\ 

-vxo\i\ 

-ntur 

-nd\a\ 

-tar 

{-ntdr} 

-anta(r\i\) 


*-(e)r-o b 

-r\e\ c 

(dial.) 
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a Cf. Phryg. -Top; b also variant *-(e)r-o-r?; c cf. Av. ayha're ‘they are sitting’ < *asarai < *-er-oy; d Arc. kei-oi 
‘you lie’; e Cypr. kei-toi ‘he lies’; f cf. OIc. heite 'I am called’ < *-ai < PIE dial. *-oy; 8 cf. Luw. -tuwari 

For the 1 sg. (best preserved in Anatolian) cf. thematic pres. *-h 2 and perf. *-h,e (some 
reconstruct *-h 2 o(r) for the middle); for the 2 sg. pres./aor. *-s- and perf. *-th,e (some recon- 
struct *-th 2 o(r) for the middle); for the 3 sg. pres./aor. athematic *-t- and thematic pres./perf. 
*-e; for the 3 pi. pres./aor. *-nt- and perf. *-r/-er (the *n/r here reminds one of heteroclitic 
nouns - p. 77-78). Some linguists reconstruct the PIE fonns without the particle *-r, which 
they believe to be secondary (stemming supposedly, not too convincingly, from the 3 pi. 
ending). However, it is not impossible that *-r was facultative or dialectal in PIE. The pres- 
ent/aorist-like endings *-so, *-to(r), *-nto(r) are sometimes thought to be innovative (with 
*-o originating perhaps in the 3 sg. ending *-o), but they must have been there already in 
PIE. Others think that variant endings point to two different original categories 13 (one with 
the ending *-to and one with the ending *-o, etc.). Ved. fonns like brute ‘calls (for himself)’ 
but bruve (pass.) ‘is called’ and some other facts would perhaps point to a possible original 
meaning distinction. In the 1 pi., *-mc- is the ending attested in all verbal finite forms (the 
optional *-(s)-, attested in Greek/Hittite, is perhaps related to the pres. *-mes and then trans- 
ferred to 2 pi. *-(s)-d h we), and *-d h h, is probably some kind of particle. As in the active, the 
-w- in Hitt, -wasta may be from the old du. *-we-(s)-d h h, (cf. Ved. 1 du. pres, -vahe, aor. 
-vahi, Av. aor. -vadi). The 2 pi. *-d h we is unique to the middle, though the final *-e (cf. Gr. 
-cfte), perhaps accidentally, reminds one of the pres./aor. *-te and perf. *-e; some, however, 
reconstruct *-d h wo (cf. Luw. -tuwa-ri). Many later endings (like the 1 pi. Lat. -mur, OIr. 
- mmar ) are remodeled by analogy to the active. A thematic media tantum present with many 
reflexes is PIE *sek w -e-tor ‘follows’ > Ved. sacate, Gr. OTsxat, Lat. sequitur, OIr. sechithir. 

These are the aorist (“secondary”) endings; 


TABLE 1.42 IE MIDDLE AORIST 



PIE 

Ved. 

Gr. 

Toch. B 

Hitt. 

1 sg. 

*-h. 

-i 

{-priv} 

{-mai} 

-h\at(i)\ 

2 sg. 

*-s-o 

a 

-oo 




*-th,e 



-ta\i\ 

-ta 

3 sg. 

*-t-0 

-ta 

-TO 

-te 

-ta 


*-0 

-a 



-a\t(i)\ 

lpl. 

*-me-(s)-d h h. 

-mahi 

-|is(o)$a 

-mte 

-wasta Z 

2 pi. 

*-(s)-d h we 

-dhva\m\ 

-o§s 

-t 

-dum\a\\t\ 

3 pi. 

*-nt-o 

-ata, -nta 

-VTO 

-nte 

-anta\t(i)\ 


*-r-o 

-ra\n\ 





Cf. Av. -sa. 


The endings were probably mostly the same as the present ones but with no *-r (i.e., 
dial. *-y). Ved./Av. -i would point to *-h, (unlike the present *-h 2 e(r)). Perhaps the 1 pi. 
*-mes-d h h, was originally the present ending and *-me-d h h, the aorist ending. 

Subjunctive 

The present/aorist forms we have seen were all indicative (the perfect had no mood 
distinction but can be considered indicative of a sort). The subjunctive and optative were 
originally probably normal verbal suffixes (like many others that existed in the present - 
p. 94-95) that only later became full-fledged moods. The PIE subjunctive probably 
expressed something that the speaker thought was uncertain, a future action with a slight 
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reservation (e.g., *h,esoh, ‘I suppose/guess I will be’)- Thus, it is understandable that it 
yielded the future tense in Latin. The formation of the subjunctive was simple - root in 
the full grade + suffix *-e-/-o- + present/aorist endings. The suffix *-e-/-o- was formally 
(and probably etymologically) identical to the thematic vowel and followed the same 
qualitative pattern. The subjunctive thus in a way presents the “thematized” forms of the 
indicative, where the already thematic forms become “doubly thematic” in the subjunc- 
tive (with *-e-e-, *-o-o-) and the athematic forms become thematic. Why the thematic 
vowel (or a morpheme formally identical to it) functions as a subjunctive suffix is not 
clear. These are the forms of athematic verbs: 


TABLE 1.43 IE ATHEMATIC SUBJUNCTIVE 



PIE 

Ved. 

Gr. (Horn.) 

Lat. ' 

1 sg. 

*h,es-o-h 2 

asa\ni \ c 

ECO 

ero 

2 sg. 

*h,es-e-s-(i) * 

asas(i) 

s\p\i; 

eris 

3 sg. 

*h,es-e-t-(i) b 

asat{i) 

£\p\ 

erit 

1 Pi 

*h l es-o-me(/o-(s/n)) 

asama 

£\0)\|J,£V d 

erimus 

2 pi. 

*h,es-e-te 

asat\h\a 

e\t)\te 

eriti\s\ 

3 pi. 

*h,es-o-nt-(i) 

asan 

eVdVti 

erunt 


a Also *-e-h,-i? (p. 93-94); b also *-e-e? (p. 93-94); c cf. the older ending in brav-a ‘I speak’ (-a 13x in 
RV); d cf. Horn. 1-o-psv ‘we would go’ for the original short *-o-; ' future tense. 


It is not clear whether the endings in PIE were originally primary, secondary, or both 
(either mixed or facultative). Vedic has a curious mix of primary and secondary endings. 
The 1 sg. shows only (thematic primary/present) *-h, (unlike in the optative), which is 
in accord with the fact that all subjunctive forms were “thematic”. Thus, other primary 
endings might be expected as well. However, some Vedic endings (1 pi. -ma, 3 pi. -n, 1 
du. -va), parallelism to the optative, and structural reasons (a mood unmarked for present/ 
aorist could have used generic, and not present, endings) could point to originally sec- 
ondary endings. Both primary and secondary endings can be secondary in certain cases 
in later languages. If variants existed already in PIE, they probably carried no special 
(present/aorist) semantics. The forms of thematic verbs are: 


TABLE 1.44 IE THEMATIC SUBJUNCTIVE 



PIE 

Ved. 

Gr. 

Lat. 1 

1 sg. 

*b h er-o-o-h. 

vdha\ni \ c 

(psptG 

fer{am } 

2 sg. 

*b h er-e-e-s-(i) a 

bharas d 

#ph<; 

feres 

3 sg. 

*b b er-e-e-t-(i) b 

bhdrdt(i ) 

#pn e 

feret g 

lpl. 

*b b er-o-o-me(/o-(s/nj) 

bharama 

tpepcopev 

fer\e\mus 

2 pi. 

*b h er-e-e-te 

vdhat\h\a 

(psprjxe 

fereti\s\ 

3 pi. 

*b h er-o-o-nt-(i) 

vahan 

cpspcom 

fer\e\nt 


a Also *-e-e-h,-i? (p. 93-94); b also *-e-e? (p. 93-94); c cf. older arc-a ‘I may shine/praise’, GAv. yaoja 
‘I will yoke’; d cf. vahasi; e cf. Arc. exp ‘has’, Thess. 3 eai| ‘wishes’ < *-et (?); * future tense (3rd/4th con- 
jugation); B with -t < *-ti, cf. Osc. fakiiad ‘makes’ < *-t. 


*-ee- contracted to *-e-, and *-o-o- to *-o- (others believe that It. -e- in all persons is 
actually archaic). The contraction was probably post-PIE (Gatha-Avestan still has two 
syllables there). 
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Optative 

The optative probably expressed the wishes of the speaker (e.g., *h,syeh 1 m ‘I would 
be’). An athematic and thematic optative existed, though the latter was perhaps very late 
PIE, post-Tocharian (p. 91-92, 455). The PIE optative yielded the (so-called) subjunctive 
mood in Germanic and Italic, and the imperative in Balto-Slavic. The athematic forms 
were made of the zero-grade root, an alternating suffix (sg. *-yeh 1 -, pi. *-ih r ), and mostly 
generic (“aorist/secondary”) endings: 


TABLE 1.45 IF, ATHEMATIC OPTATIVE 



PIE 

Ved. 

Gr. (Horn.) 

OLat. c 

1 sg. 

*h 1 s-yeh 1 -m 

syam 

81T|V 

siem 

2 sg. 

*h 1 s-yeh 1 -s a 

syds 

81T|<; 

sie\s\ 

3 sg. 

*h 1 s-yeh 1 -t b 

svat 

SIT) 

sie\t \ d 

lpl. 

*h 1 s-ih,-me 

sy\a\ma 

sijxe v| 

sun {us} 

2 pi. 

*h 1 s-ih l -te 

sy\aVa 

SITS 

sit {is} 

3 pi. 

*h,s-ih 1 -ent 

sy{ur} 

818V 

sien\t\ 


a Cf. the OCS athematic imp. ezd-h! ‘eat!’ < *h,ed-y- (also dazd-kl ‘give!’); b cf. OCS esa ‘if only’ < 
*h,esyeh|t (with secondary *h,es-); c subjunctive; d with secondary -t < *-ti, cf. oldest sied with -d < *-t. 


The ablaut scheme, typical for athematic paradigms, is preserved in Greek and 
Latin. The zero-grade *-ih,- can also be seen in Vedic (middle opt. bruv-T- ‘talk’), Old 
Church Slavic (imp. dad-i-te! ‘give (pi.)!’), Lithuanian (dial. imp. sedzTt’e! ‘remain 
(pi.) seated!’), Gothic (pres, wil-ei-ma ‘we will’, cf. Lat. subj. uel-T-mus ‘we may 
want’), and Tocharian (opt. klyaus-i-m ‘I might hear’, p. 468). The 3 pi. ending was 
*-ent from the athematic present (not the aorist *-nt), based on Latin/Greek. The the- 
matic optative had a full-grade root + thematic vowel *-o- + suffix *-ih r + generic 
(“aorist”) endings; 


TABLE 1.46 IE THEMATIC OPTATIVE 



PIE 

Ved. 

Gr. 

ocs d 

Goth. 

1 sg. 

*b h er-o-yh 1 -m 

bhav\e\ya\m \ a 

(pspoi{(xi} c 

bgde{mb} e 

bairau 

2 sg. 

*b h er-o-yh,-s 

bhaves a 

(pspoii; 

beri ! f 

bairais 

3 sg. 

*b h er-o-yh,-t 

bhavet a 

(pspoi 

beri! 

bairai 

lpl. 

*b h er-o-yh,-me 

bharema 

(pspoips|v| 

berem\b\! 

bairaim\a\ 

2 pi. 

*b h er-o-yh r te 

Skr. bhareta 

(pspOlTS 

berete ! g 

bairaip 

3 pi. 

*b h er-o-yh,-ent 

vah\e\y{ur} b 

(pspoiev 


bairain\a\ 


a *b h u h 2 _ ‘be’; b *weg h - ‘convey’, cf. the old ending in Av. baraiwn; c cf. Arc. eljeLaitvota ‘I would drive 
out’; d imperative; e ‘I would be’ (*b h uh 2 - ‘be’); f cf. OPruss. wedaisl ‘lead (sg.)!’; g cf. OLith./dial. sakaite! 
‘will you (pi.) please say!’ 


The lack of qualitative ablaut of the thematic vowel is conspicuous. In most languages, 
the suffix *-oyh r yields (or reflects just like) *-oy-. However, the Slavic acute in *-eme/ 
o/t>, *-ete would point to a laryngeal. 
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Imperative 

The imperative, easily reconstructable, had forms only for the 2nd and 3rd persons. It had 
eventive endings in the 3 sg. and 2/3 pi., and was characterized by the particle *-u in the 
3 sg./pl. The athematic forms were: 


TABLE 1.47 IE ATHEMATIC IMPERATIVE 



PIE 

Ved. 

Gr. 

Lat. 

Hitt. 

2 sg. 

*h,ey 



f 

ep d 


*h,i-d h i B 

ihi b 

Tvh 


It 

3 sg. 

*h,ey-t-u 

etu c 



eptu d 

2 pi. 

*h,i-te 

it a 

ITS 

\i\te 

ltte\n\ 

3 pi. 

*h,y-ent-u 

yantu 



app\a\ntu d 

a Cf. OCS i 

i-d\i\! ‘ go ! ’ , v\ezd\-b! ‘ know ! ’ ; 

b cf. sru-dhi! ‘hear!’ for -dh-\ 

c cf. astu < *h| 

es-tu for the accent; 


d epp- ‘take’ (PIE *h,ep-). 


The 2 sg. is either the full-grade endingless root or the zero-grade root plus *-d h i (prob- 
ably a particle). Except for the *-d h i form, the singular has the full-grade and the plural the 
zero-grade root, as usual in the athematic stems. The thematic forms were: 


TABLE 1.48 IE THEMATIC IMPERATIVE 



PIE 

Ved. 

Gr. 

Lat. 

Goth. 

2 sg. 

*b b er-e 

bhara 

tpsps 

age * 

bair 

3 sg. 

*b b er-e-t-u 

bharatu 



bair\a\d\au \ b 

2 pi. 

*b b er-e-te 

bharata 

tpspSTS 

agite 

bairip 

3 pi. 

*b b er-o-nt-u 

bharantu 



bair\adau ] 


ag- ‘drive’ (PIE *h 2 eg-); b as if from a full-grade *-t-ow? 


The 2 sg. has a zero-ending (cf. also OIr. beir!). The 2 pi. imp. seems to have been 
identical in some cases (like *h,ite, *b h erete) to the 2 pi. pres./inj. 

There were also special future imperative forms that express the meaning of a delayed 
(future) command (*h,it5t ‘go later!’): 


TABLE 1.49 IE FUTURE IMPERATIVE 



PIE 

Ved. " 

Gr. c 

Lat. 

2/3 sg., 2 pi. 

*h 1 i-tot 

vittat b 

ITCO 

/fd d 

2/3 sg., 2 pi. 

*b b er-e-tot 


(pSpSTCO 

agito 

3 pi. 

*-(o)-n(t)-tot 


dial, -vxco 

euntd, agunto 


a Usually just 2 sg.; b ‘thou shalt regard’; c used as regular imperative for the 3 sg.; d cf. arch. Lat. da-tod 
‘let him give’, CIb. fa-fur 'he must give’; only 2/3 sg. in Latin. 


The ending *-tot (often reconstructed as *-tod) is usually explained as originally an 
abl. sg. (p. 63) of the pronoun *tod ‘that’, meaning originally ‘from that > after that’ or 
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the like, but the pronominal form was actually *tosmot (p. 85). The 2/3 sg. and 2 pi. had 
the same ending (probably from older * -0-tot, *-et-tot, *-ete-tot), and the 3 pi. was made 
from *-nt- + *-tot-. 


Non-finite verb forms 

Unlike many later IE languages, PIE had no infinitive but had participles, verbal adjec- 
tives, and verbal nouns. The participles were those of the active present, middle, and 
perfect. The present active participle had the *-ont- suffix (with no thematic vowel 
before it): m. +±+ holodynamic *b h er-5nt-s ‘carrying’ (Ved. bhar\a\n, Gr. (pepcov, Lat. 
fer\e\ns , OCS bery, Goth. bair\a\nds; Lith. nes\q\s; cf. Hitt, asanza ‘being’ (active!) but 
kunanza ‘killed’) - gen. sg. *b h er-nt-e/os (Ved. bharatas, Lat. ferentis ); f. *b h er-ont-ih, 
(Ved. bharantT, Gr. (pepouoa - cf. p. 38 for “laryngeal breaking”, OCS berg\st\i; Lith. 
nesanti) - gen. sg. *b h er-ont-yeh,-es (OCS bergst{g}, Lith. nesancios ) (cf. p. 67). Cf. also 
the athematic stem *h,s-ont-s ‘being’ (Lat. sons ‘guilty’ < *‘he who is’, OCS sy), *h[S- 
nt-ih, (f.) (Ved. satT, Gr. Dor. eaaaa). The present middle participle was a simple o-stem: 
*b h er-o-mh 1 n-o-s ‘carrying for himself/carried’ (Ved. bharam\a\nas, YAv. baramna-, Gr. 
(pepopevoq; OCS nesonn, Lith. nesamas ‘being carried’; CLuw. kisama- ‘combed’; cf. 
Lat. alumnus ‘nursling’) - f. *-mh,n-e-h 2 (cf. Lat .femina ‘woman’ < *d h eh|mh[neh 2 ‘she 
that is sucked’); athematic *-mh!nos (Ved. vida.nd-lvida.na- ‘finding’) The perfect par- 
ticiple was an 5-stem (*-wos- suffix): m. ±±+ holodynamic *weyd-w5s ‘in the state of 
knowing’ (Ved. v\i\dva\m\s~, GAv. v\A duua, Gr. eiSdx; ‘knowing’; Goth. weitwo\d\s ‘wit- 
ness’) - oblique *wid-us- (Ved. vidus-, OCS nesns- ‘having carried’, Lith. //fas- ‘having 
left’, cf. Goth, noun berusjos ‘parents’); f. *wid-us-ih, (Ved. vidusT, Gr. Horn. i6ma, OCS 
nesnsi, Lith. likusi), n. *weyd-wos (Gr. siSoq). Unlike the participle of *woydh,e (p. 98), 
other perfect participles were originally reduplicated (cf. Ved. ja-ghan-vd\m\s- from PIE 
*g wh en- ‘beat’). 

Besides the three participles, embedded paradigmatically in the PIE verbal system, 
there were also some verbal adjectives and nouns, less directly connected (in the realm of 
word formation) to verbal finite forms. The zero-grade verbal adjective in *-to- is widely 
attested, e.g., *sth,-t-o-s ‘placed’ (Ved. sthitas ‘standing, settled’, Gr. araxoq ‘placed, 
standing’, Lat. status ‘set’; cf. OCS pro-strbt-b ‘spread’, Goth, nasips ‘saved’). The adjec- 
tives in *-n-o-s (cf. Ved. vinnas besides vittas ‘found’; OCS znam, OEng. ge-cnawen 
‘known’; cf. also *plh[nos - p. 37) of a similar function are not as widely attested. The 
suffix *-t- was present also in the /-stem verbal noun: *g w m-t-i-s ‘going’ (Ved. gatis, Gr. 
Paaiq ‘stepping’). The ace. sg. of the *-t-u-s nouns might have been used as a supine 
(after motion verbs): *-t-u-m (cf. Lat. sup. - turn , OCS sup. -tn, Lith. dial. sup. -tif, Ved. 
inf. -turn), cf. Lat. cubi-tum uenerunt ‘they went to sleep’, OCS idg lovi-t-u ‘I go to hunt’. 


POSTPOSITIONS 

A considerable number of postpositions can be reconstructed for PIE, with some of them 
functioning also as adverbs (p. 80-81). Most modem and some older IE languages (like 
Latin, Greek, Old Church Slavic, or Gothic) predominantly have prepositions (which 
are to have an even more important role in younger IE languages with simplified or lost 
declensions), which often function as verbal prefixes (modifying the meaning of the verb) 
as well, cf. the Lat. preposition ad ‘to, toward’ and ad-uenid ‘I come to’. However, these 
verbal prefixes were originally independent of verbs (cf. Eng. come to), and some more 
archaic old IE languages (Anatolian, Vedic) predominantly had postpositions, which is 
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mostly held to have been the case in PIE as well (though it is possible that there were 
also some prepositions, or that at least some postpositions could have also been placed 
prepositionally, etc.)- Some postpositions can be found in languages that are mostly prep- 
ositional (like Latin, Greek, Old Persian), cf. Lat. me-cum ‘with me [lit. me-with]’. As 
examples of reconstructed postpositions (later becoming prepositions in many languages) 
cf., e.g., *(h,)en ‘in’ (Gr. ev, Lat. in, OPruss. en, Goth./Eng. in, OIr. i), *h,epo ‘(away) 
from’ (Ved. apa, Gr. ano/ano, Lat. ab, Goth, af, Eng. of), etc. There are a few suffixes that 
appear in postpositions like *-ti, *-b h i, *-r(i), cf. *pro-ti ‘towards’ (Ved. prati, Gr. epic 
Jipoii, OCS prot-ivb ‘against’ - Slav, also *proti>) from *pro (Ved. pra, Gr. npo ‘before, 
forth’, Lat. pro ‘in front of, before’, OCS pro-, Lith. pra- ‘through’, Goth, fra-, Eng .fro- 
nt, OIr. ro-). As already said, PIE (mostly plural) case markers, like instr. *-b h i, were also 
originally postpositions, and some of them were not completely transformed to proper 
endings (p. 63). 


CONJUNCTIONS 

Not a lot of conjunctions can be reconstructed. It is an old idea that PIE had almost no 
subordinate clauses (or that they were at least rare) 14 and expressed those constructions 
mainly through participles (e.g., ‘the going man’ instead of ‘the man that is going’ - but 
cf. the relative pronoun *yo-, p. 88-89), verbal nouns, and likely pronominal forms used 
as conjunctions. This means that the syntax of the subordinate clauses (and its conjunc- 
tions) is mostly innovative in later IE languages. However, not even the conjunctions in 
independent clauses were stable. The two best-known independent-clause reconstructed 
conjunctions are *-k w e ‘and’ (Ved. ca, GAv. -ca, Gr. re, Lat. -que, CIb. -kue, OIr. -ch, 
W -p, Goth. -/?, Hitt, -kku; for the negative *nek w e cf. below) and *-we (with optional 
monosyllabic lengthening - p. 54-55) ‘or’ (Ved./GAv. va, Lat. -ue, CIb. -ue, Toch. B wa- 
t). Unlike ‘and’ and ‘or’ in modern IE languages, these were enclitics (placed after the 
second word - one could call them postpositive particles), cf. Lat. Senatus populus-que 
Romanus ‘the Senate and people [lit. people -and] of Rome’, Ved. Mitram huve Varunam 
ca ‘I invoke Mitra and Varuna [lit. Varuna-and] ’ . As in modem languages, they could be 
used twice with slightly different meanings (*wlk w os h,ekwos-k w e ‘wolves and horses’ 
and *wlk w os-k w e h,ekwos-k w e ‘both wolves and horses’), cf. Gr. (Horn.) jiaxijp dv5pcov 
re ftecov T£ ‘father of (both) men and gods’, Ved. divas ca gmas ca ‘(both) of heaven and 
of earth’, naktam va hi diva va varsati ‘for it rains (either) by night or by day’. These 
conjunctions connected phrases, verbs, and sentences as well, cf. Ved. a devebhir vahi 
vaksi ca ‘come with the gods and sacrifice!’ In Rg-Veda (rarely in Greek as well), when 
connecting what should be vocatives only the first form is actually voc. (the other one 
is nom.), cf. Ved. Vdyav Indras ca. . . avatam ‘O Vayu [voc.] and Indra [nom.], come!’ 
This reminds one of the pres./inj. (p. 97) and instr. sg./nom. du. ellipsis (p. 63). As in 
later languages, some pronominal forms (like *yod ‘that’, *k w od ‘which’, *k w id ‘what’- 
p. 88-89) probably functioned as conjunctions already in PIE, cf. Ved. yad ‘because; 
when’, Lat. quod ‘(in) that; because; though’, Hitt, kuit ‘because’, etc. 

PARTICLES 

Particles are usually short and unaccented. Several particles can be reconstructed for PIE, 
the best known of which is the negative particle *ne ‘no(t)’ (p. 54). Cf. also the compound 
*ne-k w e ‘and not’ (Lat. neque ‘not, and not, also not’, CIb. nekue ‘nor, neither, and not’, 
Hitt, nekku ‘not?’, Alb. nuk ‘not’). A prohibitive particle *meh[ (Ved. ma, Gr. pirj. Arm. 
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mi, Toch. A/B ma, Alb. mo, cf. p. 97) can also be reconstructed, originally perhaps the 
imperative of *meh[-, cf. Hitt, reduplicated mi-mm(a)- ‘refuse’ (Kloekhorst 2008). A cou- 
ple of other particles can be reconstructed, but often the meaning is not too clear - cf. the 
emphatic particle *g wh e/o (Ved. ha, gha, OCS ze, -go in the gen. sg. m. pronouns like togo 
‘that’). In some cases, the particles can appear often after certain forms (cf. the verbal 
particles that had integrated with verbal forms already in PIE - p. 92), like *g/ge after 
personal pronouns, cf. Gr. eyco-ye ‘I’ - dat. sg. epoi-ye, Goth. acc. sg. mi-k ‘me’ (but also 
Goth, au-k ‘also’), Hitt. acc. sg. tu-k ‘you’ (p. 181), etc. 


INTERJECTIONS 

As usual, interjections are often onomatopoeic and can have otherwise non-existent or 
rare phonological and phonotactical characteristics. In historical view, their phonological 
development can be irregular, and cognates can be accidental, which makes their recon- 
struction provisory. Many of them are really simple, like *a ‘ah! ’ for surprise or pain (Skr. 
a, Gr. a, Lat. a, Lith. a, Goth, o ), *ay for surprise or pity (Skr. e, ai, Gr. ai, Lat. ai, etc.), 
or exclamation (sometimes used with vocatives) *5 ‘oh! ’ (Gr. &, Lat. o, OIr. a, a, Eng. oh, 
etc.). In some cases, the usual sound laws clearly do not apply (for obvious reasons), cf. 
the laughter onomatopoeia in IE languages (PIE *ha ha?): Skr. ha ha, Gr. a a, Lat. hahae, 
Slav, ha ha, Eng. ha ha, etc. In others, it (partially) does - cf. Lat. eheu ‘ah, alas!’ (Latin 
otherwise has no diphthong eu) and Skr. aho ‘o, oh, alas!’ (with *ew > o), where PIE 
*ehew (or *eHew?) is reconstructable. A perhaps less trivial reconstruction, but still with 
many irregular correspondences, is *way ‘alas, woe!’ (Gr. ouai, Lat. uae, GAv. vaiioi, 
Goth, wai, W gwae, Ann. vay). 

Note: I would like to thank David Mandic, Thomas Olander, Petra Sostaric, and my students Matija 
Muzek and Marul Kuljis for valuable comments on the first draft of the chapter. Of course, all the mistakes 
are just mine. 


FURTHER READING 

The literature on specific problems of PIE morphology is immense - however, there are 
no modem one -volume monographs dedicated to the whole of PIE morphology as such 
(or even to the nominal and verbal parts separately). Most PIE comparative grammars 
(cf. p. 56-57) deal with both phonology and morphology, though their overview of the 
morphology is often concise in many segments. Of those more recent ones, Szemerenyi 
1996 is rather detailed, with very helpful explanations of numerous reflexes of morpho- 
logical forms in separate IE languages, often providing commentary on different views 
on problems, as well as an extensive (pre-1990) bibliography. However, his views are 
in some instances dated, as is to be expected. The reconstructions in Beekes 2011 are 
sometimes idiosyncratic, though some of his heterodox views are very interesting (his 
take on nominal inflection, such as it is, is presented in Beekes 1985). Fortson 2010 is 
not too detailed in the PIE section and, for instance, does not provide separate paradigms 
for all nominal athematic stems. Meier-Briigger 2003 is very useful for his references to 
literature but also often not too detailed (e.g., he does not list the paradigms in separate IE 
languages or the reconstructions of all nominal athematic stems), while sometimes deal- 
ing too extensively with less important minutiae. A lot of information on the reconstruc- 
tion of PIE morphology can be found in handbooks on specific branches and languages, 
e.g., in Sihler 1995, Ringe 2006, or Olander 2015 (with useful references and short sur- 
veys of differing views on specific reconstructions). An overview of reconstructed PIE 
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nouns and adjectives (with their stems) is available in NIL. The problem of PIE gender 
is tackled in a recent monograph (Matasovic 2004) and a collection of articles (Neri & 
Schumann 2014). A recent monograph covering the IE dual is Fritz 2011. Schmidt 1978 
gives an overview of the material and earlier literature concerning personal pronouns, 
but his reconstructions are rather implausible (cf. Katz 1998 and Kapovic 2006 for more 
recent takes on PIE personal pronouns). For numerals, Szemerenyi 1960 is still useful but 
obviously dated; the Gvozdanovic 1991 collection provides a survey of numerals in all IE 
branches with a number of eminent scholars taking part. Rau 2009 deals with the decades 
(and also the Caland System). Monographs concerning various aspects of the PIE verbal 
system are more numerous than works on the nominal system. Hewson & Bubenik 1997 
give a useful overview of the verbal systems in various IE branches. LIV is an established 
handbook on PIE verbal formation, although it has its perks. Clackson 2007 gives an 
excellent short, problem-based overview of the current issues (the Greco-Aryan model, 
present/aorist origin, injunctive, optative, middle, etc.) in reconstructing the PIE verbal 
system. For different famous approaches to pre-PIE origins of the verbal system, cf. Rix 
1986 and Jasanoff 2003. The much older Watkins 1969 is still insightful as well, though 
very controversial. The following monographs deal with specific parts of the verbal sys- 
tem: the aorist (Hardarson 1993 - root aorist; Drinka 1995 - sigmatic; Cardona 1960 - 
thematic; Bendahman 1993 - reduplicated); the perfect (di Giovine 1 990/1 996a/b); the 
middle (Jasanoff 1978, Stempel 1996); the moods (the already mentioned Rix 1986); 
verbal reduplication (Niepokuj 1997). Further references are available in the mentioned 
works and chapters in this volume on separate IE branches. 


NOTES 

1 Hitt, -us may indeed point to *-ms, not *-ns (cf., e.g., Kloekhorst 2008: 929). 

2 With further possible *-m- cognates in Tocharian (p. 456). 

3 One must be aware that not all the endings of the words usually taken as examples 
are always attested. Some actual forms usually cited in full paradigms in handbooks 
(and sometimes also here) are also not always actually attested as such in the texts 
(but are assumed according to other attested forms). 

4 Cf. Matasovic 2004: 173-176. 

5 The original distribution of the thematic *-om (< *-o-om, *-eh 2 -om) and athematic 
*-om is possibly preserved in (part of) Slavic, cf. the Neo-Stokavian (BCMS) gen. 
pi. (o- and a-stems) -a < PSlav. *-L< PIE *-5m/-eh 2 om but gen. pi. (/-stems) -T (never 
**-(i)ja) < *-iji> < PSlav. *-i>ji> < PIE *-ey-om (e.g., the o-stem gen. pi. zuba ‘teeth’ < 
PIE *gomb h om but the old /-stem gen. pi. crvT ‘worms’ < PIE *k' v rmeyom). 

6 This did not occur in the gen. pi. ending *-om, because this was originally *-om in 
most cases (p. 70). The *-m did not drop in the Stang-form *g w om (p. 71) either. 

7 The PIE reconstruction here mostly follows the one in Kapovic 2006. 

8 English terminology is rather unfortunate because the traditionally reconstructed 
PIE categories perfect and imperfect may be confused with aspectually redefined 
categories of perfective (for traditional “aorist”) and imperfective (for traditional 
“present”). 

9 The claim that the Lithuanian original acute accent in the 1 sg. -ii (reflexive - uo-si ) 
points to a laryngeal is not completely compelling, since Balto-Slavic exhibits the 
acute accent on some (non-contractional) long-grade endings as well (e.g., in /-stem 
loc. sg. *-ey, cf. the Latvian reflexive infinitives in -tie-s from PIE abstract nouns 
with the loc. sg. *-t-ey). 
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10 Cf. Pinault 1992: 153-154. 

11 It is possible to connect the *-ey-(e)- (zero *-i-) suffix with *-y-e- originally. 

12 Such injunctives could have been the basis for the making of the Indo-Iranian/Greek 
imperfect and the Old Irish absolute berid ‘carries’ < *b h ereti, berait < *b h eronti but 
conjunct -beir ‘says’ < *b h eret, -berat < *b h eront, though this is highly controversial 
(p. 358, 372-373). 

13 Cf., e.g., Oettinger 1976, Rix 1986, 1988, Kiimmel 1996 for the reconstruction of 
stative (besides the middle). 

14 Cf. Hermann 1895. 
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CHAPTER 3 


PROTO-INDO-EUROPEAN 

SYNTAX 

Thomas Krisch 


After general remarks mentioning and evaluating older literature and the data basis for 
Proto-Indo-European (henceforth PIE) syntax, this article will describe major fields of 
present day research. Clause alignment and the ergative hypothesis are not dealt with in 
this article (see the treatment by Matasovic, p. 162-164). The end of this article contains 
a short conclusion. 1 


GENERAL REMARKS 

A very short survey of older literature on PIE syntax 

Berthold Delbriick (1842-1922) is the “father” of comparative PIE syntax. His mon- 
umental work on comparative syntax in three volumes encompasses phenomena that 
we now subsume under morphosyntax and syntax. Among them are the use of nominal 
number, case (including matters of valency of the verb), prepositions, the alternations 
between adjectives and nominal genitives, the use of tenses, moods and genera verbi and 
the organization of sentence structure (word order, elliptical constructions, appositions, 
agreement issues, question sentences, relative clauses). Delbriick analyses numerous 
examples from ancient Indo-European languages (AIELs), and he states that if construc- 
tions and meanings coincide in the AIELs, this could mean that the same was true also for 
PIE (Delbriick 1967=1893: 86). This statement still makes sense today. 

The syntactic framework of Delbriick and other eminent researchers like Jacob Wacker- 
nagel (e.g. Wackemagel (1920 and 1924), outstandingly translated and edited by Lang- 
slow (2009)) was based on traditional Latin grammar and philological interpretation. The 
next methodical landmark was an article in the spirit of structuralism by Calvert Watkins 
(1963) about word order in Old Irish with archaic verb-final position (which Watkins 
expects to reflect the PIE state). These are cases of Bergin’s Law (Watkins 1963: 24) and 
of tmesis. In Irish tmesis, the preverb, optionally followed by Wackemagel clitica, intro- 
duces the sentence, and the finite verb ends it. Assuming a univerbation process where the 
verb moves from the final position to “its” preverb in sentence-initial position, Watkins 
succeeds in integrating the deviating (unmarked verb-initial order) Old Irish data into a 
general picture of PIE with verb-final as the unmarked order. The preverb could not move 
to the end of the sentence because that would violate Wackemagel’s Law in case clitics 
are attached to it. This reasoning is still convincing and very modem. The first part of 
McCone’s outstanding dissertation (1979) takes ideas from Watkins 1963 and describes 
sentence patterns in Hittite, Vedic and Greek with Watkins’ formal devices. 

The 1970s laid the ground for much of what is still discussed today in studies on 
PIE syntax. They brought typology (in the Greenberg 1966 tradition, cf. also Matasovic, 
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p. 165) and theory to bear on the description of PIE syntax, with an emphasis on word 
order: Lehmann (1974) reconstructs *SOV, Friedrich (1975) thinks *SVO and *SOV are 
equally important, and Miller (1975) considers *VSO as well. Referring to this typologi- 
cal approach, Watkins (1976: 305) deplores . . that the rebirth of Indo-European syntax 
has taken place in the bed of Procrustes”. This criticism is true as long as one looks at 
the word order typology as rigid order (as was partly done in the cited literature) and 
wants to sweep the counterexamples under the carpet. The development of research in 
the last 40 years has brought some progress in this respect; see our section on word order. 
Another criticism of Watkins, however, is still to be kept in mind today: He suspects 
that the picture of PIE syntax could also be influenced by “Teeter’s Law” (the language 
of the family you know best always turns out to be the most archaic; cf. Watkins 1976: 
310). I think that every researcher (including myself, of course) faces this problem, but 
this does not have to be evaluated in a purely negative way. Also, the chance of getting 
precious new insights is thus increased. 

Avery influential article of the 1970s was Dressier 1971. In that paper, the author men- 
tions a number of important fields of research on PIE syntax, which are, among others, 
still elaborated in current research: typological issues (cf. also Dressier 1968), the syntax 
of case (see our sections on case functions and on argument structure), sentence prosody 
(see our section on Wachemagel’s law), word order and text syntax (see our section on 
verb positions with arguments taken from text syntax). 

Today’s research on PIE syntax encompasses a broad spectrum with regard to contents. 
Most of it shows a sharpened awareness of methodology and search for points of contact 
with general linguistic ideas. This new perspective is an outcome of research done in the 
1960s and 1970s. 

The data 

The comparative method of reconstructing the PIE language system has been very success- 
ful in the realm of phonology, morphophonology (especially ablaut) and morphology. It was 
developed in the 19th century (e.g. Franz Bopp (1791-1867), Jacob Grimm (1785-1863) 
and the Neogrammarians like Karl Brugmann (1849-1919)) and is still being refined on 
today in numerous publications. Using this method, one collects probable cognates (units in 
fonn and function, normally words) in a number of related languages, investigates regular 
correspondences between them and finally reconstructs proto-fonns. These, in turn, give rise 
to sound correspondences, to morphophonological theories of nominal and verbal inflection 
and derivation (accent-ablaut types), to the reconstruction of compound types, etc. 

It is very difficult to apply this comparative method to syntax since the oldest texts in 
a number of ancient Indo-European (IE) languages do not even share a type of text. In 
Hittite, one finds prose texts with laws, narratives and rituals. In Greek (apart from the 
Mycenaean inventories), one has the great monument of Homer, a poetic text; prose texts 
are attested only in later times. In Vedic and Avestan, we have metrical texts with ritual- 
istic content, etc., etc. There are no primary data for PIE sentences available, and there 
are no attestations of exactly the same sentence with all words cognate in more than one 
language in the AIELs with the exception of frozen syntagms (see below) in the language 
of poetry. The attempted material reconstruction of “real” PIE sentences normally reveals 
only the state of research for the respective reconstruction, and therefore one cannot use 
it as primary data, of course. 2 

In the case of attested texts in ancient languages, one at least has access to some data 
which have been handed down and have survived time largely by chance. Even these 
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data suffer from big disadvantages when compared with linguistic data from living lan- 
guages. One cannot freely form sentences which can be judged by native speakers as 
to whether they are grammatical or ungrammatical. Normally syntactic reconstruction 
can just reconstruct schemata, abstract sentence structures, and thus differs from phono- 
logical, morphological and lexical reconstructions with more “substance”. But there are 
very rare cases where one can reconstruct, e.g., word order including “substance”, with 
the same (ideally etymologically related) words occurring in more than one language. 
This could offer a “punctum Archimedis”, an Archimedean starting point. Let us start 
with the one which to my knowledge is the best syntagm that can be reconstructed with 
“substance” for PIE, namely that a hero is killing a dragon (cf. also Watkins 1995, esp. 
chapter 27 and 28; Krisch 2001). There is good evidence to reconstruct this syntagm as 
offered in (1), with government of the verb to the right (data from Vedic, Avestan and 
Greek (see the examples in (2a-d)): 

(1) PIE *(e)- g wh en-t *(H)e/og" h -im 

(Augment)-kill-3sG.iNj.PRES./(iMPF.) serpent-Acc.SG. 

► 

‘He kills /(killed) the serpent’ 

(2) a) Vedic: RV 1,32,2 ( ahann < PIE *e-g wh en-t; ahim < PIE *(H)e/og wh -im) 

ahann ahim 

kill.3sG.iMPF. serpent.Acc.SG. 

► 

‘he killed the serpent’ 

b) Avestan: Y.9,8 (janat< transposed PIE *g wh en-e-t; azun < PIE *(H)e/og wh -im) 

VO janat azlm dahakam 

who kill. 3 sg.inj. pres. serpent.Acc.SG. Dahaka.Acc.SG. 

► 

‘who killed Azi (“the dragon”) Dahaka’ 

c) Greek: Pindar P. 10,46 (OTS(pv£(v) < PIE *e-g wh e-g wh n-e reduplicated aorist; Gor- 

gon: lit. ‘the terrible one’; winged daemon with serpentine locks in the hair) 
s;t£(pv£v i£ Topyova 

kill.3sG.AOR. and Gorgon.Acc.SG. 

► 

d) Pindar, P. 4,249: [kteive < PIE *tkenye°/kj)enye 0 (injunctive present) or < PIE 
*tken-s°/ kjjen-s 0 (injunctive aorist); ocptv < PIE *(H)e/og wh -im] 

kteive . . . ocptv 

kill.3sG.iMPF. / aor. serpent.Acc.SG. 

► 

‘he killed the serpent /dragon’ 

If one combines the Greek sentences (2c) and (2d), one gets Greek £^£(pv£(v) ocptv, which 
can be projected back to *e-g wh e-g wh n-e *He/og wh im. Except for the tense of the verbal 
form, this reconstruction is the same as the one in (1), which was arrived at by the Vedic 
and Avestan forms. 
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But from this evidence one must not draw the conclusion that the ancient languages in 
question and PIE were necessarily languages with a VO order. This is confirmed by the 
fact that the inverse order (OV) of the same syntagm is attested in a number of ancient 
IE languages as well and can be reconstructed for the proto-language preceding Vedic, 
Greek and Hittite (which of course is PIE as well). Cf. the reconstruction in (3) with 
government to the left: 

(3) PIE *(H)e/og wh -im *(e)- g wh en-t 

serpent-Acc.SG. (Augment)-kill-3sG.iNj.PRES./(iMPF.) 

◄ 

‘He kills /(killed) the serpent’ 

This reconstruction is supported by syntagms attested in Hittite, Vedic and Greek (see 
the examples in (4a-d)): 

( 4 ) 

a) Hittite KUB 17.5 Vs I (Middle Hittite): ( kuenta < PIE *g wh en-t(o)) 

(17) D IM-a.s ii-it nu-kan m:,& il-lu-\ya-an-ka-ar\ (18) ku-en-ta DINGIR MES -sa 

kat-ti-is-si e-se-ir 

D IM-a5 uit nu=kan MU s illu\yankan ] (18) kuenta 

Weathergod come.3sG.PRET. sc=loc.ptcl Illuyanka.ACC.SG. kill.3sG.PRET. 

◄ 

DINGIR ME5 -5=a katti=ssi eser 

God.NOM.PL. = but with=him (lit. poss.pron.) be.3pL.PRET. 

‘The weathergod (i.e. Tarhunnas) came and killed (the serpent) Illuyanka; but the 
gods were with him’ 

b) Vedic RV 2,15,1 (alum < PIE *(H)e/og wh -im Jaghana < PIE *g wh e-g wh on-e) 

trikadrukesu apibat sutasv a asva made 

Trikadruka-day.LOC.PL. drink.3sG.iMPF. Soma. gen. sg. he.GEN.SG. intoxicationxoc.SG. 
alum indro jaghana: 

serpent.ACC.SG. Indra.NOM.SG. slay.3sG.PERF. 

◄ 

‘At the Trikadruka-days he drank from Soma; in the intoxication (caused) by this 
(one), Indra has slain the serpent’ 

c) Greek: H. Ap. 300-301 [Kietve < PIE *tken-ye°/k|3en-ye 0 (injunctive present) or 
< PIE *tken-s°/k|3en-s 0 (injunctive aorist)] 

evOa Spaicaivav / Kisivev aval; Aid; uio; 

there she-dragon.Acc.SG. kill.3sG.iMPF./A0R. lord.NOM.SG. Zeus.GEN.SG. son.NOM.SG. 

◄ 

and Kpatepoto (3toto 

with (lit. from) strong bow.GEN.SG. 

‘there, the lord (i.e. Apollo, TK), the son of Zeus, killed the (female) dragon with 
the strong bow’ 

d) Greek: Pindar, O. 13,88 ff. (ene(pv8(v) < PIE *e-g wh e-g wh n-e reduplicated aorist) 
Kai Xlpaipav ^up irveotaav Kai SoA,upou<; ewecpvev 

◄ 
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Kod Xipaipav mip Jivsoiaav icai 

and Chimaira.ACC.SG.F. fire.ACC.SG.F. breathe. ptcp.aor.acc.sg.f. and 

Eo7.U|.lO'U<; £7I£(pV£V 

Solymoi.Acc.PL. kill.3sG.AOR. 

‘And he (i.e. Bellerophontes, TK) killed the fire-breathing Chimaira and the 
Solymoi’ 

The Hittite example in (4a) shows a cognate verb form that fits with the reconstruction 
in (3), and the Vedic instance in (4b) is (with the exception of tense) almost an exact 
equivalent to (3). In the Greek example (4c), the ‘dragon’ word and the verb have only a 
semantic correspondence to the reconstruction in (3), and in (4d) only the verb is etymo- 
logically related to the reconstruction, and the object shows words and names for dragons 
with etymologies different from *(H)e/og wh i-. But, nevertheless, a word for ‘dragon’ is 
put in the accusative case in both Greek examples. This is as close as one can get when 
one wants to reconstruct genuine PIE sentences. 

The reconstructions in (1) and (3) seem to contradict each other. A number of conclu- 
sions from this fact are possible, e.g. the following: 

a) IE and the oldest languages were fluctuating between OV and VO. This would 
describe the fact that our sentence appears in both varieties in Greek and in Vedic 
Sanskrit. Note, however, that Hittite shows no variant with VO. This scenario could 
lead to a project which tries to disentangle the OV and VO features of the ancient IE 
languages and the reconstructed proto-language, using, e.g., typological criteria like 
those of Greenberg (1966) and his followers. 

b) IE was like Hittite, which, as Luraghi has shown in her seminal book from 1990, is a 
fairly strict OV language. Note also that our sentence in Hittite shows no variant with 
VO. The languages with VO order for this sentence, namely Greek, Sanskrit and 
Avestan, could reflect a younger state, a shift to VO. If one adopts this scenario, this 
could result in looking for layers of OV or VO in the oldest history of the languages 
showing variation and depicting developments from OV to VO in these languages. 

c) The IE base-type was OV, but under certain pragmatic conditions it allowed for a 
movement of the verb to the front of the sentence. This is the view I will adopt. 

We have seen that frozen syntagms in cognate languages can lead to revealing con- 
clusions for PIE syntax and may lead to further investigation. Most of these frozen 
syntagms go back to poetic language. An up-to-date overview of all aspects of IE poetic 
language including syntax (using a generative framework) is Hajnal 2008. An extensive 
collection of text correspondences and of frozen syntagms in AIELs can be found in 
Schmitt 1967. 

MAJOR PRESENT FIELDS OF RESEARCH ON PIE SYNTAX 
Word order 

Inspired by Mark Hale ( 1 987) and Paul Kiparsky (1995) and expanding some earlier ideas 
of Krisch 1990, I have used generative grammar (the Government and Binding model) 
to describe word order in AIELs and in PIE in a number of publications. The model I use 
now is slightly different from the models I have used in earlier publications, but it is 
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(5) cf. also Krisch 2015 
CP 

XP 


topicalized 

MATERIAL (TOPIC 
or FOCUS); Df-slot 


WL 2 -WL, 


VP 


V (finite) 


FIGURE 1.2 SENTENCE SCHEME 1 


(6) cf. also Krisch 2015 


XP 


CP 


WL 2 CP 


TOPICALIZED 
MATERIAL; Df-slot 


XP 


C’ 



C WL 1 VP 


V (finite) 


FIGURE 1.3 SENTENCE SCHEME 2 


compatible with the older model. Following Keydana (2011), I have integrated the label 
Df-slot, and I have renumbered the Wackemagel clitics following Keydana (2011) and 
Liihr & Zeilfelder (2011). For practical purposes (mainly motivated by the treatment of 
Wackemagel particles) I assume two schemes of phrase structure: (5) and (6). The two 
schemes are compatible with Government and Binding (the labels form a subgroup of 
the universal set of phrase structure), and they are in fact only one structure: The second 
scheme is derived from the first one by “Chomsky adjunction”. For the abbreviations 
used see the appendix above the bibliography. The Df-slot (discourse functional slot) is 
a “landing-site” for topical and focal material. The number of categories used is held as 
small as possible. The morphology is not done in a generative way; the word forms are 
inserted fully inflected into the syntactic tree. 

One can use these schemes for a number of phenomena, which I will shortly character- 
ize in the following sections. 

Discontinuous constituents (hvperbaton/tmesis) 

One of the most striking features of AIELs are discontinuous constituents. Most of them 
(except for movements inside DP and PP) can be quite simply described by moving parts 
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of constituents to the left or the right of the VP. In (7) (Scheme 1), the interrogative pro- 
noun xiq is fronted out of its DP to the front, into the Df-slot, which is not surprising for 
a clear focal element. In this case, only Wackemagel particles intervene (cf. also Krisch 
1998:372). 

(7) Horn. II. 1,8: 

Ti<; rap acpcoe 0£cov epiSi 

Who.NOM.SG. there they.Acc.DU. god. gen.pl. quarrel. dat.sg. 

^uver|K£ !idx£o0ai 

bring.together.3sG.AOR. fight. inf. mid. 

“Who on earth 3 of the gods brought both of them together to fight in quarrel?” 


FIGURE 1.4 STRUCTURE OF HOMER, ILIAD 1,8 



In (8), part of the DP is fronted, and part of it is extraposed to the right of the VP (by 
adjunction): The Homeric article/determiner oi, which has stronger demonstrative value 
than the article of later Greek (cf., e.g., Chantraine 1981=1953: 158), and which conveys 
anaphoric-topical information, is put into the Df-slot to the left, whereas the head of the 
NP, 0£oi, is extraposed to the right. 

(8) Horn. Od. 4,353: 

oi 8’ aid Pou^ovxo 0£oi 

The(se)NOM.PL. but always want.3pL.iNj. mid. god.NOM.PL. 

!l£|.ivfjo0ai £(p£Xp£COV 

remember.iNF.PERF.MiD. command.GEN.PL. 

“But the(se) gods always want to remember their commands” 
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FIGURE 1.5 STRUCTURE OF HOMER ODYSSEY 4,353 



Hyperbata are widespread among AIELs, and thus they are a syntactic device which 
can easily be reconstructed: In (9) one finds examples for this phenomenon in Vedic {utim 
sahasrasatamam is extraposed), (10) is an example from Latin ( pulmoneum is fronted), 
and there are also examples in Hittite ((11); the genitive is extraposed to the right). 

(9) Vedic RV 1,10,10: 

vrsantamasya humaha utim 

extremely.bull-like.one.GEN.SG. call.lPL.PRES.MiD. help.Acc.SG.F. 
sahasrasatamam 
most.winning.a.thousand.ACC.SG.F. 

‘We ask for the most bull-like one’s (i.e. god Indra’s, TK) help, which excels in 
winning a thousand’ 

( 10) Old Latin Plaut. Rud. 511: 

Pulmoneum edepol nimis 

Consisting. of.lung.Acc.SG.M. by.Pollux! too much 
uelim uomitum uomas 

wish.lso.suBj. vomit, acc. sg.m. vomit.2sG.suBj. 

‘I wish very much that you vomit a vomit consisting of lung particles.’ 
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(11) Hittite StBoT 8, Vs I 22 (Old Hittite ritual) (Otten & Soucek 1969: 20) 

II D Hantasepus harwani GI[(S-as)] 

Two Hantasepa-deities.Acc.PL. have. Ipl. pres. wood.GEN.SG. 

‘We have two Hantasepa-deities (made) of wood.’ 

Normally, the phenomenon of hyperbaton/tmesis is looked upon as a poetic device. 
But it is also a feature which appears in everyday speech. This is shown by the Plautus 
example in (10) and by the fact that inscriptions also show the phenomenon (cf. (12), 
where the location from which Dropides comes (the adjective AipiSvatoq) is moved to 
the right) 

(12) Greek IG II IIP 2636 (Attika) (cited after Pfohl 1980: 169): 

[A] pojTriSr^ KaT8^a(3e [djvaaa^tpov 

Dropides.NOM.SG.M. take.possession.of.3sG.AOR. equipped.again.Acc.SG. 

gLTU/./.OV Acpiiwuoc 
mine.ACC.SG. from.Ap h idna.NOM.SG.M. 

‘Dropides from Ap h idna took possession of a mine which was restarted’ 

A surely old PIE construction pattern is a sentence where a preverb (originally adverb) is 
fronted, thus appearing apart from “its” verb. This is usually called “tmesis” (Gr. xpfjatq 
lit. ‘cutting’). Already Wackernagel observed this for Greek: “from the earliest period, 
tmesis . . . is commonest when the preverb is in clause-initial position” (Wackernagel 1924: 
174 = Langslow 2009: 616). Actually, in this case, for PIE the term “tmesis” is mislead- 
ing. The most probable scenario for the oldest texts and for PIE is that the “preverb” had 
only a loose connection to the verb and has to be looked upon more as a (normally local) 
adverb (cf. also Krisch 1984, especially chapter 3), which can appear in sentence-initial 
position (Df-slot) to give a frame to the sentence. Of course, the sentence-initial position 
is not the only position of these adverbs in PIE and in the AIELs, but as already sketched 
above, Watkins (1963) has taught us that this construction helps to understand the devel- 
opment of the Celtic verb-initial construction. Examples (13) (15) offer some examples 
for this construction in Hittite, Vedic and Greek. In all three examples, the preverb can 
be understood as an independent adverb or as a preverb. For the Hittite example in (13) 
note that, e.g., German has a prefixed verb hinterherrufen ‘to call after someone’ with the 
prefix hinterher ‘behind’. 

(13) Hittite StBot 8 (Old Hittite ritual) Rs. Ill, 4 (Otten & Soucek 1969: 30-31): 

appananda=ma=sse ke memahhi 

behind.after=but=he.DAT.SG. this.Acc.SG./PL. say.lso.PRES. 

‘but from behind I say this to him (i.e. to the eagle who is set free, TK) / but after 
him (i.e. the eagle set free, TK) I say this’ 

(14) Vedic RV 6,53,4 

vi patho vajasatave cinuhi 

away path. acc.pl. obtaining.good.DAT. sg. arrange. 2 sg.imp. pres. 

vi mrdho jahi 

away adversary.ACC.PL. strike. 2 sg.imp.pres. 

‘open up paths for obtaining goods, slay enemies’ 
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(15) Greek Horn. II. 1,308-11: 

£v 8’ spsxaq EKpivev 

In but/and rower.Acc.PL. chose. 3 sg.impf. 


eeikogiv, tq S’ 
twenty in but/ 
and 


£Kai6p(3r|v / 
Hekatomb.Acc.SG. 


Pfjae Os® diva 8e Xpuor|i5a / 

make.to.go.3sG.iNj.AOR. god.DAT.SG. onto but/and Chryses.Acc.SG.F. 

Ka>Aurdpr|ov eigev aycov 

fair.cheeked.Acc.SG.M./F. make.sit.3sG.A0R. lead.PTCP.PRES.NOM.SG.M. 

‘ (But the son of Atreus launched a swift ship on the sea,) and chose for it twenty 
rowers, and drove on board a hecatomb for the god, and brought the fair-cheeked 
daughter of Chryses onto it and made her sit.’ 


Wackernagel ’s Law 

In 1892 Wackernagel published his long famous article “Uber ein Gesetz der indog- 
ermanischen Wortstellung”. He states (elaborating earlier work by Delbriick and Bar- 
tholomae) that certain clitics tend to occupy the second position in the PIE sentence. 
He mainly bases this claim on data taken from Ancient Greek, but he also deals with a 
small amount of Latin, Vedic and Old Iranian and Celtic material. Already eight years 
later, Delbriick integrated the law into his handbook (Delbriick 1967=1900: 49-50). 
The following remark by Watkins dating from 1964 is still valid today: “. . . one of the 
few generally accepted syntactic statements about IE is Wackemagel’s Law, that enclit- 
ics originally occupied the second position in the sentence” (Watkins 1964: 1036). 
Decades after Wackernagel, this law has also been confirmed by newly discovered 
AIELs, especially the Anatolian languages (see the Hittite examples in Krisch 1990) 
and Mycenaean, thus making it a robust candidate for reconstruction. In the following 
examples, the clitics in Wackernagel position are in boldface. For reasons why some 
of the Greek clitics (see (16)) carry an accent and nevertheless “count” as clitics, see 
Krisch (1990: 75-76). 

(16) GreekHom.il. 18,394: 

fj pd vi) pot Seivi) xe 

in. truth then now I.dat.sg. terrible.NOM.SG.F. and 

0e6<; evSov 

goddess.NOM.SG. inside 

‘then, in truth, now the terrible and honorable goddess 

(17) Vedic RV 1,175,2 

a nas te gantu matsard 

here we.Acc. you. gen. sg. come.3sG.iMP.AOR. bull-like. nom.sg.m. 
vrsa mado varenyah 

intoxicating.NOM.SG.M. drunkenness.NOM.SG.M. excellent.NOM.SG.M. 

‘The intoxicating, bull-like, excellent drunkenness of you (i.e. God Indra, TK) 
shall come to us.’ 

(18) Hittite (Held 1957: 14) KUB XIII 2 III 16: 


Kai ai8oir| 

also honorable.NOM.SG.F. 


(is) inside for me’ 
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kuis= an= salt EGIR-pa tarnai n= an 

who.NOM.SG. he.ACC.SG. LOC.PTCL. back let.3SG.PRES. SConn. he. ACC. 

sakuwanzi 
lock.Up.3pL.PRES. 

‘(Anyone) who lets him (come) back they (shall) put in prison’ 

Wackemagel’s Law has become clearer through the research of the last few decades. This 
research is mainly based on Vedic, which probably can be taken as a model for PIE. Now 
one divides the clitics into three groups (cf. also Keydana 2011). Only the first two are 
genuine Wackemagel clitics. 

- WL, clitics (clitic pronouns) typically occupy the second position in the sentence, but in 
sentences with wh -words they always follow the wh-word (see examples (21) and (22)) 

- WL 2 clitics (sentence connectors and sentence adverbs) always occupy the position 
after the first word of the sentence 

- WL, clitics are adjacent to the word or constituent they take scope over. Typically, 
these are emphatic particles. If their host is put at the front of the sentence, they also 
occupy the Wackemagel position (cf. already Hale 1987: 44). Luhr (2009) deals with 
such particles from an information structural point of view. 

In the Rgveda, the following (types of) data have to be described (cf. Keydana 2011: 122): 

(19) RV 1,176,4 

( asunvantam samam jahi dunasam vo net te may ah |) asmabhyam asva vedanam 
daddhi ( sun's cid ohate) | 

‘(Kill everyone who does not press soma, the inaccessible one, who is not your 
delight|) Give to us his property! (He boasts (being) an inciter (of a sacrifice))’ 
asmabhyam asva vedanam daddhi 

we. dat. this. gen. sg. property.Acc.SG. give.3sc. imp. pres. 

As the context (19) shows, the accented pronoun asmabhyam is in contrastive focus to the 
asunvant-, the ones not pressing soma. Therefore, it stands in the Df-slot (cf. (5)) and the 
WL, asva is immediately following. 

(20) RV 1,76,1 

Kena va te manasa dasema 

Which.iNSTR.SG. or you. dat. sg. spirit, instr.sg. sacrifice. Lpl.opt.pres. 

‘With which spirit should we sacrifice you?’ 

In (20), we see that the WL, clitic va precedes the WL, clitic te (the wh-word + WL, clitic 
form a prosodic word to which the WL, clitic attaches). 

(21) RV 6,27,1 

indraft kim asva sakh t ve cakara 

Indra.NOM.SG. what.Acc.SG.N. this. gen. sg.m. friendship. loc.sg. do.3sG.PERE. 
‘What has Indra done in his (i.e. Soma’s, TK) friendship?’ 
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Example (21) shows a remarkable feature of the WL[ clitics: they follow the wh-word, 
even if another word/constituent precedes it. This construction can also be found in Greek 
(scheme 2): 

(22) Od. 20,47^18 

(auidp eycb Seoq dpi), diapnepeq r\ as (piAaaaco ev Jiavzacoi wovota’ 

‘(But I am a goddess,) (I) who incessantly protect you in all toils’ 

8ia|.ursps<; r) as (piAaaaco sv 7tdvTsaai 

incessantly who.nom.sg.f. you.Acc.so. protect. Isg.pres. in all.DAT.PL. 


rofvoia’ 

toil. DAT. PL. 



t. t t... (piAacroco 

I K (j; 


FIGURE 1.6 STRUCTURE OF HOMER, ODYSSEY 20, 48 


These facts have led to a number of proposals for how to characterize them. Hale 
(1987) is not very explicit about whether WackernageTs Law is a syntactic process (a 
movement operation leaving a trace) or another type of process interacting with syntax. 
He describes example (21) and others in the following way: “It is clear from these exam- 
ples that the WL clitics take second position, defined before the topicalization but after 
WH-movement places ka- (i.e. the wh-question word, TK) in COMP” (Hale 1987: 42). 
Applied to (21) and to our model in (2) this means that kim is in the XP position of the 
second CP (like he in (22)), then WL clitics “take” second position, and finally irtdrah is 
topicalized. “Take” could either mean a syntactic operation or some kind of intervening 
prosodic operation. Keydana (20 11: 115) criticizes this account by calling attention to the 
fact that an example like (19) cannot be described this way. The WL, clitic asva could not 
appear in this position because it cannot move there before topicalization. 

In a later paper, Hale abandons rule ordering and describes Wackernagel clitics as 
“syntactic movement to C°” (Hale 1996: 192). I agree with Keydana (2011: 119) that 
such a movement appears to be strange. The base position of the pronouns should be a 
full XP, and movement from an XP to a head position (C°) is disallowed by the concept 
of the structure preservation principle (cf. Haegeman 1994: 338). But, on the other hand, 
there are examples with clitics of the WL, type far away from the left periphery (cf. the 
clitic personal pronoun t u va in (23): 
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(23) RV 1,45,5 (cf. Hale 1995: 263) 

( ghftahavana santyema u su srudhT girah |) 
yabhih kam’asya sunavo havante d vase t u va \ \ 

‘ [you, the one to whom the ghee oblation belongs, honest one, listen well to these 
songs of praise] with which the sons of Kanva invoke you for help’ 
yabhih kanvasva sunavo havante ^ vase t u va 

which.iNSTR.PL. Kanva.GEN.SG. son.NOM.PL. invoke. 3 pl.pres. help.DAT. sg. you.ACC.SG. 

Hock’s (e.g. 1996) account of the Vedic left periphery offers a prosodic (not syntactic) 
template solution. The idea of prosody is taken up by Keydana (2011), who analyses the 
WL, clitics as prosodically attached to the first word in the sentence, whereas the WL, 
clitics always stand at the right of the left periphery, having the prosodic prefield of the 
sentence as their host. 

In my model (see above, examples (5) and (6)) these “Wackemagel particles” occupy 
positions which are marked with dotted lines, which means that I think their syntactic status 
is rather loose, representing an interface between syntax and prosody. This solution could 
suit Keydana’s idea of insertion of the clitics into the prosodic prefield of the sentence (in 
my model: before VP) but also allows taking into account that the WL, clitics can occupy an 
argument position in the sentence. For the last point see coindexed oe in parentheses in (22). 
Cf. also (23), where prosody attaches the clitic t u va to tonic avase at the end of the sentence. 
In (23) the syntactic base position of t„va would probably be a position in front of the verb (a 
fully fledged DP would occupy this position). Thus, also in the case of (23), the prosody and 
syntax interact. Coindexing oe in parentheses in (22) should indicate that I do not assume a 
movement operation here which is motivated by syntax alone. 

A view completely different from the ideas just presented is promoted by the phonol- 
ogists Agbayani and Golston (2010), who work with the concept of postpositives (Dover 
1960) like the clitic *k w e (= Lat. -que, Gr. -re) ‘and’ or (in their view) non-clitic Greek 
-8 e that cannot begin a phonological phrase. In their analysis, conjunctions (also the post- 
positive ones) originate syntactically between the conjoined phrases or clauses, and in 
the case of a postpositive conjunction like *k w e an adjacent (phonological) word of the 
second conjunct is moved to the left of it (a phonological, not a syntactic process). In the 
case of clauses which are not conjoined, but would syntactically start with postpositives, 
the first (phonological) word of the second conjunct would also be moved to the left. An 
example of the last case is (24): kev is a postpositive clitic particle denoting a “possible 
world” (cf. Krisch 1986: 18), and piv is a postpositive clitic personal pronoun. According 
to Agbayani and Golston, the sentence (without a conjunction) would start syntactically 
with two postpositives, and phonology would trigger movement of tote ‘then’ to the front 
of them, since postpositives cannot begin a phonological phrase. But this would mean 
that (syntactically) the Wackemagel clitics ksv (WL 2 ) and |iiv (WL,) do not occupy the 
second position but rather that the sentence starts with ksv in the syntactic component. 

(24) Horn. II. 1,100 (Agbayani & Golston 20 10: 18; (]) = phonological phrase; the authors 
underline the word that is moved and indicate the position from which it has moved 
with underlining as well) 

( tots ksv piv iXaaadiiEvoi)^ (;i£;u0oi|.i£v) + 

then ptcl. him appeasing let. us. persuade 

‘then let us persuade him by appeasing him’ 

But, as far as I see, this interesting approach does not really solve the problem of the 
different behaviour of WL, and WL, clitics as outlined above. 



124 THOMAS KRISCH 


The position (s) of the verb in AIELs and in PIE 

The author of this chapter has worked extensively on the position(s) of the verb in PIE. 
The model used is again the one presented in (5) and (6). 


The unmarked order: verb-final 

I start with unmarked verb-final in the AIELs and in PIE. This position of the verb can be 
found in many texts in AIELs at every turn, and it is unproblematic to reconstruct it for 
PIE. Already Delbriick (1967=1900: 83) wrote about PIE word order: “Das Verbum stand 
im unabhangigen Aussagesatz am Ende . . .” (In an independent declarative sentence the 
verb appeared at the end). One may refer to the following examples with verb-final taken 
from AIELs in this chapter: (4a-d), (13), (14), (15). 

Now, in the texts of all AIELs one meets sentences where some linguistic material is 
put after the verb in the right periphery. Syntactic research by Gonda (1959) and McCone 
(1979) could clarify that much of this extraposed material falls into categories which are 
non-obligatory. Gonda (1959) calls sentences like that “amplified sentences” (cf. also 
Krisch 1997; 2001). One may discern the following types of extraposition (cf. Krisch 
2001: 162-164): (a) apposition(s) to constituents overtly appearing in the VP before the 
verb; (b) extraposition of the subject (which is already expressed in the verbal form in 
AIELs and in PIE); (c) part of a Determiner phrase/Noun phrase, the other part being 
inside the VP (hyperbaton, see above); and (d) non-obligatory constituents. The follow- 
ing sentences are examples for these categories from Mycenean Greek (25), Celtic (26), 
Vedic prose (27) and Hittite (28).The finite verb forms are in boldface. 

(25) Mycenean Greek 

a) apposition: 

Ep 613,8 (Pylos): 

[sa-sa-wo] o-na-to e-ke ka-ma-e-u 

[Sasawo] benefieiary.right.ACC.SG. have.3sG.PRES. man.who.has.a.kama.NOM.SG.M. 

‘Sasawo has a fief as Kamaeus.’ 

b) subject: 

Le 641 (Knossos): 

o= a-po-te de-ka-sa-to a-re-i-io 

so= from.afar receive.3sG.AOR.MiD. Areios.NOM.SG.M. 

‘So from outside Areios received (a delivery)’ 

c) part of an NP/DP: 

Ep 704,3 (Pylos): 

e-ri-ta i-je-re-ja o-na-to e-ke 

Eritha priest.NOM.SG.F. benefieiary.right.ACC.SG. have.3sG.PRES. 

ke-ke-me-na ko-to-na va-ro da-mo 

estate. gen. sg. plot.of.land.GEN.SG. from community.DAT. sg. 

‘Eritha, the priestess, has (got) a fief (consisting) of an estate, of a plot of land from 
the community. ’ 

d) non-obligatory constituents: 

Ma. 225,2 (Pylos): 

o-da-a 2 ka-ke-we o-u= di-do-si za-we-te 

and.so bronze. smith.NOM.PL. not= give.3pL.PRES. this.year 
‘and so the bronze-smiths do not deliver this year’ 



PROTO-INDO-EUROPEAN SYNTAX 125 


(26) Celtic 

a) apposition: 

Celtiberian: Botorrita 95-99 (Meid 1993) 

Tocoitei ios urantiom= ue auseti 

Tocoit.DAT.-Loc.SG. who.NOM.SG. pasture. acc. sg. or make.use.of.3sG.AOR.suBj. 
aratim= ue. . . . 

arable.land.ACC.SG. or . . . 

‘Who wants to use pasture land or arable land in the Togoit-(district) (Togoit is the 
name of a Celtiberian god, TK), . . .’ 

b) subject: 

Gaulish (Genouilly, Stele) (after Lambert 1994: 94) 

ELVONTIV IEVRV. ANEVNO OCLICNO . LVGVR1X 
Elontios.DAT.SG. make.3sG.PRET. Aneunos.NOM.SG. Oc1os.adj.nom.sg. Lugurix.NOM.SG. 
ANEVNICNO 
Aneunos.ADj.NOM.SG. 

‘ Aneunos. (son - ) of Oclos. (and - ) Lugurix (son) of Aneunos have made (3sg.pret.! 
TK) (me) for Eluontios (name of a god? TK) 

c) part of a DP/NP 

Celtiberian: Botorrita 23-25 (Meid 1993) 

uta oscues stena uersoniti, silapur 

and whoever.NOM.SG. this. acc. pl.n. carry.through.3sG.PRES. silver.Acc.SG. 

sleitom conscilitom capiseti, cantom 

sleitom.Acc.SG. conscilitom. acc. sg. take.3sG.A0R.suBj. 100 

sancilistara otanaum tocoitei eni . eni. 

saneilistara.ACC.PL.N. giving.to.ACC.SG. Togoit.DAT.-LOC.SG. into 
‘And whoever carries this through, will take sleitom- and conscilitom-silver, 
(namely) 100 sancilistara (monetary unit, TK), for the giving-to into the (temple 
of) Togoit-’ 

d) non-obligatory constituents 
Celtiberian: Botorrita 23-35 (Meid 1993) 

uta oscues stena uersoniti, silapur 

and whoever.NOM.SG. this. acc. pl.n. carry.through.3sG.PRES. silver.Acc.SG. 


sleitom 

conscilitom 

capiseti. 

cantom 

sleitom.Acc.SG. 

conscilitom. acc. sg. 

take.3sG.A0R.suBj. 

100 

sancilistara 

otanaum 

tocoitei 

eni. 

sancilistara. acc.pl. giving.to.ACC.SG. 

Togoit.DAT.-LOC.SG. 

into 


‘And whoever carries this through, will take sleitom- and conscilitom-silver, 
(namely) 100 sancilistara (monetary unit, TK), for the giving-to into the (temple 
of) Togoit-’ 

(27) Vedic 

a) apposition: 

JB 1,28 

tam eva tabhir ahutibhis samavitv s pjam 

this.Acc.SG.M. so this. instr.pl. f. rite.iNSTR.PL.F. appease.ABS. power.Acc.SG. 

lokanam juyati vamam devam devanam 

world.GEN.PL. win.3.SG.PRES. Yama.Acc.SG. god.Acc.so. god. gen.pl. 

‘having appeased him, the god of the gods Yama with these rites, he wins the power 
of the worlds.’ 
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b) subject (with an emendment by Caland 1970=1919: 27, fn. 1) 

JB 1,117,1: 

etasya ha va idam samnah krte varsati 

This. gen. sg.n. ptcl. ptcl. present song. gen. sg.n. for. the. sake. of rain.3sG.PRES. 
ca varianva lie ca grhnati. 

and Parjanya.NOM.SG. out and seize 

‘for the sake of this present song (god) Parjanya lets it rain and stops (raining).’ 

c) part of a DP/NP 
AB 2,14,6: 

amrtam va ajyam, amrtam 

Ambrosia.NOM.SG. ptcl. butter.NOM.SG. ambrosia.NOM.SG. 

tatra sa kama up dp to 

there he longing. loc.sg.m. arrive.PERF.PTCP 
tatra sa kama upapto 

there he longing. loc.sg.m. arrive.PERF.PTCP 


va 

which. nom.sg.m, 


hiranyam 

gold.NOM.SG. 

dive, 

butter.LOC.SG. 

hiranye . 

gold.LOC.SG. 


yo 

which.NOM.SG.M. 

‘Butter is ambrosia, gold is ambrosia. There he has arrived in the longing which (is) 
in butter, there he has arrived in the longing which is in gold. ’ 
d) non-obligatory constituents 
AB 2,11,1: 

pratibudhv s gnimavTh puras 
perceiving. abs. fiery.Acc.PL. citadel, acc.pl. 
parydsyanta vajnasva £, 

place.around.3PL.iMPF. mid. sacrifice, gen. sg. and 
suptyai 

preserving. dat.sg. 


te devah 

this. NOM.PL. god.NOM.PL. 

tripuram 

triple . fortification, acc. sg. 
atmanas ca 
and 


Self.GEN.SG. 

‘The gods, perceiving, placed around fiery citadels, a triple fortification for the pres- 
ervation o f the sacrifice and of themselves. ’ 


(28) Hittite 

a) apposition 

Ges. 2, §86a, 27-29 (Friedrich 1971: 86) 

(27) takku DUMU-an an[nanumanzi ] (28) kuiski pui nassu 

If son.Acc.SG. be.educated.iNF. anybody give.3sG.PRES. either 

^ NAGAR n \ (asm a L )- S(IMUGA -(- US.B ] AR (29) nasma ^ ASGAB nasma 

carpenter or smith weaver or leather. worker or 

t ^TUG . . . 
fuller . . . 

‘if anybody gives a son to be educated, either as a carpenter or as a smith, weaver or 
as a leather worker or as a fuller . . . ’ 

b) subject 

Ges. II § 76, 29 (Friedrich 1971: 82) 

takku=ssan GEDIM-it tianzi LU-A7/1 

if= ptcl. ghost.of.a.dead.person.iNSTR. step. forward. 3 pl.pres. man=alas 
MUNUS roM 
woman.NOM.SG. 

‘if they step forward, man - alas- (and) woman.’ 

c) part of a DP/NP 

StBoT 8, Vs I, 21 ’-25’ (cf. Otten-Soucek 1969: 20) 

(21) apiis hantezumni tehhi, ues=a namma anda (22) 

this. acc. pl. forecourt. dat./loc.sg. put.lsG.PRES. we=CoT again inside 
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[p]aiwani II a h antasepus harwani GI[(S-asj\ 

go. I pl.pres. two Hantasepa-deities.Acc.PL. have. I pl. pres. wood.GEN.SG. 

‘I put these (unclear reference in the text, TK) into the forecourt and we go inside 

again. We have two Hantasepa-deities (made) of wood. ’ 

d) non-obligatory constituents 

StBoT 8, VS I, 14’— 15’ (cf. Otten-Soucek 1969: 18) 

DUMU.E.[(G)]AL suppi watar para epzi LUGAL-/ 

palace-official clean.Acc.SG.N. water. acc.sg.n. out hold.3sG.PRES. king.DAT.SG. 
MUNU S . LU G AL=ya 
queen=and 

‘The palace-official holds out clean water for the king and the queen’ 

(28c), part of which has already been cited as a hyperbaton construction (11), shows the 
amplified construction in the context of a number of regular Hittite OV sentences. 


Marked order 1: verb-initial (V-l) 

As we have seen, the PIE base-type was OV, a type which also encompasses the “ampli- 
fied” structures in 2. 1.3.1. 

Under certain conditions, a movement of the verb to the top of the sentence was 
allowed. Whereas, e.g., in NHG this move is connected with certain sentence types (yes/ 
no questions and imperative sentences show the finite verb at the beginning of the sen- 
tence), the AIELs and therefore PIE had a pragmatic condition for putting the verb at the 
top (V-l) (see below). 

In the beginning of this paper we reconstructed word order variation (VO vs. OV) for 
PIE, leading to the results in (29a) and (29b): 

(29) a) (=(1)) (based on Vedic, Avestan and Greek) 

PIE *(e)- g wh en-t *(H)e/og wh -im 

(Augment)-kill-3sG.iNj.PRES./(iMPF.) serpent- acc.sg. 

► 

‘He kills /(killed) the serpent’ 

b) (=(3)) (based on Vedic, Greek and Hittite) 

PIE *(H)e/og wh -im *(e)- g wh en-t 

serpent-Acc.SG. (Augment)-kill-3sG.iNj.PRES./(iMPF.) 

◄ 

‘He kills /(killed) the serpent’ 

Much has been written about the reasons for fronting the verb in AIEL and in 
PIE, e.g. 

• Delbriick (1967=1900: 81) (my translation): “a word [moves] to the front or to the 
front edge of the sentence respectively, if it is of paramount importance for the state- 
ment ... A verb can be important, if, in an isolated sentence, the action is more import- 
ant than the other components . . . [I]n a series of sentences, it pushes the narrative or 
the discussion ahead.” 

• Dressier (1969: 3) (my translation): “Verb-first is textually cataphoric or (much 
more frequent) anaphoric and (as a word order variant) characterizes textually bound 
sentences.” 



128 THOMAS KRISCH 


• Dressier (1971: 18) (my translation): “Verb-first points to another sentence (either 
before or after) to which it is connected.” 

• Klein in his article on Vedic verb-first states that the fronting of constituents seems 
universally to be associated with salience (Klein 1991: 135). He lists some environ- 
ments where one encounters verb-first in Vedic texts: in imperative sentences, in iter- 
ative anaphoric structures, in chiastic constructions, in cases of simultaneity between 
the event referred to in the hymn and actions occurring at the same time within the 
ritual, in the situation of fronting of verbs of speaking before a quotation, and in case 
of the Vedic verb han- ‘smite’ a tendency to front it. 

• Luraghi (1995: 380): “On account of its lower frequency, the verb initial order was in 
a certain sense ‘abnormal’. By an iconic principle, it came to be used in cases where 
something abnormal was going on, either in the arrangement of the discourse, or in the 
course of the events reported.” 

• Krisch (2004: 116): “Proto-Indo-European functions: . . . V-l: establishing a very 
close connection between sentences /sequences of action.” 

• Bauer (2011: 47) (about Hittite; my translation): “Thus, one can note that it is the 
distribution of topic and comment in the sentence which is crucial for V 1 . The verb is 
the dummy-topic and the comment is the focus.” 

Depending on the text, there is some truth in all of these statements. In my present view, 

• verb-first introduces an important (often unexpected new) topic for the narrative with 
the consequence that further comments are given on it. In a coherent narrative, its 
general function is to accelerate narration, to drive forward the action. 

Let us look again at (29a)=(l) in a fuller context, namely stanzas 1-3 of our Vedic exam- 
ple (2a): 

(30) RV 1,32,1-3 

1. indrasya mi vTrydni pra vocam, yani 

Indra.GEN. now manly.deeds. acc.pl. prv. proclaim. 1 sg.inj.aor. rp 

cakara prathamani vajn \ ahann dhim, 

make.3sG.PERF. first.Acc.PL. thunderbolt-wielder.NOM.so. kill.3sG.iMPF. serpent. acc.sg. 

dn u apas tatarda, pra vaksana abhinat 

prv. water.Acc.PL. disclose. 3 sg.perf. prv. cave. acc.pl. cleave. 3 sg.impf. 

pdn’atdnam || 

mountain.GEN.PL. 

‘I now proclaim the manly deeds of Indra, the first ones that he achieved, the thun- 

derbolt-wielder. He killed the dragon, he disclosed the waters, he cleft the caves of the 

mountains.’ 

2. ahann dhim pdrvate sisriyanam. 

kill.3sG.iMPF. serpent.Acc.se. mountain.Loc.SG. lean.PERF.PTCP.MiD.Acc.SG. 

H’dsta <t smai vajram svar i yam tataksa \ 

Tvastar he.DAT. thunderbolt.Acc.SG. shining. acc.sg. produce. 3 sg.perf. 

vasra iva dhenavah svandamana 

mooing. nom.pl. like cow.nom.pl. hurrying.up. pres.ptcp.mid.nom.pl. 

ahjah samudram ava jagmur apah\ \ 

straight confluence.Acc.SG. prv. go.3PL.PERE. water.NOM.PL. 
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‘He killed the dragon who was lying on the mountain. God Tvastar has produced a 
shining thunderbolt for him. Like the mooing cows (go to their calves) hurrying up, the 
waters glided straight downwards to the confluence.’ 

3. vrsayamano a vrnita 

Behave. like. a.bull.PTCP.PRES.MiD.NOM.SG. choose. 3 sg.impf. 
trikadrukes u apibat sutasva \ 

trikadruka-day.LOC.PL. drink.3sG.iMPF. pressed.one.GEN.SG. 

a savakam maghava jdatta 

prv. missile.Acc.sG. munificent.one.NOM.SG. take.3sG.iMPF.MiD 
ahann enam prathamajam ahlnam 

kill.3sG.iMPF. he.Acc. first-born. acc. serpent.GEN.PL. 

‘The one behaving like a bull (i.e. god Indra, TK) chose Soma, in the Trikadruka-days 
he drank from the pressed one. The munificent one took the missile, the thunderbolt. He 
killed him, the first-born of the dragons.’ 

Note that the VO order with the lexical material of ‘kill’ and ‘snake’ reoccurs another 
two times in the three stanzas in (30). In all these cases the order is VO and not OV. 
Example (30) 1 starts with an OV sentence with the verb form vocam in absolute final 
position. Then follows the most important action of the god Indra, that he slew the dragon 
Vrtra, a verb-initial sequence (starting with ahann). That Indra disclosed the waters is a 
consequence of his slaying of the dragon who locked them up. Also, the cleaving of the 
mountain serves to get the waters free. These actions, since they are not so important, are 
presented in a sentence which is absolutely verb-final ( tatarda ) and in an OV sentence with 
amplification (genitive parvatanam belonging to vaksana). In (30) 2, the important fact of 
killing Vrtra is again emphasized by a verb-initial sentence (starting with ahann). In the 
following context, the scene is broadening again and describes things preceding the kill- 
ing (that a thunderbolt was made for him) and, again, the situation afterwards (the waters 
glide downwards). (30) 3 goes back to the fighting scene and describes how the god Indra 
prepared himself to fight: He drinks soma and takes his weapon. Then the main action of 
Indra, the killing of the dragon, is repeated with the verb ahann in initial position. 

There is no coherent narration; aspects of the topic are presented in little scenes jump- 
ing forward and backward. This function of comments, namely subdividing the scene 
opened by ahann ‘he kills’, is chosen in the whole hymn RV 1,32, where subdivisions of 
the act of killing are brought into little scenes with repetition. This hymn is like a musical 
piece consisting of a theme plus variations. 

In contrast to this VO order, the OV order of the sentence dealing with Indra and the dragon, 
cited again in (3 1), does not indicate the TOPIC of what follows. It is part of new information 
given in the hymn, thus belonging to the informational focus of the text, the comment. 

(31) = (4b) Vedic RV 2,15,1 (ahim < PIE *(H)e/og wh -im Jaghana < PIE *g wh e-g wh on-e) 
trikadrukesu apibat sutasv a asva made 

Trikadruka-day.LOC.PL. drink.3sG.iMPF. Soma. gen. sg.m. he.GEN.SG. intoxicationxoc.SG. 
ahim indro jaghana: 

serpent.ACC.SG. Indra.NOM.SG. slay.3sG.PERF. 

◄ 


somam 

Soma.Acc.SG. 


vajram. 

thunderbolt.Acc.SG. 


‘At the Trikadruka-days he drank from Soma; in the intoxication of /(caused by) this 
(one), Indra has slain the serpent’ 
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The hymn RV 2,15 contains a short enumeration of all the noteworthy deeds of Indra 
without going into details. To illustrate this fact, (32) cites the first three stanzas of this 
hymn, which consists of ten stanzas: 

(32) RV 2,15 (translation Griffith (1896); bold passage marked by TK) 

1 . Now, verily, will I declare the exploits, mighty and true, of him the True and 
Mighty. 

In the Trikadrukas he drank the Soma then in its rapture Indra slew the Dragon. 

2. High heaven unsupported in space he established: he filled the two worlds and 
the air’s mid-region. 

Earth he upheld, and gave it wide expansion. These things did Indra in the 
Soma’s rapture. 

3. From front, as ’twere a house, he ruled and measured; pierced with his bolt 
the fountains of the rivers, And made them flow at ease by paths far-reaching, 
These things did Indra in the Soma’s rapture. 

In this hymn the scene that Indra slew the dragon appears only once, namely in stanza 
1, cf. the passage given in boldface. In the other verses various deeds of Indra are men- 
tioned. Even stanza 3, which one could associate with the myth of the dragon, is formu- 
lated in a neutral way without any overt hints toward the dragon-story. 

This enumeration of the deeds of Indra is underlined by a refrain starting with stanza 2 
and going until stanza 9: “These things did Indra in the Soma’s rapture.” 

Let us now turn to the Pindar examples (2c-d above) in Greek, which are interesting 
because they contain a native speaker’s, namely Pindar’s or, more correctly, the narra- 
tor’s, comment on how to interpret the order with the verb in the front. The example for 
VO from the Pythian Ode Nr. 4, which has been mentioned earlier, is repeated as (33) 
(Kieive ocptv): 

(33) = (2d) 

Pindar, Pyth. 4,249: [kteive < PIE *tkenye°/k[)enye 0 (injunctive present) or < PIE 
*tken-s°/k[)en-s 0 (injunctive aorist); ocptv < PIE *(H)e/og wh -im] 

Kisive . . . ocptv 

kill.3sG.iMPF./AOR. serpent.Acc.SG. 

‘he killed the serpent /dragon’ 

This text is about the Argonauts and the winning of the golden fleece. Jason the Argonaut 
has to get the golden fleece, which lies in a sacred grove in Kolchis. In (34), the context 
of (33) is offered in an English translation: 

(34) Pindar, P. 4, 240-252 (translation, see: Odes, translated by Diane Amson Svarlien [The 
Annenberg CPB Project, the Perseus Digital Library, accessed 9 September 2014]) 

But at once the marvellous son of Helios spoke of the shining fleece, telling where 
the sword of Phrixus had stretched it out. He expected that Jason would not be able 
to accomplish this further labour. For the fleece lay in a thicket, held in the ravening 
jaws of a serpent, [245] which in thickness and length surpassed a ship with fifty 
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oars, built by the blows of a hammer. It is too long a way for me to go by the beaten 
track; for time presses, and I know a shortcut. In poetic skill I am a guide to many 
others. Jason killed the grey-eyed serpent with its dappled back by cunning, [250] 
Arcesilas, and stole away Medea, with her own help, to be the death of Pelias. And 
they reached the expanses of Ocean, and the Red Sea, and the race of the Lemnian 
women, who killed their husbands.” 

There is a remarkable breaking-off in the middle of this passage, where Pindar inter- 
venes as a poet and reflects on his technique of presentation. These verses interrupt the 
calm flow of narration, and they are, on the one hand, a rather boastful demand the narra- 
tor makes of himself to speed up, and, on the other hand, they characterize the following 
lines as a short, condensed, so to speak “fast flowing,” version of the narrative. The orig- 
inal text of this passage is cited in (35). 

(35) Pindar P 4, 247-248 


jiaKpa jioi 

VSIG0CXI 

K(XT’ 

apa^ixov 

a>pa 

yap 

long.ADV I.DAT. 

gO. INF. MID. 

along 

beaten.track.Acc. 

time.NOM. 

for 

GDV&7rUSl 

KOU 

uva 

3. 

a 

> 

o 

zL 

*o 


Ppaxuv 

be.nigh.at.hand.3sG.PRES. and 

some 

way. acc. know. 1 sg.pres. act. 

short.Acc. 

jio^oioi 6’ 

ayr)(iai 


Gocpiaq sxspoiq. 



many.DAT. pl. but lead. 1 sg.mid.perf. skill. acc. pl. other.DAT.PL. 

‘It is too long a way for me to go by the beaten track; for time presses, and I know 
a shortcut. In poetic skill I am a guide to many others.’ 

Pindar’s remarks in (35) offer a rather unique native speaker’s intuition about the func- 
tions of VO in quite a clear way. The next sentence, our sentence (33)=(2d), starts with 
marked verb-first, showing the poetic skills of Pindar, which he himself just mentioned. 
The VO sentences that follow (35) are cited in (36) in full. The two verbs in sentence-ini- 
tial position are put in bold: 

(36) (cf. (33)) Pindar P 4, 249-250: 

kt£iv£ p£v yXavK&na T£%vai<; jioikiTAvcotov ocpiv, 

Kill.3sG.iMPF. ptcl. grey-eyed.Acc. cunning.DAT.PL. dappled.back.Acc. serpent.Acc. 
& ApKeaila, KJii|/£v is MqSsiav cruv aura xdv 

o Arkesilaos.voc. steal.AOR.3sG. and Medea. acc. sg.f. with she.DAT. she.Acc.SG.F. 
IleJaao (povov £v x’ Okecivou nekayeaai 

Pelias. gen. death. acc. prv. and Ocean.GEN. flood. dat.pl. 

piyev 

make . acquaintance . with . 3 pl . aorII 

‘He (i.e. Jason, TK) killed the gray-eyed serpent with its dappled back by cunning, 
[250] Arcesilas, and stole away Medea, with her own help, to be the death of Pelias. 
And they reached the expanses of Ocean (the name of a god and a river which is the 
origin of all oceans, TK) 

Basically, the function of verb-first in (36) is similar to the one which we have seen 
in Vedic: The scene is opened up by the verb, which states an important topic pushed 
forward in a lively way and leads to further comments: the action is pushed ahead by a 
second verb form in initial position (verb kA£V|/£v). With the last verb-final sentence (verb 
piyev), the scene comes to a rest. 
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On the other hand, the Pindar citation (37)=(4d), the evidence for verb-final position, 
has a context similar to the comparable Vedic sentences with the verb in final position 
((31) and (32)): The deeds of the hero Bellerophontes are simply enumerated and not 
expanded; they do not form the basis for a narration. The killing of the dragons is part of 
this enumeration. The context of (37) is cited in (38). 

(37)=(4d) Greek: Pindar, O. 13,88 ff. (£7i£(pve(v) < PIE *e-g wh e-g wh n-e reduplicated 
aorist) 

Kai Xlpatpav nup JivEotoav Kai EoWipoix; £7t£(pv£v 


Kai Xipaipav Jiup ;rv£oioav Kai 

and Chimaira.ACC.SG.F. fire.ACC.SG.F breathe. ptcp.aor.acc.sg.f. and 
EoA,U|.lOU<; £7l£(pV£V 

Solymoi.ACC.PL. kill.3sG.AOR. 

‘And he (i.e. Bellerophontes, TK) killed the fire-breathing Chimaira and the Solymoi’ 

(38) Pindar, O. 13,88 ff. (translation based on Odes, translated by Diane Amson 
Svarlien [The Annenberg CPB Project, the Perseus Digital Library, accessed 9 
September 2014]) 

guv 8e KEivtp Kai hot’ Apa^ovlScov aiOfipo:; 

with but that.one.DAT.SG. and once Amazones. gen.pl. heaven.GEN.SG. 

v|/uxpcov and ko>,wcov Eprjpou xo^oxav 

cold. gen. pl. from bosom. gen.pl. lonely.GEN.SG. areher.anny.ACC.SG. 

(3a>Acov yuvatKEiov oipaxov Kai 

attaek.PTCP.PRES.NOM.SG.M. female.ACC.SG. army.ACC.SG. and 
Xlpaipav jiup 

Chimaira.ACC.SG.F. fire.ACC.SG.F. 

jivEoioav Kai Xo/.nponc ettecpvev. 

breathe. ptcp.aor.acc.sg.f. and Solymoi.ACC.PL. kill.3sG.AOR. 

‘And with that one (i.e. Pegasus, TK), from the chilly bosom of the lonely air, 
he (i.e. Bellerophontes, TK) once, - attacking the Amazons, the female army of 
archers, - killed the fire -breathing Chimaera, and the Solymi.’ 

From these and similar examples one may hypothesize that Greek and Vedic, and pre- 
sumably also IE, had both possible verb orders, verb-initial and verb-final, and that the 
difference between them is that the final position of the verb shows a regular sequence 
with the topic in front of it and the comment in the predicate part, whereas the verb-initial 
construction serves stylistic purposes indicating a (new) topic, represented by the action 
expressed by the verb and its object, which leads to further comments directly derived by 
the verbal action. 

In my model (see (5) and (6)), sentences with an initial finite verb can be repre- 
sented by movement of the finite verb into the C position. This is a sort of movement 
which is well established in generative grammar (cfi, e.g., Haegeman 1994: 302). Since 
Vedic (like PIE) is a fully inflecting language, the empty subject pm is licensed by the 
agreement morphology present in ahann (cf. also Axel 2007: 314 and Sternefeld 2006: 
613, referring to Barbosa 1995). A syntactic tree of part of the Vedic sentence in (30) 
2 is (39): 
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(39) 



FIGURE 1.7 STRUCTURE OF RIGVEDA 1,32,2 


For a Greek instance of V-l cf. example (41). 


Marked order 2: verb-second (V-2) / verb-third (V-3) 

Whereas the existence of verb-first in PIE is uncontroversial, the reconstruction of 
verb-second (as in modem German) can be found mainly in the work of the author of this 
article. But also Keydana (2011: 109-110) assumes verb-second at least for Vedic. The 
ideas presented here are work in progress. 

In fact, verb-second is structurally identical to verb-initial with scheme 1 in the 
approach used here (cf. (5)): The verb is moved into C. The only difference is that (in the 
case of verb-initial, cf. (36)) the Df-slot is empty, whereas in verb-second the Df-position 
is filled. If one takes scheme 2, V-2 is possible if the Df-slot is filled, the verb is moved to 
C, and the second XP in front of the C position is empty. With scheme 2, verb-third (V-3) 
is a possible option when the verb is moved into C and the two XP positions in front of it 
are also filled. According to Krisch 2004 the function of verb-second in PIE is similar to 
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the function of verb-initial. Its function is “establishing a connection between sentences/ 
sequences of action, but not as close as verb initial” (Krisch 2004: 116). It is viewed as 
a probably late IE feature, since this phenomenon can be observed very frequently in 
Ancient Greek and in the Germanic languages, whereas it is rare in Vedic and seems to be 
absent in Hittite (cf. Krisch 2004: 119). Whether verb-third has a special function is still 
unclear but it probably does not differ much from unmarked V-end. 

A clear instance of V-3 is II. 18, 468 (pev is interpreted as a WL3 particle (see sec- 
tion 2. 1 .2) which is attached directly to its host, xqv). Since the local specification auiou 
is demanded by the verb in my model, this cannot be an extraposed constituent. 

(40) II. 18,468 

T2<; euicbv iqv pev Xuiev auiou 

thus speak.PTCP.AOR.NOM.SG.M. she.Acc.so. verily leave. 3sg.aor. there 
‘After he had spoken thus, he (i.e. the god Hephasitos, TK) left her (i.e. Thetis, 
AchilTs mother, TK) there.’ 


XP 


CP 


WL 2 cp 


TOPICALIZED 
MATERIAL: Df-slot 


[‘‘£2? djwav]j 



FIGURE 1.8 STRUCTURE OF HOMER, ILIAD 18,468 / 1 


This sentence is part of the shield description of Achilleus in the Iliad. As shown in 
Krisch 2001: 169-170, this sentence starts a sequence of sentences describing the lively 
scene where Hephaistos does his blacksmith’s job and one action follows the other like 
the V-l sentence immediately following 40 in the text, cf. (41): 

(41) II 18,468 

(3fj 8’ oil cpuoaq 

gO.3SG.AOR.INJ. but to belloW.ACC.PL. 

‘And he (i.e. the god Hephaistos, TK) went to the (pair of) bellows’ 
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FIGURE 1.9 STRUCTURE OF HOMER, ILIAD 18,468 / 2 

In the following illustration from the fifth book of Homer I want to show the phenomenon: 
that V-l and V-2 appear in the same function (cf. also Krisch 1997: 299-300). But I also want 
to draw the reader’s attention to the fact that it is quite often not possible to decide formally 
whether a sentence is V-2 or verb-final, if the sentence is too short for a decision: 

(42) II. 5,97-101 

atv|/’ s;ii Tu8etSr| etitouveto KapjruA,a 

quickly against Tydeus-son.DAT. bend.3sG.iMPF. mid. eurved.ACC.PL.N. 

xotp. Kai Pal’ EJiaiaaovxa 

bow.and.arrow.ACC.PL.N. and hit.3sG.AOR. assault.PTCP.PRES.ACC.SG. 

tx>x®v Kara Se^iov copov 

sueeeed.PTCP.AOR.NOM.SG.M. on right.Acc.SG.M. shoulder.Acc.SG.M. 

Ocopr|Ko<; yvakov Sta 8’ etituto 

corselet.GEN.SG. hollow, acc.sg.n. through but fly.3sG.AOR.MiD. 

jiiKpo<; oiaxoi;, dvxiKpu Se Sieo/e, 

sharp. nom.sg.m. arrow.NOM.SG.M. straight.through but go. through. 3 sg.aor. 

TraldocETO 8’ a'ipaxi Odippc. xra 8’ 

besprinkle. 3 sg.pass.impf. but blood.DAT. corselet.NOM.SG. then but 

otI paKpov aiioE AuKaovoq ay^aoq uioq - 

thereupon loud.ADV. cry.out3so.AOR. Lykaon.GEN. splendid.NOM.SG. son.NOM.SG. 
‘Quickly he (i.e. Pandaros the son of Lykaon, TK) bent the curved bow and arrow 
against the son of Tydeus (i.e. Diomedes, TK) and hit the assaulting one success- 
fully on the right shoulder, (on) the curvature of the corselet. And the arrow went 
straight through and the corselet was sprinkled with blood. And then, thereupon 
Lykaon the splendid son of Lykaon cried out. ’ 



136 THOMAS KRISCH 


The first sentence in (42) is a V-3 sentence according to our model. The accusative 
Krx|i7iij).a xo^a cannot be an amplification, since it is an obligatory constituent. One can 
describe it with scheme 2 (cf. (6)): 


(43) 


CP 


XP 


TOPICALIZED 
MATERIAL; Df-slot 


ai\|/' 


WL 2 CP 



XP 

[eiri To8ei8r|] k 



C 


exixodvexo , 



VJL-, 


VP 


V (finite) 

pro t k Kag7tuka xoi;a tj 



FIGURE 1.10 STRUCTURE OF HOMER, ILIAD 5,97 


The next sentence (starting with Kcd pdk’) is a V-l sentence (like (39) and (41)), since the 
sentence connector Kai does not “count”. The sentence starting with Sid 8’ ETrxaxo could 
be either a verb-final sentence or a V-2 sentence with Sid in the Df-slot and 8’ in WL 2 - 
position, again using scheme 2. (44) offers a tree for the interpretation as V-2: 


(44) 



FIGURE 1.11 STRUCTURE OF HOMER, ILIAD 5,99 
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Also, the following short sentence (aviiKpu 5e Sieo/e,) could be either an example for 
verb-final or a case of V-2. The tree for V-2 would look like (45): 

(45) 



FIGURE 1.12 STRUCTURE OF HOMER ILIAD 5,100 

The sentence starting with naXaaoETO is unambiguously V-l. The last sentence is unam- 
biguously verb-final (with amplification); see (46): 

(46) 



( paxpov a-uoe AuKaovo? aykabq uio^ 


FIGURE 1.13 STRUCTURE OF HOMER, ILIAD 5,101 

If our interpretations are right, the scene narrated starts with a V-3 sentence when Pan- 
daros stretches his bow against Diomedes. The scene where the arrow hits Diomedes’ 
shoulder, penetrates the corselet and causes the blood to sprinkle is described by sen- 
tences which can be analysed as V-l and V-2. 

Conclusion: 

• V-l and V-2 have the function of driving forth the action in a narrative text, of making 
it lively. V-l presumably is the “livelier” of the two. 
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Object ellipsis 

A remarkable feature of PIE word order is object ellipsis. It is ubiquitous in Greek (cf. 
(47)), but also Latin, Vedic and Hittite show plenty of examples. 

(47) Ancient Greek Horn. II 5, 22-24, cf. Krisch (2009: 207) (Dares, a priest of the god 
Hephaistos, has two sons, Phegeus and Idaios. In the Trojan war, they fight against 
Diomedes. Diomedes kills Phegeus and would have killed Idaios as well, but Hep- 
haistos rescues Idaios). 

ov8e yap ov8e kev amoc, tmeKqruye Kijpa pe^oavav, / aXk' "Hcpaicrco:; tipDTO, aataas 
8e vuKii Kalvyaq, / diq 5rj oi pq nayxv yepcov dKaxqpsvo:; £iq. ‘Nay, he [Idaios, TK] 
would himself not have escaped the black goddess of death, but Hephaistos guarded 
HIM , rescued HIM , enfolding HIM in darkness so that the aged one [= Dares, TK] 
might not be utterly fordone with grief’ 


aXk' 

MIN 

"Hqxxicrtoq 

£pt)TO, 


oacoae 

but 

HIM 

Hephaistos 

guard. 3 sg. aor.mid. 

rescue.3sG.AOR. 

8e 

MIN 

VUKll 

TV 4 TXT 
iVlll'l 



and 

HIM 

night.LOC.SG. 

T TJJVjj 

1 lilVl 

hide . PTCP .AOR.NOM.SG.M. 


After discussing examples from Latin, Vedic and Hittite and recent scholarly literature on 
the topic, Krisch (2009: 211) concludes that “PIE had a rule that allowed deleting objects 
which refer back to already mentioned ‘thematic’ material. This ‘null object’ type of ellip- 
sis thus is operating in a forward direction.” Their frequency in coordinate conjunction is 
underlined by Keydana and Luraghi (2012). 

Absolute constructions 

Absolute constructions (henceforth ACs) of the type Latin me absente (I.abl.sg. absent.PTCP. 
pres.abl.sg.) ‘in my absence’ or pirn florente (pear.tree.ABL.SG. blossom.PTCP.PRES.ABL.SG.) 
‘when the pear tree is in blossom’ may be found in a number of AIELs and are a candidate 
for reconstruction. Ruppel 2013 is the latest reference work on this topic with an extensive 
bibliography. According to her, ACs denote spaces or points in time with an obligatory qual- 
ifier (nonnally a participle/verbal adjective) when the respective head noun does not refer to 
time, but is “put into time” (Ruppel 20 1 3 : 3 1 ) by the predicatively used participle. She sees 
the IE origins in regular locative expressions of nouns denoting time, like “at dawn” and “in 
spring”, which, when applied to nouns not referring to time, are “put into time” by the qual- 
ifier. Thus, compare regular Vedic usasi ‘at dawn’ (dawnxoc.SG., e.g. RV 4,2,8; expressing 
a definite point in time, conveyed by a locative) with swye udvati (sunxoc.so. up-go. ptcp. 
pres.loc.sg.) ‘at sunrise’, where ‘sun’ is not per se expressing a point in time. 

If one takes (as Ruppel 2013 does) the Vedic situation with the locativus absolutus as 
reflecting the PIE state, one may easily explain the Latin situation (with the ablative as 
the case fonn in the AC, which also encompasses the PIE locative via syncretism) or the 
Germanic situation (with a dative as the case form in the AC, which can also continue a 
PIE locative via syncretism). Greek, however, is problematic because the genitive which 
appears in the Greek AC (a genitivus absolutus construction) does not go back to a PIE loc- 
ative, but can represent either a PIE ablative or genitive. One may argue here that genitives 
can be used as case fonns for locative meaning in Greek also outside of ACs, e.g. WKToq 
(night.GEN.SG.) ‘at night’, noXepm kcu eipavap (war.GEN.so. and peace. gen.sg.) ‘in war and 
peace’ (Elis, 200 BC) (cf. Ruppel 2013: 220). Such genitives are traditionally classified 
as “partitive” in the handbooks (e.g. vvxrdq ‘(time portion of) night’, noT-epco mi eipavap 
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‘(time portion of) war and peace’). These constructions compete with constructions show- 
ing (preposition +) dative, cf. ev siprjvfl Kai ev 7toXe|i© (in war.DAT.se. and in peace. dat.sg.; 
e.g. X.Lac. 11.1 .2). The only attestation of this phrase in the dative which I could find with- 
out a preposition is in a Scholia in Iliadem 18, 490. The Greek dative can continue the PIE 
dative, instrumental and locative. Starting from such competing constructions, there could 
have been a transfer to the genitive. Unlike others (including Krisch 1988, Keydana 1997: 
9), Ruppel is not interested in the semantic equivalence of ACs to finite subordinate clauses. 

Krisch (1988: 8-9, following Holland 1986) reconstructs a nominativus absolutus 
(nominative + participle) for PIE. This construction is very rare but appears in a greater 
number of AIELs than the ACs with oblique cases. One can imagine these nominatives, 
loosely connected with the neighbouring sentence, as forming a frame for the adjacent 
sentence and can interpret (with Krisch 1988) the (presumably later, but still PIE) devel- 
opment of ACs with oblique case forms as a stronger integration of the temporal frames 
into the sentence. Since the locative with its temporal function is the ideal candidate for 
this, the rest of the story could roughly go as sketched above when dealing with Ruppel 
2013. An example for a nominativus absolutus is (48): 

(48) Greek, Horn. II. 3,211: 

cxpip© 8’ eijopev©, yepapcbiepo:; fjev 'OSuooefx; 

both.NOM.Du. but sit.PTCP.MiD.NOM.DU. well-built.coMP.NOM.DU. be.iMPF.3sG. Odysseus 
‘when both (i.e. Menelaos and Odysseus, TK) were seated, Odysseus was the more 
impressive’ 

Other languages where this construction appears are Vedic, Latin, Germanic, Hittite and 
Lithuanian. The Hittite examples are doubtful, though (cf. Keydana 1997: 288). 

An attempt to draw the semantic paths from an original temporal meaning of ACs to 
other meanings of ACs (conditional, causal, concessive, modal) is made by Krisch (1988: 
9). Keydana (1997) offers a very extensive selection of attested ACs in AIELs arranged 
according to a number of important syntactic categories. Like the author of this chapter, 
he considers a late PIE origin for ACs (Keydana 1997: 34), but he does not take the tem- 
poral value of ACs as basic and wants to explain the relationship between the ACs and the 
rest of the sentence by relational semantics/pragmatics (Keydana 1997: 45). 

Case functions in the AIELs and in PIE 

Introduction 

In IE studies, the semasiological approach has a long tradition. One takes the case form of 
a particular language and describes the functions which this case form can be associated 
with. This type of research is still ongoing and now centres on the question of determin- 
ing the “core” meaning of the cases and their relation to more peripheral uses and on the 
question of which uses of the case in question can be taken over by other case forms. This 
type of research is surely still a rewarding task if it is carried out meticulously. 


The instrumental 

The Vedic instrumental case 

As an elaborate example for such an investigation, I mention Hettrich 2002, a speci- 
men from a work in progress on Rgvedic uses of case forms. 4 The article deals with the 
instrumental. 
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Hettrich (2002: 46) defines the core/prototypical meaning of the instrumental in the 
Rgveda (RV) as the ‘instrumental of means’. According to him, an NP carrying the pro- 
totypical instrumental case exhibits the following features: 

(49) 

a) It is -animate, concrete and easy to handle. 

b) It is a (physical) object the existence of which is independent of the actual situ- 
ation and of the participants in this situation. 

c) This (physical) object is under the control of the one controlling the action and 
stays there during the action. 

d) This (physical) object enables the controller to carry out an action which is 
controlled by him or her, which is dynamic and oriented towards a patient, or 
facilitates such an action. 

This prototypical usage is presented in (50): 

(50) RV 2,15,6b: 

vajren s ana usasah sam pipesa 

Vajra.iNSTR.SG. travelling. eart.ACC.SG. Usas.GEN.SG. together demolish. 3 sg.perp. 
‘With the Vajra (i.e. the thunderbolt, TK) he (i.e. the god Indra, TK) demolished the 
cart of Usas (i.e. the goddess of dawn, TK)’ 

The other shades of meaning of the instrumental case are explained linguistically by the 
absence of one or more of the features of the prototypical usage enumerated above. If one 
takes away feature c) one gets sentences like (51): 

(51) RV 2,10,4a: 

jigharmy agnim havisa ghrtena 

sprinkle. Isg.pres. Agni.Acc.SG. oblation.iNSTR.so. butter.iNSTR.SG. 

‘I sprinkle Agni with an oblation, with butter’ 

A further example is the causal meaning of an NP using an instrumental. This meaning 
is achieved if the NP slot is headed by a noun which is -concrete (i.e. +abstract), and it 
is sometimes doubtful whether there exists a full and sole control over it by the subject 
controlling the action. Consider example (52), where the subject is Indra, the mighty god 
who sets free the waters and makes springtime possible: 

(52) RV 2,13,7 (Hettrich 2002: 49) 

yah puspinTs ca pras,yas ca dharmana 

who.NOM.SG. blossoming.ACC.PL.F. and fruitful. acc.pl. f. and divine.law.iNSTR.SG. 
adhi done viv avanlr adharayah 

onto.ADV. field.Loc.SG. asunder river.Acc.PL.F. bear.forth.2sG.iMPF. 

‘you who distributed the blossoming and fruitful (flowers) (and) the rivers onto the 
field according U^ecause of divine law’. 

The causal usage of the instrumental in (52) corresponds to non-prototypical uses of the 
locative (53) and the accusative (54) (with the adverb anu), and with a reading of the 
ablative (abstract source) which is fairly near the ablative prototype (55). 

(53) RV 9,7,1: 

asrgram indavah padia dharmann 

release.3pL.AOR.PASs. sap. nom.pl. m. path.iNSTR.SG. divine. law.Loc.SG. 
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rtasya susriyah 

truth, gen.sg. very.beautiful.NOM.PL.M. 

‘The saps poured out along the path because of/according to divine law of truth, the 
uttermost excellent (ones).’ 

(54) RV 3,17,1: 

samidhyamanah pratham a - anu dharma 

kindled.PRES.PASs.PTCP.NOM.SG. first.Acc.PL.N. according.to divine. law.ACC.PL.N. 
‘kindled according to the first divine laws (referring to Agni, TK)’ 

(55) RV 6,70,3: 

pra prajabhir jdyate dharmanas pari 

forth.ADV./pRV. descendant.iNSTR.PL. create.3sG.RASS. divine.law.ABL.SG. around.ADV./pRV. 

‘he is created further through descendants because of/according to divine law 
(being) around’ 

Hettrich uses the (slightly adapted and translated) diagram in Figure 1 . 14 to show the core 
and the peripheries of the usages of the instrumental in Vedic. This method of describing 
the function of cases in AIELs in my opinion is true progress compared to the traditional 
descriptions of the function of cases: It makes use of ideas from general linguistics (the 
use of features and the application of prototype theory), and it illustrates the results in 
a geometric symbolization (see Figure 1.14). Hettrich’s method is extendable to other 
languages and may lead to more detailed functional reconstructions. Any reconstruction 
is of course much less secure than the philological reasoning in a language with a clearly 
marked morphological instrumental. 



FIGURE 1.14 HETTRICH 2002: 47 (slightly adapted): THE SPECTRUM OF MEANING OF THE 
RG VEDIC INSTRUMENTAL (prototypical meaning in the circle, other meanings radiat- 
ing with arrows) WITH CONCURRING CASES (inside the rectangles) 
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The Hittite instrumental case and questions of reconstruction 

Let us have a look at another AIEL, Hittite. We here make use of the very detailed and 
philologically excellent dissertation by Melchert 1977, dealing with the functions of the 
ablative and instrumental in all historical strata of Hittite. 

Like in Vedic, the most common uses of the instrumental in Old Hittite (arguing func- 
tionally, not formally) are near its prototypical use, the instrumental of means. In (56) the 
prototype is evident: 

(56) (cf. Neu 1980, 105) Ritual text Nr. 43, Vs. I 12’ (= KBo XVII 43 12’): 

appa=ma=sta nea LU menean KV& sarazzit walahzi 

baek-but-PTCL. turn.3sG.MiD. ritual.official.Acc.SG. waterhose.iNSTR. beat.3sG.PRES. 
‘but he turns then, beats the ritual official with a waterhose.’ 

The instrumental KVS sai~azzit ‘with a waterhose’ (the Hittite instrumental ending -(i)t 
is not a morphological cognate of any Vedic instrumental form, and its formal origin 
remains uncertain) fulfills all the criteria mentioned above for a prototypical instru- 
ment. As in Vedic, there are other uses, e.g. one where the feature c) (cf. (49)) is taken 
away: 

(57) Old Hittite, Zalpa-Text (cf. Otten 1973: 6) A Vs. 2: 

tuppus sakanda sunnas 

container.Acc.PL. excrement.iNSTR. fill.3sG.PAST 
‘she filled the containers with excrement’ 

Taking into account the Vedic and Old Hittite examples just mentioned, one may 
reconstruct the functions “prototypical instrumental” (cf. the examples (50) and (56)) 
and “prototypical instrumental use minus feature c)” (cf. the examples (51) and (57)) 
for PIE. 

If one seeks for parallels to Vedic examples with a causal nuance like the one under 
(52) one does not find comparable uses in Hittite for the instrumental case, neither in Old 
Hittite nor in the subsequent stages of the language. This is a caveat for reconstructing 
this use of the instrumental case for PIE. 

On the other hand, the ablative in Hittite (already in Old Hittite) can be found in causal 
contexts, just like the Vedic ablative (cf. (55) above): 

(58) IBoT I 36 I 58-59 (cited with translation after Melchert 1977: 289 f): 

nu lu NI-DU 8 punussanzi nu GIS SUKUR.HI.A apas 

and doorkeeper interrogate. 3 pl.pres. and spear.Acc.PL. he.NOM.SG. 

nahsaraz (!) (i.e. nahsarattaz) uskizzi 

fear.ABL. see.3sc.PRES. 

‘They interrogate the doorkeeper and he will (always) see the spears out of fear.’ 

For a PIE reconstruction this means that the causal reading of the ablative is a candi- 
date for reconstruction, whereas the causal reading of the instrumental seems to be an 
Indo-Aryan innovation. 

Thus, an example like the Latin text from Plautus in (59) with a causal ablative will 
probably continue a PIE ablative and not a PIE instrumental: 
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(59) Plaut. Amph 1118 

nam mihi horror membra misero 

since 1 sg.pron.dat. horror.NOM.SG. limb.Acc.PL. poor.DAT.SG.M. 
percipit dictis tuis 

seize. 3 sg. pres. word. abl.pl. your.ABL.PL. 

‘since because of your words horror seizes the limbs of miserable me’ 


Case syncretism 

“Case syncretism” is the linguistic term for a formal collapse of originally separate func- 
tions of cases, so that one case fonn covers different functions. In more recent times there 
have been attempts to make use of models to get insights into this phenomenon. 

In his model of a causal order of participants, Croft (1991: 186) states that the (uni- 
versal) causal chain of constituents in a sentence is influenced by a cause-result scheme, 
“cause” starting the causal organization of sentence and “result” ending it. Conse- 
quently, he distinguishes semantic roles which belong to the “antecedent” type (such as 
AGENT, COMITATIVE, MEANS, MANNER, INSTRUMENT, SOURCE and CAUSE) 
from semantic roles belonging to the “subsequent” type (such as DIRECTION, GOAL, 
RESULT, BENEFACTIVE, RECIPIENT and LOCATIVE). Croft claims that case syn- 
cretism is preferably possible inside these types, viz. that case syncretism is preferred 
among antecedent roles or among subsequent roles, respectively. Croft offers a table of 
35 languages surveyed (Croft 1991: 237-239). In principle, he takes into account syn- 
thetic case forms and adpositional phrases, but he does not discuss data from AIELs in 
any detail. 

If we interpret the syncretism of the PIE ablative and genitive into the Greek geni- 
tive (cf. also Luraghi 1987: 362-363) along these lines, one could argue that the main 
function of the genitive case (also in PIE) is the marking of a (nominal) dependency. In 
the case of the frequent genitivus subjectivus (e.g. Gr. (po(3oq' xcov wo^epicov ‘fear of the 
enemies’ (meaning: ‘the enemies fear s.b.’) the genitive denotes an Agent and belongs to 
the “antecedent” group, just like the ablative with its core semantics (Source). 

It appears, though, that this model is too simplistic for the description of the syncre- 
tisms observed in AIELs. Luraghi (2001) argues that “illicit” syncretisms across Croft’s 
two groups do occur. Her main point is that not only semantics has to be considered 
when dealing with syncretism but also syntax and the lexicon. This point was already 
made by Serbat (1992: 281), who deals with the syncretism of the PIE ablative, instru- 
mental and locative (note: for Croft, the locative belongs to a different group than the 
ablative and instrumental) into a single case called “ablative” in Latin. He emphasizes 
that this syncretism has to be interpreted as mainly syntactic, since these cases form the 
central domain of the circumstantial cases (“adjuncts”), whereas the rest of the cases 
represent the core syntactic relations (“arguments” in the narrow sense). The Latin syn- 
cretism thus underlines a syntactic opposition formally. But one has to be careful about 
making too general statements: The ablative in Latin is also a formal mixture: The con- 
sonant stems still preserve the locative ending as the “ablative” in the singular, and the 
pure “ablative” in Latin was used mainly to express instrumental meanings, while the 
other functions were expressed either by certain prepositions + ablative or by lexical 
restrictions on certain verbs (cf., e.g., the pure “ablative” designating a location with 
incolit ‘lives (in)’ (PI. Rud. 907)). In her analysis of the syncretism of the PIE dative, 
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instrumental and locative into the Greek dative, Luraghi (2001 : 42) emphasizes that lex- 
ical features like +/-animate are important: If the Greek dative reflects a PIE dative, it 
has to have the feature [+animate] ; if it reflects the PIE instrumental, the relevant feature 
is [-animate]; and in the case of an original PIE locative, the following lexical features 
have to co-occur: [-animate], [+place] / [+ time]. In the exceptional case where some- 
one who is [+animate] is used as an instrument, a prepositional phrase 8ia ‘through’ + 
genitive is used (Luraghi 2001: 42). 


Argument structure 

This section, though it deals with one of the most discussed issues in PIE syntax, will be 
kept very short here, due to lack of space and also due to the fact that a long article with 
an overview of PIE argument structure and current research in this area by the author of 
this chapter will soon appear (Krisch, forthcoming). 


Intransitive, transitive, ditransitive and labile verbs 

Intransitive verbs, i.e. verbs that do not govern an accusative, do exist in the AIELs and 
in PIE. However, most instances of these verbs are transitives used “absolutely”, i.e. with- 
out an accusative. PIE also has some true intransitive verbs, now termed “unergative”, 
where the subject has agentive qualities, like the verb ‘to yawn’, PIE *g h i-ne/n-h r (LIV 2 : 
173). Another group of PIE intransitives are the “unaccusatives”, which have a subject 
exhibiting the properties of a direct object, e.g. PIE *key- Tie (outstretched)’ (LIV 2 : 20). 
The verbs of movement are situated between these two classes, since, on the one hand, 
when moving takes place, there is normally agentive activity involved but, on the other 
hand, with these verbs the mover and the moved one are identical, e.g. PIE *h,ey- ‘go’ 
(LIV 2 : 232). Unergatives show a tendency to take active endings (e.g. PIE *wiwek w -ti 
‘s/he is speaking’), whereas unaccusatives take stative/middle voice (=medio-passive) 
endings (e.g. PIE *key-(t)oy ‘s/he lies’) 

Transitive verbs, i.e. the verbs that govern an accusative, can be reconstructed for 
PIE. Also, the “absolute” use of transitives without an overt accusative is a heritage from 
PIE (see Krisch forthcoming for examples of AIELs with the root PIE *g wh en- ‘beat/kill 
someone’ (LIV 2 : 218)). 

Ditransitive verbs take a subject, object and “indirect object” and express transfer. 
Both of the objects either may be expressed or may remain unspecified in the AIELs and 
in PIE. Examples are PIE *deh,- ‘to give something to someone’ (LIV 2 : 105) and *b h er- 
‘to carry/ bring something to someone’ (LIV 2 : 76). 

Labile verbs, sometimes also called P(atient preserving)-labile verbs, are common in 
a number of modem IE languages but are rare in AIELs, and until now I have not seen a 
convincing example of a reconstruction. The object of these verbs can also serve as a sub- 
ject with the same verb without any formal change on the verb (as in English: John opens 
the door vs. The door opens). The reason these verbs are so rare in the AIELs: The early 
IE languages normally use a specific set of endings (the so-called middle or medio-passive 
endings) in contrast to active endings to indicate a decrease in valency in the way of the 
“labile” verbs. If one looks at the diachrony of Modem English, where this phenomenon is 
most conspicuous, the modem language definitely shows more productivity than the older 
stages in history (cf. Kulikov 2003: 96 with literature). There are also examples of labile 
verbs in Vedic Sanskrit (Kulikov 2003: 99; 2014). A good case is svadate (3 sg. middle/ 
medio-passive): transitive ‘someone makes sweet something’ (RV 3,54,22) vs. intransitive 
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‘something is sweet’ (RV. 9,74,9). Investigating Vedic Sanskrit, Kulikov (2003: 103-104; 
2014: 1145-1147) regards the intransitive forms of a few labile verbs such as pus ‘pros- 
per; make prosper’ as older and the transitive constructions as derived from constructions 
originally containing cognate accusatives. Lavidas (2009) demonstrates that a later devel- 
opment of labile verbs can be proven also for Greek. 


Subjecthood 

This part of the argument structure has received growing attention in the last years. The 
normal case form for the subject in the AIELs and in PIE was the nominative. But there 
does exist so-called non-canonical marking for the underlying subject in a number of 
languages. This phenomenon (sometimes also labelled “quirky subject” or “oblique sub- 
ject”) is most prominently present in Icelandic, but there is a growing group of research- 
ers around Johanna Barddal (Bergen) who are collecting and analysing instances of this 
construction from many IE languages (http://org.uib.no/iecastp/IECASTP/people.htm, 
seen 31 October 2016). In one of the latest publications, one finds the following defi- 
nition for the oblique subject: “With the terms oblique subject construction and dative 
subject construction, we . . . refer to constructions where the so-called logical subject is in 
an oblique case, for instance the dative” (Barddal & Smitherman 2013: 29). The authors 
consider the oblique subject construction inherited from PIE. They claim an inherited 
predicate structure (cognate argument structures; Barddal & Smitherman 2013: 53) and 
also present some cognate sets. This seems to me the right way to argue. One of their 
examples for a cognate set with this kind of construction is the Gennanic-Latin-Ancient 
Greek corresponding construction of the PIE adjective *sweh,d-u- ‘sweet’: 

(60) (cf. Barddal & Smitherman 2013: 42) 
a. Old English, Blick. Horn, 17: 74—75 
& him swete wceron to arcefnenne 

and him. dat. sweet were to perform. inf. 

‘and he enjoyed performing’ 

b) Latin, Lucr. 4,658 
quod suaue est aids 
which sweet is others.DAT. 
which others like/find sweet 

c) Ancient Greek (Homer), Od. 24,435^136 

ouk av epoi ye peid cppsaiv 1)66 yevouo ijcoepev 

not ptcl. me .dat. at.any.rate with sou 1 s.dat. sweet becomes. 3 sG.OPT. live.iNF. 

‘Then in my heart, I would not find it at all sweet to live.’ 


CONCLUDING REMARKS 

After touching upon the history of research and the data situation, this chapter tried to 
present major fields of study in PIE syntax today: word order; absolute constructions; 
case functions, including syncretism; and argument structure. Because of the author’s 
own research interests, the section on word order was especially extensive. The field of 
PIE syntax radiates in many directions and hopefully will lead to a coherent picture of the 
syntactic properties of the proto-language. 
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FURTHER READING 

The text of this chapter has already contained a number of references for the topics dealt 
with here. One of these references, the English translation of Wackernagel’s lectures on 
syntax (1920, 1 924) by Langslow (2009), is an especially interesting read because it com- 
bines a masterly translation of one of the most influential books on comparative syntax 
of the classical languages and Germanic of the 20th century with many footnotes, com- 
paring Wackemagel’s ideas with modem viewpoints. Gotz Keydana, who has been cited 
with a number of contributions in this chapter, also wrote an article with a survey of PIE 
syntax that has been published on the internet, which is worth reading (Keydana 2008). 

Some 20 years ago, the introductory textbooks on IE linguistics either did not have 
chapters on syntax at all (e.g. Szemerenyi 1996) or devoted only two or three pages to 
it (Beekes 1995: 93-95). In more recent introductions, the chapters on syntax are still 
shorter than those on phonology and morphology, but one can observe an increasing 
interest in integrating syntax into the picture of PIE. I can recommend Clackson’s (2007: 
157-186) chapter on PIE syntax, where general methodological questions of syntactic 
reconstruction, word order (including Wackemagel’s Law), clause-linking, alignment 
change and PIE phraseology are treated. There is also a very useful “Further Reading” sec- 
tion. Fortson (2010: 152-169) deals with topics like constituency (tmesis), subject-verb 
agreement, Wackemagel’s Law, negation, absolute constructions and the interaction of 
prosody and syntax followed by a short list of “further readings”. Meier-Briigger (2003: 
238-276; the syntax part is by Matthias Fritz) offers a short discussion of and many 
bibliographical references to (morpho-)syntactic issues like parataxis and hypotaxis, non- 
finite constructions, tense-aspect-mood, diathesis, case and gender. 


SOME ABBREVIATIONS USED 

Abs.: absolutivum/gerund; AC: absolute construction; AIELs: ancient Indo-European 
languages; AP: adjectival phrase; C or COMP: complementizer (= subordinate conjunc- 
tion not forming a phrase in the sentence, e.g. that, if, because', this slot may also be 
occupied by the finite verb, see the German generative tradition; cfi, e.g., Haider 2010: 
57); CoT: change of topic particle; CP: complementizer phrase; Df-slot: discourse- 
functional slot; DP: determiner phrase; Mid.: middle (=medio-passive); NP: noun phrase; 
O: object; t: trace left behind when syntactic movement occurs; PP: prepositional phrase; 
PRO: empty subject, e.g. in infinitival constructions; pro: “small pro”, non-overt noun 
phrase; rp: relative pronoun; S: subject; SConn: sentence connector; V: verb; V-l: verb 
in sentence-initial position; V-2: verb in second position; V-3, verb in third position; 
VP: verb phrase; WL 2 , Wackemagel clitics serving as sentence connectors; WL„ other 
Wackemagel clitics (pronouns); WL 3 , particle adjacent to the word or constituent it takes 
scope over; XP: any phrase. 


NOTES 

1 I thank Stefan Niederreiter for proofreading the text. The remaining mistakes are mine, 
of course. 

2 Cfi, e.g., the various reconstructions of Schleicher’s fable in http://en.wikipedia. 
org/wiki/Schleicher’s_fable#L.C3.BChr_.282008.29 [seen 31 October 2016] or in 
Birkhan 1985: 307-308. If one compares the syntax of Schleicher’s reconstruction 
from 1868 (cited in Lehmann & Zgusta 1979: 456), dadarka akvams (see.3sG.PERF.ACT. 
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horse. acc. pl.), and Peters’ reconstruction from 1980 (in Birkhan 1985: 308), dedorke 
(hjeRuons (see.3sG.PERP.ACT. horses.Acc.PL.), with the reconstruction of Lehmann and 
Zgusta, ekwons espeRet (horses.Acc.PL. catch. sight.of.3sG.AOR., Lehmann & Zgusta 
1979: 462), there is a V(erb)-0(bjeet) order in the first two versions and an OV order in 
the last one. The OV order clearly reflects Lehmann's syntactic views (cf. also above). 
One may find a very witty example for a “new” PIE text in the starting episode of the 
Austrian film MA 2412. Die Staatsdiener (MA 2412. The civil servants) by Harald 
Sicheritz (2003). The PIE text of this film (with German subtitles) was provided by 
David Stifter (now a professor at Maynooth, Ireland) (2003). 

3 For the meaning and origin of the Greek particle rap, cf. Katz 2007. 

4 A much more complete version of the project can be accessed through the internet 
(Hettrich 2007). There is much other recent work on functions of cases in AIELs, 
including monographs (e.g. Hajnal 1995, especially on Myc. -pi; Luraghi 2003 on 
prepositional phrases in Greek) and articles on the accusative (e.g. Boley 2002, Goto 
2002), the dative (e.g. Nowicki 2002, Hettrich 2006) and the genitive (Neumann 2001, 
Serbat 1992). For Ancient Greek (mainly prepositional phrases) I can recommend a 
number of very thoroughly argued articles by Conti (e.g. 1996, 2003). 
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PART 2 


PROTO-INDO-EUROPEAN 
AND LANGUAGE TYPOLOGY 

Ranko Matasovic 


INTRODUCTION 

This chapter will give an overview of the typologically relevant features of Proto-Indo- 
European. By “typologically relevant” we mean those features that are cross-linguistically 
rare, or those that are correlated with other features, thus defining a linguistic “type”, a set 
of languages sharing a number of logically independent features. Moreover, we will con- 
sider those features of PIE that show genetic and/or areal patterning, in the sense that they 
statistically tend to be more common in certain linguistic families and/or language areas 
than in others. Particularly relevant are diachronically stable features that tend to be shared 
by most genetically related languages of a single family. Such features are important for 
questions about the deep genetic relatedness of Indo-European and about early language 
contacts between PIE and its neighbours. It has been long suggested that Indo-European 
languages and Uralic languages are related, forming a putative Indo-Uralic family (e.g. 
Collinder 1960), and Vladimir Illic-Svityc (1971-84) included both Indo-European and 
Uralic into a wider family, “Nostratic”, which would also include Altaic, Dravidian, Kart- 
velian and Afro-Asiatic languages. Several variants of this hypothesis are currently in 
circulation, including the daring proposal by Joseph Greenberg (2000), who also includes 
Gilyak, Ainu, Chukchi-Kamchatkan and Eskimo- Aleut (but excludes Afro-Asiatic, Kart- 
velian and Dravidian) in his “Eurasian” language family. None of these hypotheses can 
be considered as proven, but many linguists believe that Indo-European is indeed related 
to Uralic, and that it is more likely to be related to Altaic and Kartvelian than, say, to NW 
Caucasian or Sino-Tibetan. As this chapter will show, PIE shares a number of typologi- 
cally relevant features with Uralic and, to a lesser extent, with Altaic and other language 
families of Northern Eurasia. However, PIE also shares a considerable number of features 
with languages of the Caucasus, including not only Kartvelian - which is included in the 
putative Nostratic macrofamily - but also NE and NW Caucasian languages, to which it 
is probably unrelated. These shared features can be understood only as results of prehis- 
toric language contacts, so they are very relevant for the question of the homeland of the 
speakers of PIE (Mallory 1989, Anthony 2007). 


PHONOLOGICAL TYPOLOGY 

There is far more agreement, among linguists, about the reconstruction of PIE phonology 
than there is about the reconstruction of the proto-language’s morphology and syntax. 
Today, it is almost universally agreed that PIE had three series of stops, traditionally 
labelled as voiceless, voiced and (voiced) aspirated (i.e. *T, *D and *D h , respectively). 
The phonological interpretation of these segments remains an open matter. The traditional 
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interpretation has been challenged by Roman Jakobson (1962) and other linguists, since 
there are apparently no languages that have voiced aspirates unless they also have voice- 
less aspirates. Moreover, the very term “voiced aspirates” is somewhat misleading, since 
consonants that are represented (and transcribed) as /b h , d h , g h /, etc., in Hindi and other 
Indo-Aryan languages actually have breathy voice. It is still not clear whether there are 
languages with voiceless and voiced stops as well as those with breathy voice (but with- 
out voiceless aspirated stops), but if such languages exist, they are certainly very rare. 
Moreover, it has been pointed out that the apparent rarity, or non-existence, of PIE *b 
is at odds with the typological generalization that if one of the labial stops is missing in 
a language with a voice opposition in stops, it is the voiceless labial /p/ rather than its 
voiced counterpart Pol that is missing (e.g. in Classical Arabic). These facts led to a pho- 
netic re-interpretation of stops in PIE, known as “glottalic theory”, first proposed by Paul 
Hopper and, independently, by Tamaz Gamkrelidze and Vyacheslav Ivanov, in the early 
1970s (see Gamkrelidze & Ivanov 1984 and Salmons 1994, with references). According 
to this theory, the stops traditionally interpreted as voiced were actually glottalized, pre- 
sumably ejectives. It has been noted that the bilabial ejective stop (/p’/, corresponding 
to the *b of traditional Indo-European reconstruction) is highly marked, and that it is 
sometimes lacking in languages having ejective consonants. The change of glottalized 
stops into voiced stops, which has to be posited for several branches of PIE (e.g. for Italic, 
Celtic and Slavic), is well attested in a number of languages, and the developments in 
Germanic and Armenian (where voiceless stops correspond to PIE *b, *d, *g, etc., of the 
traditional reconstruction) are seen as simple deglottalization. Moreover, certain sound 
developments, such as Winter’s Law in Balto-Slavic (the lengthening of vowels before 
voiced stops, perhaps only in closed syllables), have been claimed to be more plausible, 
from the phonetic point of view, if the glottalic theory is correct. However, the glottalic 
theory has not been generally accepted by Indo-Europeanists, and many scholars prefer 
to remain agnostic about the correct phonetic interpretation of the PIE system of stops. 

In terms of the number of segments, PIE was well within the average of Eurasian 
languages. However, the number of stops, with three series and three different types 
of gutturals (plain, palatalized and labiovelar), is unusual, as such complex systems are 
found mostly in the languages of the Caucasus and in those of South Asia and SE Asia. 
If one accepts the glottalic theory, it is highly significant that Caucasus is the only area 
in Eurasia where ejective consonants are common. They even spread to some intrusive 
languages, such as dialects of Armenian and Ossetic. 

One disputed matter in PIE phonology is directly relevant to the typological profile 
of the proto-language. It is the number of vowels: in traditional handbooks, it is usually 
assumed that PIE had the typologically most widespread five-vowel system, with a length 
contrast, i.e. that there were ten vocalic phonemes (*a, *e, *i, *o, *u, *a, *e, *1, *o, *u; cfi, 
e.g., Cowgill & Mayrhofer 1986, Szemerenyi 1989). However, the discovery of laryn- 
geals has allowed Indo-Europeanists to show that *a, *1 and *u were actually sequences 
of short *a, *i and *u and laryngeals (i.e. that we should reconstruct *eh 2 instead of *a, 
and *iH and *uH instead of *T and *u respectively), and it has been suggested that all 
instances of alleged PIE *a can be eliminated, either by positing new sound laws or by 
assuming that etymons with this vowel were early borrowings from unknown sources, 
chiefly in the languages of the European branches of Indo-European (Lubotsky 1989). It 
has also been argued that the PIE long vowels *e and *o were certainly rare, occurring 
only in a number of morphological categories, such as the sigmatic aorist and the nomi- 
native singular of certain nominal stems, and it appears likely that they developed in Late 
PIE by lengthening of *e and *o, respectively, in certain environments. For example, the 
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long vowel in the noin. sg. of *ph 2 ter ‘father’ has been plausibly derived from earlier 
short *e in Early PIE *ph,ter-s, with the nom. sg. ending *-s, by compensatory length- 
ening, i.e. with the development *-ers > *-err > *-er. Furthermore, if the high vowels *i 
and *u are analysed as syllabic allophones of the glides *y and *w, we are left - at least 
in Early PIE - with a binary vocalic system with only *e and *o, typologically similar to 
the binary systems found in NW Caucasian languages such as Abkhaz and Ubykh. Pho- 
netically, *e and *o could have been hi and /a/, since languages with two-vowel systems 
appear always to oppose a low and a mid vowel, rather than a front and a back vowel. 

PIE vocalism was characterized by productive ablaut. In PIE, this type of non-linear 
morphology was used both in the nominal inflexion (to distinguish case forms, e.g. nom. 
sg. *pod-s vs. ace. sg. *pod-m and gen. sg. *ped-os or *pd-os) and in the conjugation of 
verbs (e.g. to distinguish the singular forms of athematic presents from their plurals, cf. 
Skr. asmi ‘I am’ < PIE *h[esmi vs. smds ‘we are’ < PIE *h 1 smes). Similar morphologi- 
cally conditioned vowel alternations exist in several language families of Eurasia, most 
notably in Afro-Asiatic and Kartvelian (Gamkrelidze & Macavariani 1982). However, 
in those language families ablaut is far more productive in the verbal system than in the 
inflection of nouns. 

It is very likely that PIE had a simple, free tone accent, similar to that found in Vedic, 
where two phonological tones (high and low) must be distinguished. In Ancient Greek, 
the accent was tonal, but its position was wholly predictable on verbs and partially pre- 
dictable on nouns. The accent in Latin was described as tonal by Roman grammarians, 
but its position is completely predictable and it is likely that all Italic languages went 
through a stage with a dynamic accent which caused syncope and the reduction of vowel 
timbres in unaccented syllables. The evolution of tonal accent in Balto-Slavic is still con- 
troversial: most scholars would posit an opposition between an acute and a circumflex on 
long vowels and diphthongs, and a simple high tone on short syllables, but it is unclear 
whether this system developed from PIE tonal oppositions or arose independently after 
the loss of PIE laryngeals and some Balto-Slavic phonological processes such as Winter’s 
Law. The nature of accent in Anatolian and Tocharian is unclear, and all the other early 
Indo-European languages had dynamic accent (stress). With very few exceptions, mod- 
ern Indo-European languages have dynamic accent; the old tonal oppositions were pre- 
served in parts of the South Slavic area and in the remaining Baltic languages, and new 
tonal accents arose independently in parts of Scandinavia and in some West Germanic 
dialects (Franconian). On the whole, languages with tonal accent are rare in Eurasia: 
Altaic, Uralic, Kartvelian, Afro-Asiatic, Dravidian and most languages of Siberia (with 
the exception of Ket, which belongs to the small Yenisean family) are characterized by 
dynamic accent, while complex tonal systems are clearly an areal phenomenon in SE 
Asia, in languages belonging to different families such as Sino-Tibetan (e.g. Mandarin 
and Myanmar), Austro-Asiatic (e.g. Vietnamese), Daic (e.g. Thai), etc. If PIE had tones, 
its accentual system was certainly not similar to systems found in SE Asia, but it may 
have been similar to the simple tonal system reported for the NW Caucasian language 
Abaza and some NE Caucasian languages such as Ingush. 

MORPHOLOGICAL TYPE, WORD CLASSES 
AND LEXICAL TYPOLOGY 

PIE was a synthetic, fusional language, which means that its words were polymorphemic 
and that single morphemes could express more than one grammatical category (e.g. the 
Lat. dat. pi. ending -ibus in ped-ibus ‘feet’ cannot be segmented into separate morphemes 
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for case and number). In this respect, it was similar to Afro-Asiatic and Kartvelian lan- 
guages, which are also predominantly fusional, while the North Caucasian, Uralic and 
Altaic languages are mostly agglutinative (e.g. the Hungarian acc. pi. ember-ek-et ‘peo- 
ple, men’ contains separate morphemes for plural, -ek-, and accusative, -ef). It has been 
claimed that there are traces of an early, more agglutinative morphology in PIE (cf., 
e.g., Szemerenyi 1989): for example, the loc. pi. ending of the o-stems *-isu has been 
segmented into the plural morpheme, *-i- found in pronouns, and the locative ending, 
*-su, found in consonant stems, which developed from an earlier postposition. Likewise, 
the first person plural ending *-mes/-mos (e.g. Gr. Dor. -pe<;, Lat. -mus) may have been 
originally segmentable into the first person marker *-m and the plural marker *-es/-os), 
but such analyses are very speculative. In terms of the level of synthesism, PIE had 4—5 
categories per word, which is the most common type cross-linguistically according to 
WALS (feature 22A, “Inflectional synthesis of the verb”), in between the extreme cases 
of the radically isolating languages of SE Asia (e.g. Vietnamese, with 0 categories per 
word) and polysynthetic languages, such as Circassian (NW Caucasian), with as many as 
12 categories per word. 

Like Uralic, Altaic, Kartvelian and NE Caucasian languages, Indo-European lan- 
guages are dependent-marking (Nichols 1992), which means that the dependent element 
of a syntactic constituent is morphologically marked, rather than its head. Dependent- 
markedness can be realized on different levels: 

1) in dependent-marking adpositional phrases, the governed noun is case-marked, 
while the governing adpositions are unmarked, e.g. in the Latin phrase in urbe ‘in 
the city’, where the noun urbs ‘city’ is in the ablative singular, and the preposition 
in is unmarked. The converse of this is the head-marked adpositional phrase, where 
the adposition is marked (usually by person and/or gender markers), while the noun 
is unmarked (such a construction exists, for example, in the NW Caucasian language 
Abkhaz). 

2) in the dependent-marked possessive construction, the possessed noun is marked 
(usually by the genitive case), while the possessor is unmarked, e.g. in Lat. domus 
uin ‘the man’s house’. In a head-marking possessive construction, the possessed 
noun would be unmarked, while the possessor would get a possessive affix, e.g. in 
Hungarian az ember haz-a (the / man / house-his) ‘the man’s house’. 

3) on the clause level, in dependent-marking languages objects are case-marked (usu- 
ally by the accusative case), while verbs are unmarked if they have direct objects, e.g. 
in Lat. uide-o uir-um (see-lsg.pres / man-acc.sg.) ‘I see the man’. In a head-marking 
construction, the object would not be case-marked, while the verb would express 
both the person of the subject and the person of the object (again, this is what we find 
in Abkhaz, among other languages). 

On the whole, dependent-marking languages predominate in Eurasia, but there are excep- 
tions like the NW Caucasian languages and the isolates Burushaski (in the Hindukush) 
and Ket and Nivkh (in Siberia). 

Like the vast majority of languages of Northern Eurasia, PIE was a predominantly suf- 
fixing language. Whether it had any prefixes at all, apart from the negative prefix *n- (> 
Lat. in-, Gr. a-, Skr. a-), is questionable. The prefix *ni-, identified in *ni-sdo- ‘nest’ (Lat. 
nidus, etc., from the root *sed- ‘sit’ > Lat. seded, OCS sesti, Gr. e^opai, etc.) is an often 
mentioned candidate, but it is possible that it should actually be interpreted as a nominal 
element *h,ni-, a locative singular of the same root found in Gr. civco ‘up’, Lith. nuo 
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‘from’ and OCS na ‘on’. In that case, we should not reconstruct a prefixed noun *ni-sdo- 
but rather an old compound *h,ni-sd-o- ‘upward seat’ > ‘nest’. There was one infix in 
Late PIE, the present-stem infix *-ne-/-n-, e.g. in Lat. tango ‘I touch’ vs. the perfect tetigT 
(without the infix), or the Skr. 1 sg. pres, yunajmi ‘I harness’ vs. the aor. ayojam (without 
the infix). It is probable that this present-stem infix developed from an earlier nasal suffix, 
by a metathesis in certain types of roots, cf. hdi.pando ‘I spread’, which developed from 
*pat-n- by metathesis (Gr. 3iiivr|pi < PIE *pt-ne-h,-). Verbs with a nasal present in PIE 
almost universally have roots ending in stops, never in resonants or *s, so the metathesis 
was perhaps regular in structures of the type *C[VC 2 -n- > *C,VnC 2 -, where C 2 was a stop. 

Almost all the Indo-European languages remained predominantly suffixing, with the 
possible exception of modem Insular Celtic languages such as Irish, which is classi- 
fied as equally prefixing and suffixing by WALS (feature 26A). However, some prefixes 
developed in nearly all branches of Indo-European, mostly in the verbal system, e.g. the 
aspectual prefixes in Slavic, or the Germanic verbal prefixes with directional meaning 
such as German bei-, auf-, ein-, etc. These prefixes usually developed from adpositions, 
and this is a process that may be observed in its development in the early history of 
Greek. In Homer, verbal prefixes are still detachable from the verbal root, and interven- 
ing elements occur freely. This is the so-called tmesis, e.g. Tekaaae 5e naaa ;rspi ^Qrov 
‘the whole earth laughed around’, II. 19.362 (TeLaaae . . . Jispi instead of nepiyekaaae). 
However, in later Greek, the association of prefixes with verbal stems becomes fixed. 
Adpositions are prepositions in most Indo-European languages, but in Vedic and in the 
Old Anatolian languages they are postpositions, so most linguists assume that they func- 
tioned as postpositions in PIE as well. It has also been argued that PIE did not have any 
adpositions, but that they developed from adverbs independently in early Indo-European 
languages (Cornrie 1998). Languages without adpositions are rather rare typologically, 
but there are such languages, especially among the Aboriginal languages of Australia. 
However this may be, it is worth noting that many, and perhaps most, PIE adpositions 
can be derived from earlier nouns, petrified in some grammatical case. For example, the 
adposition *h,enti ‘against, in front of’ (Gr. dvxi, Lat. ante ) is actually the petrified loc. 
sg. of a noun preserved in Anatolian, which meant ‘face’ (Hitt, hanza < PIE *h,ent-s). 

The lexicon of PIE and all of the early attested Indo-European languages was verb- 
based, rather than noun-based (Nichols 2007). This means that simplex verbs were an 
open class and many nouns were derived from verbal roots, e.g. the word for ‘word, 
utterance’, *wok w -s (Skr. vak-, Lat. uox), is derived from the root *wek w - ‘to say, utter’ 
(LIV 614, cf. Ved. vivakti ‘says’); the word for ‘eye’, *h 3 ek' v - (Lat. oc-ulus, OCS oko, Gr. 
Horn. du. oaas), is derived from the verbal root meaning ‘to see’ (LIV 264, cf. Gr. fut. 
ov|/opat, Skr. fksate ‘perceives’); and the word for ‘wheel’, *k w ek w los (Gr. kukLo<;, Skr. 
cab-a-), is derived from the root *k w elh,- ‘to turn’ (LIV 345, cf. Ved. carati ‘moves’). In 
noun-based languages, on the other hand, lexical verbs are a small, closed class, and new 
verbal meanings can be derived by using constructions with auxiliaries (light verbs) and 
nouns, e.g. in Ingush (NE Caucasian) Cuo nab ju (He-erg. / sleep / does) ‘He sleeps’. 
Of course, PIE also had nouns underived from verbal roots, but they seem to have been 
a closed class, e.g. *kwon ‘dog’ (Skr. svan-, OIr. cu, Lat. cants , etc.), *ped- ‘foot’ (Skr. 
pad-, Lat. pes, etc.). Languages with noun-based lexica include Basque, the Dravidian 
languages and many NE Caucasian languages, while verb-based lexica are typical of the 
NW Caucasian languages and (to some extent) Semitic languages. 

Cardinal numerals from one to four were inflected for gender, case and number, like 
nouns and adjectives, and the higher numerals were uninflectable. From the grammati- 
cal point of view, the PIE numerals are hardly distinguishable from adjectives; i.e. their 
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identification as a separate word class is questionable. Likewise, participles were syn- 
tactically adjectives derived from verbs. They distinguished different gender forms, just 
like ordinary adjectives, and were inflected for case and number. The participles included 
the present active participle (PIE *b h er-o-nt- ‘carrying’ > Skr. bharant-, OCS berv, Gr. 
(pepcov, etc.), the perfect participle (PIE *weyd-wos- > Skr. vidvan-, Gr. eiSdx;), the pres- 
ent middle participle (PIE *b h er-o-mh,nos > Skr. bharamanas, Gr. (pspopevoq) and a ver- 
bal adjective in *-to- with passive meaning (PIE *b h rtos > Skr. bhrta-, OIr. - breth ). In 
areal terms, rich systems of participles are very common in Uralic and Altaic languages, 
but they are also well attested in many Caucasian languages. Verbal adverbs (also called 
“gerunds” or “converbs”), forms translatable as ‘while doing X’, where X is the verb 
from which the adverb is derived, cannot be posited for PIE. They are very common in 
Altaic and North Caucasian languages. 

PIE also had interjections and particles (e.g. the negation *ne > Lat. ne, Skr. na, OCS 
ne, etc.), and there may have been some conjunctions, though these are usually clit- 
ics in the early Indo-European languages (e.g. Skr. -ca, Lat. -que, Gr. -re < PIE *-k w e 
‘and’). These word classes are probably universal, or nearly so. We may note the lack 
of proper conjunctions in Altaic languages, except for some loanwords from Arabic or 
Indo-European languages. 

Although a lot has been written about the word classes in PIE, there has been relatively 
little research into the lexical typology of the proto-language. A recent study on the dia- 
chronic typology of motion event encoding (Verkerk 2014) found some evidence that PIE 
tended towards the satellite-framed type of language, which means that motion events were 
more commonly encoded by verbs and adpositions, as in English walk across (the river), 
rather than by verbs alone, as in the verb-framed type of language, such as French, where 
the equivalent construction is traverser (la riviere). The distinction between satellite-framed 
and verb-framed languages, proposed by Leonard Talmy (e.g. Talmy 2007) is not absolute, 
of course, and PIE, like most of its descendants, certainly had both verb-framed and sat- 
ellite-framed constructions. However, some branches tended to develop more towards the 
verb-framed end of the scale (e.g. Romance and Indo-Iranian), while others evolved more 
towards the satellite-framed type (e.g. Germanic and Balto-Slavic; see Verkerk 2014). 

GRAMMATICAL CATEGORIES 

The PIE nouns, adjectives and pronouns distinguished the grammatical categories of 
case, number and gender. The case system of PIE is fairly typical of a large number of 
languages and language families of Eurasia. The case system in PIE, with its eight or nine 
cases (nominative, accusative, vocative, genitive, dative, ablative, locative and instru- 
mental), is comparable to the one found in several Altaic languages (especially in the 
Turkic branch), but it is significantly smaller than the exuberant case systems, with many 
local cases, found in Uralic or NE Caucasian. A typologically unusual feature of the PIE 
case system is the fact that the nominative case of the masculine and feminine gender is 
marked (usually with the morpheme *-s), cf. PIE *h,ekwo-s ‘horse’. In most languages 
with the nominative-accusative alignment system, the nominative is the unmarked case, 
while the accusative is usually marked by an affix. In PIE, the accusative of the feminines 
and masculines (and the nominative-accusative of the thematic neuters) is marked with 
the suffix *-m (cf. PIE ace. sg. *h[ekwo-m ‘horse’, nom./acc. sg. n. *yugo-m ‘yoke’). The 
nominative-accusative of the athematic neuters was unmarked (cf. PIE *med h u ‘mead’). 
Another typologically interesting feature of the PIE case system is the widespread syn- 
cretism of core and non-core inflectional case -marking, the pattern in which two different 
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case functions are expressed by a single morpheme (WALS feature 28A). In PIE, this 
type of syncretism affected both core cases (e.g. the nominative, vocative and accusa- 
tive of neuters) and non-core cases (e.g. the ablative and genitive singular, except in the 
thematic inflection), but this pattern is rather rare cross-linguistically, as case syncretism 
usually affects only core cases. However, the PIE pattern of case syncretism is found in 
some Uralic languages (e.g. Finnic and Nenets) and in some languages of the Caucasus 
(e.g. Georgian and Lezgian). 

Some PIE nouns belonging to the neuter gender (on which see below) had heteroclitic 
inflection, which means that they formed their oblique cases from a stem different from 
the stem of their nominative-accusative singular, e.g. PIE *yek w r ‘liver’ (nom./acc. sg.) 
vs. *yek w en-s (gen. sg.). This pattern of inflection is preserved only in a few archaic cases 
in early Indo-European languages (e.g. Lat. iecur ‘liver’, gen. sg. iocineris, Skr. yakrt, 
gen. sg. vaknas), but it is still fairly common in the Anatolian branch, and it was probably 
more widespread in PIE. This kind of inflectional pattern, where a noun has different 
stems in grammatical and in oblique cases, is rare in Eurasia, but it is well attested in the 
NE Caucasian family (Matasovic 2012; cf., e.g., Dargi nes ‘mother’ with oblique stem 
nes-li-, dat. sg. nes-li-s). 

The number system, with its opposition of singular, dual and plural, is not so common 
in Eurasia. In many languages, the number oppositions are not obligatorily expressed; i.e. 
nouns and verbs are inherently neither singular nor plural (nor dual), although there may 
be affixes indicating number which may be used when this is appropriate (e.g. in Japa- 
nese). In PIE, the expression of number was obligatory for all nominal and verbal forms, 
and the dual number is probably as old as the singular and the plural, although it appears 
to have been lost in Anatolian (Fritz 2011). The dual is particularly rare in languages 
of Northern Eurasia: it is found in some Uralic languages (especially in the Samoyedic 
branch), but not in Altaic, Sino-Tibetan (though there are exceptions) and Dravidian, and 
it is generally absent in the languages of the Caucasus. However, early Semitic languages 
had the dual, but it was gradually lost in most of them. 

Late PIE had three genders: masculine, feminine and neuter. It is highly probable 
(although not generally accepted) that this system developed from an earlier one in 
which only two genders were distinguished, a common gender (for animates and some 
inanimates) and a neuter gender (for the remaining inanimates). Such a system could be 
reflected in the Anatolian languages, which may have separated from the other branches 
of Indo-European before the feminine gender developed (Matasovic 2004). The gender 
system of PIE is typologically somewhat unusual when compared with the gender sys- 
tems of other language families of Eurasia, particularly with those found in Dravidian 
and NE Caucasian (e.g. Chechen). In most of them, purely semantic principles of gender 
assignment are found; i.e. the gender of a noun is predictable only from its meaning. In 
PIE, on the other hand, gender can be assigned to only a few nouns on the basis of their 
meaning only: nouns denoting male persons and higher animals are masculine, while 
nouns denoting females are feminine. For all the remaining nouns, gender is partly pre- 
dictable from the declension class that the noun belongs to. For example, nouns derived 
with the suffix *-ti- are feminine (e.g. *mntis ‘thought, memory’), while those derived 
with the suffix *-r-/-n- (the heteroclites) are neuter (e.g. *yek w r ‘liver’). Surprisingly, the 
only language in Eurasia with gender assignment principles and three genders, similar 
to those found in PIE, is Ket, the only surviving language of a small Yeniseian family in 
Siberia (Matasovic 2004). 

Most modem Indo-European languages preserved gender, but the reduction of the 
three inherited genders to a binary system was common, e.g. in Insular Celtic, in most 
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Romance languages (the neuter gender is partially preserved in Romanian), in Lithuanian 
and Latvian, and in many Indo-Aryan languages such as Hindi. However, gender was lost 
almost without a trace in Armenian and in some Iranian languages such as Farsi. 

Most language families of North Eurasia are genderless, and it has been shown that 
gender is a genetically very stable feature, in the sense that languages belonging to gen- 
derless families almost never acquire it, while it is seldom lost in languages (Nichols 
1992, Matasovic 2004). Even in the Indo-European family, gender was completely lost in 
only a few languages, such as Armenian and a number of Iranian languages. Thus, Altaic, 
Uralic, Sino-Tibetan, Daic, and most language isolates of Northern Eurasia are gender- 
less, while Dravidian, Afro-Asiatic and NE Caucasian have gender. 

PIE pronouns distinguished case, number and gender, just like the nouns and adjec- 
tives, but their inflection patterns were very different. It is not only that they had different 
case/number endings (e.g. the nom. pi. ending *-i vs. the ending *-es found in masculine 
and feminine nouns), but they were characterized by a high degree of suppletivism. For 
example, the first person singular pronoun had the stem *(h[)eg- in the nominative (cf. 
Lat. ego, Gr. eyco), but the stem *(h[)me- in the oblique cases (cf. Lat. acc. me(d), Gr. 
eps), and one of the demonstrative pronouns had the stem *so- in the masculine and 
feminine nominative singular, but *to- for the oblique cases and the neuter nominative/ 
accusative singular (cf. the opposition between Skr. nom. sg. m. sas ‘this’ and nom./acc. 
sg. tad). Moreover, the personal pronouns had different stems in the singular and plural 
(1 sg. *hjeg- vs. 1 pi. *we-/*ne-, 2 sg. *tuH- vs. 2 pi. *we-/*yuH-), which is a common 
pattern in Caucasian languages and in the Uralic languages spoken in Europe (e.g. Finn- 
ish and Hungarian). Cross-linguistically, this pattern of suppletivism is also common in 
other macroareas (e.g. in Africa and parts of South America), but not in most parts of 
Eurasia (e.g. in Altaic, Sino-Tibetan and most Uralic languages). It is difficult to ascertain 
how many distance contrasts were found in PIE demonstratives. Early Indo-European 
languages usually have three, roughly corresponding to Lat. hie / iste / ille, but in some 
languages a two-way contrast is also found (as in the English opposition between this 
and that). Cross-linguistically both patterns are equally common and widespread in most 
of the world (WALS feature 41 A), with the two-way contrast slightly predominating in 
Eurasia. 

It has been argued that PIE pronouns distinguished between inclusive and exclusive 
forms of the first person plural (e.g. Lehmann 1993, Gamkrelidze & Ivanov 1984). The 
exclusive form in PIE would have been built with the stem *ne- (Lat. nos), while the 
inclusive would be the stem in *we- (Skr. vdyam, Eng. we). This would explain not only 
why there are two different stems for the first person plural pronoun, but also why the 
stem *we- is also found in the second person plural in some languages (e.g. Lat. uos): it 
could be argued that in those languages the inclusive fonn (originally referring to both 
the speaker’s and the hearer’s group) narrowed its reference to the hearer’s group only 
(i.e. to the plurality of the addressees, the second person plural). However, the hypothesis 
is strictly unprovable, and it is worth recalling that the opposition between exclusive and 
inclusive pronouns is unattested in the early Indo-European languages (it developed in 
a few modern Indo-Aryan languages, such as Gujarati and Marathi, presumably under 
the areal influence of Dravidian and/or Munda languages, where it is sometimes found). 
Moreover, it is not characteristic of most language families of Central Eurasia, with the 
single exception of NE Caucasian languages (it is found, for example, in Chechen). 

There is no consensus about the grammatical categories reconstructable for the PIE 
verb. It is clear that verbs were inflected for person/number of the subject, as in most 
language families of Eurasia. Agreement with the object (found in languages like Basque, 
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or the NW Caucasian languages such as Abkhaz) cannot be posited for PIE, nor can we 
assume that gender distinctions were ever made on PIE verbs (as in Afro-Asiatic or NW 
Caucasian languages). 

The moods reconstructable in Late PIE are (besides the unmarked indicative) the 
imperative (*b h ere ‘carry!’), the subjunctive (*b h er-e-e-si ‘that you carry’), the optative 
(*b h er-o-ih 1 -s ‘may you carry’) and, perhaps, the desiderative (*b h er-is-ye-si, which is 
semantically difficult to distinguish from the optative). Note, however, that the Anato- 
lian languages distinguish only between the indicative and the imperative, and it is not 
entirely certain that they lost the optative and the subjunctive (although many specialists 
on Anatolian believe they did). Be that as it may, the subjunctive was also lost in some 
Indo-European languages (e.g. Balto-Slavic), or it was replaced by new subjunctive for- 
mations, e.g. in Celtic and Germanic. In Latin it lost its modal function and became the 
future tense of the verbs of the 3rd and 4th conjugation (e.g. Lat. leget ‘will read’). The 
optative was also lost in many languages, but in Slavic it became the new imperative (e.g. 
OCS beri ‘take!’, with developing regularly from the PIE optative marker *-ih,- added 
to the thematic stem *-o-). The desiderative exists as such only in Vedic, but it left its 
traces in the forms of the future tenses in Old Irish, Lithuanian and elsewhere. It is unclear 
whether prohibition was expressed by using the imperative and the default negative par- 
ticle, as in OCS ne beri ‘do not take!’, or there was a special prohibitive negation used 
with some form of the verb other than the imperative, as in Gr. pi) (pepoiq ‘do not carry’ 
(with the prohibitive negation pi) and the optative, rather than imperative, form of the 
verb). The prohibitive negation *meh,- might be a dialectal form with reflexes in Greek, 
Indo-Iranian, Albanian and Armenian, but it might also be an archaism preserved in those 
branches only. Languages with special forms of prohibitive negation and non-imperative 
forms used in prohibitions are more common in Eastern Eurasia than in its western part 
(though there are exceptions on the extreme western periphery, such as Irish and Welsh; 
see WALS feature 71 A). 

On the whole, the PIE system of moods is not typologically unusual, except for one 
of its features: according to WALS (feature 73A), the morphological optative (i.e. the 
morphologically expressed mood whose primary function is expression of wishes) is 
very rare cross-linguistically, except in some linguistic areas. In the whole of Eurasia, 
the only such area is the Caucasus, where most languages have a morphological optative 
(Matasovic 2012). 

The PIE verb was characterized by three different stems: the present stem, the aorist 
stem and the perfect stem, cf. the following forms of the first person singular of the verb 
*leyk w - ‘leave’: *li-ne-k w -mi (present, formed with the nasal infix), *lik w -o-m (aorist), 
*(le-)loyk w -h,e (perfect), corresponding to Gr. 7.ip;idvco (an innovative thematic present 
formation, besides equally innovative ^eutco), e^mov, XeXouta, respectively. The imper- 
fect, which is formed from the present stem by adding the aorist endings and the past 
tense prefix *e- (the so-called augment) is probably an innovation of a group of Indo-Eu- 
ropean dialects. Its reflexes are attested in Greek, Indo-Iranian, Armenian and Phrygian, 
which also share other dialectal features. It is unclear what was the exact nature of the 
opposition between the verbal forms derived from the present stem and those derived 
from the aorist stem, but it is more likely to have been aspectual than temporal. Most 
linguists believe that the forms derived from the present stems were inherently imperfec- 
tive, while those derived from the aorist stem were perfective, and that this system was 
best preserved in Greek. It should be noted, however, that no traces of inherited aspectual 
oppositions exist in the Anatolian languages, which have a purely tense -based opposition 
between the present and the preterite. Moreover, the position of the perfect stem in the 
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system remains somewhat unclear. The perfect may have been, originally, a resultative 
form, expressing the resultant state of a previous action (Comrie 1998), cf. PIE *woydh,e 
(Skr. veda, Lat. uTdT), which is the perfect form derived from the root *weyd- ‘to see’ (cf. 
Skr. aor. avidam ‘I saw’). The perfect developed into a tense form in several individual 
languages, e.g. in Latin, where it merged with the aorist into a single category tradition- 
ally called “perfect”, and in Germanic, where it is called the “(strong) preterite”. A typo- 
logically similar opposition of three aspectual stems is found in Kartvelian languages, 
which distinguish between imperfect, aorist and present stems. Languages of the Semitic 
branch of Afro-Asiatic systematically oppose perfective and imperfective verbal forms 
with different person/number affixes, which is similar to the PIE distinction between 
primary (present) and secondary (aorist) verbal endings on the one hand, and perfect 
verbal endings on the other. Uralic and Altaic languages have tense-based rather than 
aspect-based verbal systems and generally distinguish between past and non-past tenses. 
Different tenses are usually formed by adding different suffixes to the verbal root, while 
the person/number suffixes tend to be the same (or very similar) in all tenses. 

The future tense cannot be reconstructed for PIE; it is most likely that the future events 
were referred to either by using the indicative present tense forms of verbs (as, e.g., in 
Gothic and in Welsh) or by using one of the subjunctive forms (as in Classical Armenian, 
where the aorist subjunctive is mostly used). In individual Indo-European languages, new 
future tenses were created, sometimes by re-interpretation of modal forms (e.g. in Latin 
or Sanskrit), sometimes analytically by using auxiliary verbs such as ‘want’, ‘become’ or 
‘have’, e.g. in Slavic (PS1. *peti xotjp ‘I want to sing’ > Croat, pjevat cu ‘I will sing’) and 
Germanic (Germ, ich werde singen, Eng. I will sing). Sometimes the auxiliaries became 
affixes on main verbs, and thus new synthetic forms of the future came into existence, e.g. 
in Romance (Lat. cantare habeo > Ital. cantard, Fr. chanterai). 

PIE apparently did not have a proper morphological passive. It is possible, of course, 
that there was some sort of analytical passive construction, but no clear traces of it have 
been preserved in the early Indo-European languages. The voice system of PIE is charac- 
terized by the opposition between active and middle, whereby the middle voice expresses 
that the subject is affected by the action of the verb, or that it benefits by it; cf., e.g., 
Gr. act. Xoxko Taq xeipou; xou JiaiSoq ‘I wash the hands of the child’ vs. mid. tamopoa 
mq X £ tP a ? 'I wash my hands’. Besides this primary function, the middle probably also 
expressed reflexivity (Gr. Xouopon ‘I wash myself’) and reciprocal action for some verbs 
(e.g. Skr. vadete ‘they (dual) talk to each other’). The middle is characterized by a spe- 
cial set of personal endings, which seem to be derived from the active endings, e.g. 3 sg. 
mid. aor. *-to vs. act. aor. *-t, 3 sg. mid. pres. *-tor vs. act. pres. *-ti. The similarity of 
the personal endings of the PIE middle to those of the perfect has long been noted, but 
there is no consensus about the explanation of this similarity. From the typological point 
of view, the PIE middle is similar to the grammatical category of “version” in Kartvelian 
languages, cf. Georgian (“subjective version”) v-i-c’er ‘I write (for myself)’ vs. (“neutral 
version”) v-c’er ‘I write’. 

CLAUSE ALIGNMENT 

All the early Indo-European languages have nominative -accusative clause alignment, 
with the partial exception of Hittite. Hittite has a split-ergative alignment, since neuter 
nouns receive the suffix -anza when they are the subjects of transitive verbs, but not when 
they are the subjects of intransitives or direct objects, cf. Hitt. (KUB XIV (23): nu KUR 
mv Hatti-ya apas ishan-anza arha namma zinnesta ‘and that murder moreover ended 
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the land of Hatti’. The common gender nouns in Hittite have the nominative -accusative 
alignment, with direct objects in the accusative case and all the subjects (of both transi- 
tive and intransitive verbs) in the nominative. A similar split-ergative system is attested 
in the Australian language Mangarrayi, where masculine and feminine nouns have the 
nominative-accusative alignment, while the neuters have the ergative alignment (Dixon 
2002: 508). However, most specialists agree that the ergativity in Hittite is unlikely to be 
inherited from PIE and that it is probably a dialectal development, perhaps under the areal 
influence of ergative languages such as Hattie and Hurrian. 

Among the modem Indo-European languages, ergative patterns developed in the Ira- 
nian languages of the Pamyr. Split-ergative systems also developed in some Indo-Aryan 
languages, such as Hindi, where clauses with perfective verbs have ergative alignment, 
while those with imperfective verbs have nominative -accusative alignment. 

Since early Indo-European languages are overwhelmingly nominative-accusative, 
it would be natural enough to assume, as many linguists did, that the same situation 
obtained in PIE. However, as early as 1901, Christian Uhlenbeck proposed to recon- 
struct the ergative alignment for the proto-language, and he was followed by a substan- 
tial minority of Indo-Europeanists, such as Vaillant (1936), Beekes (1995) and Kortlandt 
(2009), who adduced some new arguments for this hypothesis. 

The case for the ergative hypothesis is usually made by pointing out that it allows us 
to find a single explanation for a number of seemingly unconnected, and typologically 
somewhat unusual, facts about PIE morphology: 

1 . That PIE has - at least for the masculine and feminine nouns - a special marker for 
the nominative case (the nominative is usually the unmarked case in nominative -ac- 
cusative languages, as we saw above). 

2. That there is an unusual system of gender assignment, with many inanimate nouns 
belonging to the common (masculine or feminine) gender. 

3. That the ending of the nominative singular is similar to the ending of the genitive 
singular. 

4. That personal pronouns have very different case endings than nouns. 

5. That the stem of one demonstrative pronoun begins with *s- in the nominative sin- 
gular of the common gender, but with *t- in the oblique cases, as well as in all the 
cases of the neuter gender. 

The fact that only nouns denoting animates have the special nominative ending, while 
the nominative-accusative fonn of neuters is unmarked, may be explained if one assumes 
that the inanimates were seldom - if ever - subjects of transitive clauses in Early PIE. 
Hence, their zero-marked fonn is derived from the original absolutive, while the nomina- 
tive fonn of the animates is the original ergative. Those nouns denoting inanimates that 
belong to the common gender (e.g. the masculine noun *pods ‘foot’) might have the nom- 
inative ending because they generalized the ergative, rather than the absolutive, ending 
when the clause alignment pattern changed to the nominative-accusative system. More- 
over, the nominative ending *-s of the masculines and feminines is similar to the genitive 
ending of athematic stems *-es/-os, which appears to be its ablaut variant. The syncretism 
of the genitive case with the ergative is well attested in ergative languages, e.g. in Yup’ik 
(where the genitive is traditionally called relative) and in the NE Caucasian language 
Lak. The reason why there is a “leak” of inanimates into the “animate” gender (i.e. the 
common gender comprising the masculine and the feminine) might be that the masculine 
and feminine inanimates are precisely those that generalized their old ergative ending, 
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rather than their absolutive ending, as the new nominative. It has been established that the 
pattern in which pronouns have nominative -accusative alignment, while the nouns fol- 
low the ergative pattern, is quite common cross-linguistically, occurring, e.g., in several 
Aboriginal languages of Australia, including the well-described Pama-Nyungan language 
Dyirbal. Finally, the curious heteroclisy of the demonstrative stems may be explained 
if *so- is the original ergative case form, while *to- is the absolutive and oblique form. 
Since the neuters were seldom, if ever, in the ergative case, they generalized the absolu- 
tive stem *to- in all their case forms. Of course, all of these arguments in favour of the 
ergative hypothesis are very speculative, and there may be alternative explanations of the 
features of PIE morphology that it seeks to explain. Some linguists would even argue that 
there is nothing to explain, as the development of PIE morphology was subject to many 
complex factors, including phonological changes that we cannot reconstruct any more. 

It has also been argued that clause alignment in Early PIE was active (Lehmann 1974, 
1993, Gamkrelidze & Ivanov 1984, Drinka 1999, Bauer 2000). Languages with active 
clause alignment have two types of intransitive verbs: those expressing actions, such as 
‘run’ or ‘jump’, treat their single arguments morphosyntactically as the subject of tran- 
sitives (if the language has case, the only argument of such verbs would be in the nom- 
inative). On the other hand, the single arguments of stative intransitives (e.g. verbs like 
‘sit’ or ‘sleep’) are treated as the objects of transitives (if the language has case, the single 
argument of such verbs would be in the accusative). It has been proposed that traces of 
active morphosyntax can be found in constructions with “accusative subjects”, which are 
found in some early Indo-European languages such as Latin, where the verb paenitet ‘be 
discontent with’ takes the accusative {quemque ‘each man’): 

Suae quemque fortunae paenitet (Cicero, Ad familiares, 6.1.1) 

‘Each man is discontent with his lot.’ 

However, as has been argued elsewhere (Matasovic 2013b), constructions with such 
“quirky case -marking” can arise in nominative-accusative languages, and there are clear 
indications that such constructions in early Indo-European languages are innovations 
rather than archaisms. 

Ultimately, the question of clause alignment in Early PIE probably cannot be resolved; 
although the evidence that Early PIE had active clause alignment does not carry much 
weight, the arguments for ergativity are inconclusive, and the hypothesis that it was nomi- 
native-accusative, just as Late PIE was, is quite tenable (Villar 1983, Rumsey 1987). From 
the areal point of view, languages with active clause alignment are quite rare in Eurasia. 
One such language is Batsbi (also called Tsova-Tush), a NE Caucasian language spoken in 
Georgia, and Georgian also shows some features of active clause alignment. Nominative-ac- 
cusative languages predominate in the North and Northeast of Eurasia (notable exceptions 
are Chukchi in Eastern Siberia, and the isolate Burushaski in Pakistan). Ergative languages 
are found in the Caucasus (both NE and NW Caucasian languages are overwhelmingly 
ergative), and most languages of the Ancient Near East, including Hattie, Hurrian and 
Sumerian, were also ergative, as is Basque, spoken on the western periphery of Europe. 
The Afro-Asiatic, Uralic and Altaic families are consistently nominative -accusative. 

WORD ORDER, AGREEMENT AND COMPLEX CONSTRUCTIONS 

The earliest attested Indo-European languages, Hittite and Vedic Sanskrit, have the 
basic SOV order, and several linguists have concluded that the same order of syntactic 
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elements must have been regular in PIE (e.g. Lehmann 1974, 1993, Watkins 1976), cf. 
Skr. vis ah ksatriyaya balim haranti (villagers / to.prince / tax / they.pay) ‘The villagers 
pay taxes to the prince’. However, others have pointed out that many early Indo-Euro- 
pean languages have rather free word order (Friedrich 1975) and that in some of them, 
e.g. Ancient Greek, the SVO order actually predominates, at least statistically. The Insu- 
lar Celtic languages are VSO from their earliest attestations, early Gennanic languages 
and Latin are SOV, and Baltic and Slavic languages are mostly SVO. 

The basic order of subject, object and verb in PIE has been the centre of the contro- 
versy since many linguists believe that the relative order of other elements, such as nouns 
(N) and adjectives (Adj), demonstratives (Dem) and nouns, relative clauses (Rel) and 
head nouns, and possessors and possessums (usually represented as nouns and adposed 
genitives, Gen), can be predicted from it, in accordance with implicational universals 
posited by Joseph Greenberg ( 1 963, see also Dryer 1 992). Thus, it has been argued that in 
a SOV language we expect to find the orders Dem-N, Adj-N, Rel-N and Gen-N, and post- 
positions rather than prepositions, as we do in consistently head-final languages such as 
Turkish and Japanese. If PIE had all the other orders of syntactic elements hannonic with 
the SOV order, as claimed by Lehmann (1974) and others, including Gamkrelidze and 
Ivanov (1984), then the inherited construction is found in Hitt. E -ri anda ‘in the house’ 
(with a postposition) rather than in Lat. in casa (with a preposition). Moreover, we would 
expect that the PIE poetic fonnula ‘daughter of the sky’ is preserved in Ved. divo duhita 
(with the Gen-N order) rather than in Gr. Buycrcrip Aioq (with the N-Gen order), and the 
Vedic phrase aksiti sravas ‘imperishable fame’ (Adj-N) would be more archaic than the 
synonymous Gr. k\zo q acpBuov (N-Adj). 

However, subsequent work on word order regularities and universals (especially Dryer 
1992 and his contributions to WALS) has shown that some alleged word order universals 
cannot be confirmed if large enough samples of languages are used, and that some word 
order patterns accidentally co-occur in particular macroareas, but that there is no global 
implication relating them. For example, the order Dem-N is extremely common in Eur- 
asia (with only marginal exceptions, such as Basque and Insular Celtic, which have the 
opposite order), so it appears to be correlated with the order OV, which is also very com- 
mon in Eurasia. However, in other macroareas, the order OV co-occurs with the order 
N-Dem, and overall there is no implicational universal OV — > Dem-N. Therefore, if we 
assume that objects preceded verbs in PIE, it does not follow that demonstratives must 
have preceded nouns in the NP. Similarly, adjectives tend to precede nouns in Eurasia, but 
the order Adj-N is not correlated with the OV order globally, so we cannot infer it from 
the hypothesis that PIE was an OV language. 

Moreover, it is not entirely clear that we are right in attributing the SOV order to 
PIE in the first place. The order of syntactic elements can change very quickly in the 
history of a language, it is far less stable than clause alignment or agreement patterns, 
and it is strongly subject to substratum influences (Thomason 2001). Thus, it is entirely 
possible that Indo-Iranian acquired the SOV order through contact with some language 
of the SOV type (both Dravidian and Elamite are basically SOV), and the same goes for 
the Old Anatolian languages (Hurrian and Akkadian are both quite consistently SOV 
languages). 

No single infinitive form can be reconstructed for PIE (Disterheft 1980), although 
Indo-European languages usually have it (with the exception of Insular Celtic). Infinitives 
in early Indo-European languages are clearly derived from petrified case -marked forms 
of abstract verbal nouns. For example, Latin infinitives in -ere, -are, -ire are originally 
locative singular forms of 5-stems (with rhotacism, the change of *s > r between vowels, 
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which is regular in Latin). In Classical Sanskrit, the infinitive in -turn (e.g. bhartum ‘to 
carry’) is the accusative singular of the verbal noun derived with the suffix *-tu- (identi- 
cal to the supine in Slavic, cf. OCS bratb vs. the infinitive brati ‘carry’). There are also 
no subordinating conjunctions (subordinators) that can be posited for PIE, so apparently 
the proto-language did not have constructions of the English type 1 see that he is coming 
(with the subordinator that ) or I want him to come (with the infinitive to come). Rather, 
it probably used a number of subordinating strategies involving participles (the English 
type I see him coming) or paratactic instead of hypotactic constructions (7 see it, he is 
coming). Many early Indo-European languages have the so-called absolute constructions 
with participles, which are constructed with one of their arguments in a specific case, 
e.g. the ablative absolute construction in Latin, cf. Sole onto UolscT se corcumudllatos 
uiderunt (Livius, IV.9.13) ‘when the sun was risen, the Volscians saw that they were sur- 
rounded by lines of intrenchment’ (with the expression sole onto in the ablative absolute). 
However, the cases used in the absolute constructions do not agree: we find the genitive 
absolute in Greek and the dative absolute in Old Church Slavic, cf. Gr. toutcov ^exfievicov 
dveaieaav ‘this said, they rose’ (Xenophon, Anabasis 3.3.1, with toutcov ^exfievTcov in 
the genitive plural), OCS narodonrb ze sbbirajqstenib sq nacqtb glagolati ‘when the peo- 
ple assembled, he started to speak’ (Luke, 1 1 .29, with narodonrb . . . s-bbirajqstem-b in the 
dative plural). Thus, it is unclear whether a single absolute construction can be posited 
in PIE. 

The use of infinite verbal forms, rather than finite subordinate clauses, is actually the 
more common strategy of clause combining in Eurasia. Uralic and Altaic languages 
(except those strongly influenced by Indo-European) usually do not have subordinators 
and finite subordinate clauses, and the same holds for both North Caucasian families. 
The use of finite subordinate clauses is more typical of various branches of Afro-Asiatic, 
especially the Semitic languages. 

Two different forms of relative pronouns, *(H)yo- (> Skr. yas, yd, yad) and *k w o- (> 
Lat. quT, quae, quod), can be reconstructed, and they either represent dialectal forms or 
were used in different types of relative clauses (Hajnal 1997). One type of relative clause 
that is well attested in the early Indo-European languages, and that certainly goes back 
to a PIE prototype, is the so-called correlative construction, equivalent to the English 
He/the man who sees the woman, he is a warrior (rather than He/the man who sees the 
woman is a warrior). This is the default relative construction in Hittite and Vedic, and it 
can be safely posited for PIE, cf. Hitt, nu kas kuit memai n-at zik sakti (ptcl. / this. one / 
what / says / ptcl. -it / you / know) ‘You will know what this one says’. Overall, inflected 
relative pronouns as a device for introducing relative clauses are typologically rather 
rare. Most language families of Eurasia use uninflectable relative clause markers (e.g. 
Hebrew ? aser ‘such that’) or participles rather than finite relative clauses (e.g. the North 
Caucasian languages and most Altaic languages). However, inflectable relative pronouns 
are found in Kartvelian, cf. Georgian vine ‘he who’. 

PIE had extensive agreement on both the clause and NP levels. On the clause level, 
verbs agreed with their subjects in person and number. An exception to this is the so-called 
TOi ijroa Tpe%ei construction, attested in Greek and Indo-Iranian: the neuter plural (ra tjba 
‘animals’) triggers singular agreement on verbs (Tpe^et ‘runs’ is the third person singular 
present). This construction is generally believed to be archaic, and it shows that neuter 
plurals behaved as collective nouns in PIE, i.e. that they were singular for purposes of 
agreement. On the level of the NP, adjectives, pronouns and some numerals agreed with 
the head nouns in case, gender and number, cf. Lat. hie bonus puer ‘this (m.) good (m.) 
boy’ vs. haec bona puella ‘this (f.) good (f.) girl’. This type of adnominal agreement is 
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typologically rather rarer than clause-level agreement. Areally, it is more common in 
South and Southwestern Eurasia, typically in such language families as Afro-Asiatic, 
Dravidian and NE Caucasian. The Altaic, Uralic and NW Caucasian families typically 
lack adnominal agreement, except in those languages that underwent extensive influences 
from languages with this agreement pattern (for example Finnish, where case agreement 
is usually considered to be the result of the areal influence of Indo-European languages). 
Although case and number, as grammatical categories, appear to be old in PIE, agreement 
in case and number (and perhaps in gender) may be of more recent origin. There are 
archaic traces of uninflectable adjectives in PIE (e.g. the prefixed forms *h[SU- ‘good’ and 
*dus- ‘bad’), and demonstrative pronouns that have different case/number/gender forms 
seem to be derived from uninflectable deictic particles. For example, the pronoun *so- / 
*seh 2 - / *to- (Skr. sas, sa, tad) seems to be etymologically related to Hittite uninflectable 
sentence particles -su and -ta (Matasovic 2013a). 

As Meillet (1937) noted, the difference between transitive and intransitive verbs in 
PIE appears to have been slight, as most verbs taking two arguments (bi-valent verbs) 
could be used with one argument only; e.g. verbs such as ‘to love’ and ‘to hate’ could 
be used without the undergoer argument, as in Catullus’ verse Odi et amo ‘I love and 
I hate’. The government relation between the verb and its arguments was probably 
loose and subject to semantic, more than strictly syntactic, principles, as verbs could 
govern their arguments in different case forms with slightly different meanings. For 
example, the Greek verb kAuco ‘to hear’ (< PIE *klew-) could govern the accusative 
(Homer’s skAuov audijv ‘They heard a voice’), the genitive (ekAdov auiou ‘They heard 
(it) from him’) and the dative (euxopevoi pot skA,uov ‘They obeyed my prayer’, lit. 
‘they obeyed to me praying’). No valence-decreasing morphosyntactic operation, such 
as passive, can be reconstructed in PIE. On the other hand, there is a causative con- 
struction, expressed by a verb form marked by the o-grade of the root and the suffix 
*-eye-, which is a valence-increasing operation, cf. PIE *mon-eye- > ‘cause to remem- 
ber, admonish’ > (Lat. moneo) from the root *men- ‘remember’ (cf. Lat. memim ‘I 
remember’, which is an old perfect). 

CONCLUSION 

In a general survey of the areal typology of PIE, Johanna Nichols (2007) found evidence 
that PIE shared more areally diagnostic features with languages of Northern Eurasia 
than with languages of Southern Eurasia. These features would include the nominative- 
accusative clause alignment, the lack of polypersonal verbs, the suffixes as person markers 
on verbs and the low index of verbal inflectional synthesis. However, she acknowledged 
the presence of some features linking PIE with languages of South Eurasia, including 
three series of stops and the presence of gender. A later study (Matasovic 2012) argued 
that PIE shared some typologically rare features with languages of the Caucasus, includ- 
ing the presence of a morphological optative and a heteroclitic inflection of nouns, while 
pointing out that some features of PIE are rather rare in Northern Eurasia (e.g. nominal 
agreement and the high consonant-to-vowel ratio). Several features that could potentially 
link PIE with the Caucasus as a linguistic area are disputed, in that they are not gener- 
ally reconstructed in the proto-language, e.g. the glottalized stops, the ergative clause 
alignment and the opposition of exclusive and inclusive pronouns. Hence, it is difficult 
to reach firm conclusions about areal-typological affinities of PIE, and it is probable that 
no consensus about these matters will be attained until substantial progress is made in the 
reconstruction of the proto-language. 
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FURTHER READING 

There are several good introductions to linguistic typology and language universals, e.g. 
Comrie 1989, Whaley 1997, Croft 2003 and Velupillai 2012, and one can also learn a lot 
about typology and the areal distribution of typologically relevant features by reading the 
chapters on linguistic features in WALS. The typological profile of PIE is given in an excel- 
lent overview by Bernard Comrie (1998). A (rather too radical) critique of the application 
of typological methods to PIE reconstruction can be found in Haider 1985. The problem 
of areal-typological affinities of PIE is discussed in Nichols 2007 and Matasovic 2012. 
On the possible deep genetic relations of PIE the literature in English is scant; besides 
the rather speculative monograph by Greenberg (2000), there is a joint volume edited by 
Salmons and Joseph (1998), which offers an overview of the Nostratic hypothesis. 

Many controversial features of the reconstructed PIE are treated from the areal and 
typological points of view in the large monograph by Gamkrelidze and Ivanov (1984). 
Typological arguments relevant for the phonetic interpretation of the PIE phonological 
system are discussed in the papers assembled in Vennemann 1989 and in the monograph 
by Joe Salmons (1994). The syntax of PIE is treated from the typological point of view 
by Lehmann (1974, 1993), but most of the conclusions of both books are highly con- 
troversial. Several articles about the typology of certain PIE syntactic constructions are 
contained in the volume edited by Paolo Ramat (1980). The category of gender in PIE 
is typologically and areally compared with other gender systems in Eurasia by Mataso- 
vic (2004), and the typology of the verbal categories of tense and aspect in PIE and its 
descendants is the subject of Hewson and Bubenik 1997. 
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PART 3 


ANATOLIAN 

H. Craig Melchert 


INTRODUCTION 

The Anatolian subgroup as a whole 

The Anatolian subgroup of Indo-European is attested from the middle of the second mil- 
lennium BCE to approximately the first and second centuries CE. There are no modem 
representatives of the family. The evidence ranges from the extensive records for Hittite 
through the significant but progressively smaller text corpora in Luwian, Lycian, Lydian, 
Carian, and Palaic to the mere handful of inscriptions in Sidetic and Pisidian. We have 
Hittite, Palaic, and Luwian texts from the second millennium in the cuneiform writing sys- 
tem borrowed from Mesopotamia and Luwian inscriptions in a native Anatolian system of 
hieroglyphs from the late second and early first millennium BCE. After a significant break, 
the remaining languages from the late first millennium are written in alphabets closely 
related to or derived from that of Greek. The great disparity in the amount of evidence for 
the respective languages and the nature of the cuneiform and hieroglyphic writing systems 
limit to a serious extent our ability to make assured generalizations about features of the 
Anatolian family. These limitations will be addressed explicitly wherever appropriate. 

Like other such designations (Celtic, Germanic, etc.), “Anatolian” as used here is pri- 
marily a linguistic concept. The languages named above are all assumed to be derived 
from a reconstructed stage “Proto-Anatolian” (PAnat.), comparable to Proto-Celtic, 
Proto-Germanic, and so forth, defined by certain unique shared innovations. While most 
of the languages of the family are in fact attested in Anatolia (Asia Minor), Luwian 
was also spoken in parts of Syria. On the other hand, Phrygian, although it is an Indo- 
European language attested from west central Anatolia, is not a member of the Anatolian 
subgroup as defined here. 

The dearth of evidence for certain linguistic categories beyond Hittite and Luwian, and 
especially our poor understanding of Lydian grammar, reduces the number of assured 
common innovations that define Proto-Anatolian, that is, that differentiate the Anatolian 
subgroup from the rest of the Indo-European languages and from Proto-Indo-European. 
Nonetheless, one may cite a few such defining features: (1) Proto-Indo-European 
voiceless stops were “lenited” (probably simply voiced) between unaccented morae in 
Proto-Anatolian (for this formulation see Adiego Lajara 2001 and the discussion below 
under phonology); (2) Proto-Anatolian created a pronominal stem *ob h o/i-, which defi- 
nitely functioned as the accented third person anaphoric pronoun (see Melchert 2009a: 
156-159 with references, and below on its deictic use); (3) Proto-Anatolian developed 
third person subject enclitic pronouns for “unaccusative” verbs (see Garrett 1996, and 
below under pronouns). For a longer list of Proto-Anatolian innovations, including some 
more arguable features, see Melchert forthcoming, section 6.3.1. 

I follow the majority viewpoint that the ancient Indo-European languages of Anatolia 
are intrusive to Asia Minor, resulting from in-migrations from somewhere to the north, 
via either the Balkans or the C aucasus (see further Mallory 2009 and Melchert 2011a with 
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references to dissenting views). Since neither the route nor the timing of the supposed 
entry into Anatolia can be determined with any precision, it remains an open question 
whether the prehistoric stage labeled Proto-Anatolian was spoken in Anatolia. Neverthe- 
less, in the absence of contrary evidence, we may make the default assumption that the 
differentiation of the attested Anatolian languages was mostly associated with the spread 
of Indo-European speakers through Asia Minor. 

There is solid evidence for extensive prehistoric contact between Hittite and Luwian 
and for less intensive contact between Luwian and both Lydian and Palaic (Yakubovich 
2010: Chapter Three). We may safely assume some similar contact between Lycian and 
Carian. Under these circumstances, attempts to describe the relationships of the Anatolian 
languages to each other in terms of divergence by a “family tree’’ model (e.g., Oettinger 
1978) are less than satisfactory. Use of dialect geography presents a more realistic pic- 
ture (see Melchert 2003 for detailed justification of the description give here). Luwian 
and Lycian undeniably share a number of innovations that justify a “Luwo-Lycian” or 
“Luwic” dialect group, and it is likely, but not strictly provable, that Carian belongs to 
this group as well. A subset of these innovations is also shared by Lydian (most notably 
so-called /-mutation, described below under nominal morphology), so that one may also 
speak of “western” dialects of Anatolian. On the other hand, Luwian also shares several 
phonological innovations with Hittite and Palaic. Since these are of a “shallower” nature 
than some of the features common to Luwian and the other western dialects, they might 
be regarded as chronologically later than the latter, but we hardly have enough facts to 
assert this with confidence. What does seem clear is that Luwian plays a central role in 
terms of shared isoglosses, as one might expect from its geographic position. 


The individual languages 

Hittite 

The language we call Hittite was the chief administrative language of the kingdom and 
subsequent empire centered on the capital Hattusha in central Anatolia (see Map 3.1), 
which flourished from the sixteenth to the end of the thirteenth century BCE. The Hit- 
tites themselves called the language nesili, after the city of Nesha (also Kanish), to the 
southeast on the Marassantiya River (= classical Halys, modem Kizihrmak). The original 
locus of Hittite speakers was thus probably an area along the upper course of the Halys. 
They subsequently became the ruling class in Hattusha and in the Hittite kingdom, but the 
extent to which Hittite functioned as a spoken language beyond this group is impossible 
to determine. After 1350 BCE Hittite was likely in serious competition with Luwian as 
the spoken language even among this elite (Yakubovich 2010: Chapter Five). 

Documentation of Hittite is extensive and includes a wide variety of genres: annals, 
treaties, letters, the state cult, therapeutic rituals, myths, and more. Our understanding 
of their content and of the grammar is good, but certain aspects of the phonology are 
obscured by limitations of the cuneiform writing system. As adopted by the Hittites, this 
mixed logographic-syllabic system (compare Japanese) has phonetic signs for V, CV, 
VC, and CVC sequences. Word-initial or word-final strings of two or more consonants 
and word-internal strings of three or more can only be spelled with sequences containing 
“empty” vowels: e.g pai-ah-ta = /parxta/ ‘chased’. The system also does not fully dif- 
ferentiate e- and /-spellings. The resulting ambiguities leave much room for debate about 
the phonological interpretation of the orthography. These remarks naturally also apply to 
Palaic and to Luwian written in cuneiform. 
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MAP 3.1 ANATOLIA IN THE LATE BRONZE AGE 


Palaic 

Palaic was the language of the land of Pala, almost certainly located to the northwest 
of the lower Halys river (Map 3.1). It is attested in a few fragmentary cuneiform texts 
from Hattusha (sixteenth-thirteenth centuries BCE), all dealing with the Palaic state cult, 
which was heavily influenced by the religion of the Hattians, a non-Indo-European peo- 
ple who dominated north and central Anatolia before the Hittites. Palaic is arguably the 
most conservative Anatolian dialect, but our evidence for it is severely limited. For fur- 
ther on Palaic see Carruba 1970. 


Luwian 

Luwian is attested in cuneiform from Hattusha (sixteenth-thirteenth centuries) and in 
a hieroglyphic script native to Anatolia. A few hieroglyphic inscriptions date from the 
last Hittite kings (thirteenth century), but most hieroglyphic texts (monumental inscrip- 
tions and a few letters and economic documents on lead strips) date from the Iron Age 
(eleventh-eighth centuries) after the fall of the Hittite Empire, the products of various 
small city states and kingdoms in southern Anatolia and Syria. Traditionally, one has 
referred simply to Cuneifonn and Hieroglyphic Luwian according to the writing system 
used (as followed here for convenience), but Yakubovich (2010: Chapter 1) has shown 
that one must distinguish between two dialects: (1) Kizzuwatna Luwian, the language of 
the incantation portions of Luwian rituals originating in southwestern Anatolia (Map 3.1), 
but as redacted by cuneiform scribes in Hittite context in Hattusha (with a few exceptions, 
effectively those texts collected in Starke 1985); and (2) Empire Luwian, a koine devel- 
oped in and promulgated from Hattusha starting in the thirteenth century, continued in the 
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later Iron Age hieroglyphic texts. The “Luwianisms” scattered through Hittite cuneiform 
texts of the fourteenth and thirteenth centuries also belong to this dialect. 

The Iron Age Luwian texts provide our richest documentation for an Anatolian lan- 
guage outside of Hittite, but the logographic-syllabic hieroglyphic script is even more 
problematic for phonological interpretation than is cuneifonn. The syllabary has only V 
and C V signs, so that even word- final consonants must be written with an “empty” vowel, 
and many of the CV signs do not distinguish /a/ and /i / vocalism. These and other prac- 
tices produce spellings such as wali+rali-ya-ya for what is probably /warriyai/ ‘helps’. 
Luwian written in cuneifonn helps resolve some of the resulting ambiguities, but by no 
means all. The Iron Age hieroglyphic texts are masterfully edited by Hawkins (2000). 



MAP 3.2 ANATOLIA IN THE IRON AGE 
Lycian 

Lycian is attested in close to two hundred inscriptions, nearly all on stone, from various 
sites in classical Lycia in southwestern Asia Minor (Map 3.2), dating from the fifth and 
fourth centuries BCE. These texts, written in a fonn of the Greek alphabet, are nearly all 
funerary in nature, with the exception of a few decrees and the “stele of Xanthos”, a long 
but poorly understood text describing the military exploits and building activities of one 
dynastic family. Our understanding of Lycian grammar was significantly improved by 
the discovery in 1973 of a decree establishing a religious cult written in Lycian, Greek, 
and Aramaic versions: see on the Lycian text of the “Letoon Trilingual” Laroche 1979. 

A portion of the Xanthos stele text and one other inscription are written in a different 
dialect known conventionally as Milyan or “Lycian B” (the latter as opposed to the 
ordinary “Lycian A”). While the sociolinguistic relationship of the two forms of Lycian 
remains unclear, there is no doubt that they are dialects of the same language, despite 
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some claims to the contrary. Lycian A does show some unique phonological changes 
(*s > h except next to consonants; *k w > t before front vowels), but it shares important 
features with Lycian B that separate both from Luwian: the change of *o to e, massive 
syncope of unaccented vowels, and widespread regressive vowel assimilation, among 
others (for arguments that Lycian is not a dialect of attested Luwian, see also Gusmani 
1960). 


Lydian 

Lydian was the language of the classical kingdom in west central Anatolia with its capital 
at Sardis (Map 3.2). There are now more than a hundred extant Lydian texts, but only 
a couple dozen are of significant length and reasonably preserved, all dating from the 
fifth and fourth centuries, thus roughly contemporaneous with our Lycian corpus. In the 
absence up to now of a Lydian-Greek bilingual text of sufficient length, our comprehen- 
sion of Lydian lags far behind that of the other Anatolian languages described above, 
even Lycian. While we can largely parse the grammatical structure of most textual pas- 
sages, many details still elude us, and our grasp of the lexicon is especially poor. As a 
result, we must be duly cautious in claiming that certain features are pan- Anatolian, since 
we do not know their status in Lydian. For all aspects of Lydian see Gusmani 1964 and 
subsequent supplements, especially 1986. 


Carian, Pisidian, and Sidetic 

Carian was the language of classical Caria, in southwestern Anatolia between Lycia 
and Lydia (Map 3.2). A few extant texts, tentatively dated to the fourth and third cen- 
turies BCE, have been found in Caria itself, but most Carian texts, consisting of tomb 
inscriptions and graffiti of Carian mercenaries, are from Egypt, dating from the seventh 
to fifth centuries. Thanks to the discovery of a Carian-Greek bilingual text in 1986 (see 
Frei & Marek 1987), we can now be sure that Carian does belong to the Anatolian family 
of Indo-European, and it appears to share some features of the “Luwo-Lycian” dialect 
group. Fiowever, our understanding of Carian grammar and vocabulary remains limited, 
and the language cannot be regarded as fully deciphered - in particular we cannot yet 
analyze Carian verbal morphology, or even identify verbal forms, with confidence. Car- 
ian will thus play a very small role in the description of Anatolian grammar that follows. 
See on all aspects of the language Adiego 2007. 

Some ten inscriptions have been discovered in Sidetic, the language of Side, a major 
city on the coast of Pamphylia (Map 3.2), dated very approximately to the third to second 
centuries BCE. Those consisting of more than personal names are largely unintelligible, 
and we can at present affirm only that the language is Anatolian Indo-European, belong- 
ing to the “Luwo-Lycian” dialect group. See Nolle 2001: 625-646 for the texts, but for 
revised readings of some letters and a tentative grammatical sketch also Perez Orozco 
2007. For Pisidian (Map 3.2) we have a mere handful of tomb inscriptions consisting 
of only names and patronymics, dated tentatively to the first and second centuries CE. 
These again can tell us only that the language is a “Luwo-Lycian” dialect of Anatolian. 
The very sparse evidence for both Sidetic and Pisidian leaves open the possibility that 
they represent late dialects of Luwian in the narrow sense. See Brixhe 1988 for the then 
published texts in Pisidian. We will have nothing more to say about Sidetic and Pisidian 
in what follows. 
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PHONOLOGY 

Vowels 

There was very little change in the vowels from Proto-Indo-European to Proto-Anatolian. 
Evidence that *o merged with *e in Lycian instead of with *a as in the rest of Anato- 
lian shows that Proto-Anatolian still maintained the PIE system of five contrasting vowel 
phonemes, both short and long (Rasmussen 1992 and Melchert 1992a, upheld by Hajnal 
1995: 90-97): e.g., PIE/PAnat. *pedom > Lyc. pdde ‘place’, PIE gen. pi. *-om > PAnat. 
*-om > Lyc. -e. As reflected in the second example, unaccented long vowels inherited 
from Proto-Indo-European were shortened in Proto-Anatolian (Eichner 1973: 79 and 86 xx ). 
A PIE tautosyllabic sequence *eh[ became a Proto- Anatolian long vowel that merges with 
inherited long *e in Hittite but with a in Luwian and Lycian (represented here convention- 
ally as PAnat. *te): e.g., *d h eh 1 ti > *d<eti > Hitt, tezzi ‘says’ but Lyc. tadi ‘puts’. Anatolian 
thus uniquely distinguishes between PIE *e and the result of tautosyllabic *ehj. 

Since Hittite maintains the PIE short diphthongs *ay and *oy as /ay/ and PIE *aw and 
*ow as /aw/ before coronal consonants (see Kimball 1994), these must have still been 
preserved at least in this environment in Proto-Anatolian. In those cases where PIE *aw 
and *ow became monophthongs in Hittite (and undoubtedly also Palaic and Luwian) 
they appear as a new vowel /o(:)/ distinct from inherited *5, which has merged with *a 
as /a:/. 1 As shown by Kloekhorst (2008: 35-60), vindicating the earlier claims of several 
scholars, the new /o(:)/ phoneme is spelled with the sign u in Hittite, versus /u(:)/ spelled 
with the sign u. Prehistoric *u is also regularly lowered to the new /o/ vowel next to 
*h 2 or *h 3 . PAnat. long *ay and *oy are reflected in Hitt. /a:y/, and PAnat. long *aw and 
*5w in Hitt. /a:w/ (spelled Ca-a-ul). The precise timing, conditioning, and outcome of 
the monophthongization of PIE *ey and *ew and *ey and *ew remain under debate (but 
contra Melchert 1994a: 56 and passim there is no basis for supposing that the reflex of 
*ey in Proto-Anatolian or Hittite was distinct from that of *e). 


Consonants 

Evidence from Luwian and Lycian shows that Proto-Anatolian preserved the PIE three- 
way contrast of dorsal stops, at least for the voiceless series *k, *k, *k w . However, the 
claim in Melchert 1989: 23-31 for an unconditioned contrast in Luwo-Lycian must be 
abandoned. A reassessment of all available evidence confirms that Anatolian is a “centum” 
subgroup that keeps labiovelars distinct while eventually merging the “palatal” (more 
accurately front velar) and velar series as velars (see for full argumentation Melchert 
2012a). Before the latter merger, however, there was a conditioned palatalization in 
Luwo-Lycian of just the front velar series in fronting environments (*e, *i, *y and *w): 
contrast, e.g., *keyori > CLuw. zivar(i) and Lyc. sijeni Ties’ with *kesaye/o- > CLuw. 
kisa(i)- ‘to comb’, but *koto- > HLuw. /kata-/ ‘enmity, opposition’ and CLuw. kattawat- 
nalla/i- ‘opponent, enemy’ (cognate with Gr. koto:; ‘spite’ and Skr. satru- ‘enemy’). The 
facts for reflexes of the voiced dorsal stops are compatible with this analysis, but strict 
proof is prevented by the widespread loss of medial voiced dorsal stops in Luwo-Lycian. 

There are two major changes in the stop system from Proto-Indo-European to Proto- 
Anatolian. First, in the absence of any compelling examples of differing reflexes, we may 
and should assume that the PIE voiced aspirated stops merged with voiced stops. Second, as 
already mentioned above, PIE voiceless stops are “lenited” (or rather simply voiced) between 
unaccented morae: e.g., PIE *d h eh 1 ti > PAnat. *d<eti (i.e., *dceasti) > *dcfeaedi > Lyc. tadi 
‘puts’, PAnat. abl.-instr. *-oti > *-odi > Luw. /-adi/ and Lyc. -edi. For this unified fonnulation 
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as a single rule see Adiego Lajara 2001, combining the two rules posited by Eichner (1973: 
79-83 and 100") and Morpurgo Davies (1982/83). Evidence for this rule is more plentiful in 
Luwo-Lycian than in Hittite, where most of its effects were undone by analogy. 

Before leaving the stops, we should note that Hittite scribes did not employ the cune- 
iform stop+vowel signs such as ta and da to indicate the phonemic contrast between 
voiceless and voiced stops. In intervocalic position they spelled inherited voiceless stops 
as geminates, while inherited voiced stops and those produced by the change just cited 
were spelled as single stops: thus both pe-e-ta-an and pe-e-da-an = /pe:dan/ ‘place’, 
contrasting with (i-)ya-at-ta(-ri) and i-va-ad-da(-ri) = /yata(ri)/ ‘goes’. This practice has 
aroused much discussion regarding the synchronic distinctive features of the Hittite stops 
(see Melchert 1994a: 13-21 and compare the very different view of Kloekhorst 2008: 
21-25), but for the purposes of Anatolian as a whole we may continue to speak of a 
voiceless vs. voiced contrast. 

There were no major changes in the PIE sibilant *s or the sonorants *m, *n, *r, *1, 
*w, and *y in Proto-Anatolian. The syllabic liquids and nasals eventually appear in the 
attested languages mostly as aR but occasionally as uR instead (e.g., HLuw. /thimid-/ 
‘horn’ < *krngid- cognate with Hitt, karkidant- ‘homed’), but they were likely still syl- 
labic sonorants in Proto-Anatolian. As part of a widespread areal feature, word-initial 
r- is lacking in the attested Anatolian languages, except for extremely rare secondary 
instances in Luwian, Lycian, and Lydian. 

As for the PIE “laryngeals”, *h 2 appears in the cuneiform languages as h (geminate hh 
intervocalically, thus voiceless or fortis), probably a velar fricative /x/: e.g., Hitt, hanz(a) 
‘in front’ < PIE *h,enti (= Lat. ante ‘in front’ etc.) and CLuw. hantili- ‘foremost, first’, 
Hitt, newahh- ‘make new’ < PIE *neweh 2 -. This result was subject to the Proto-Anatolian 
voicing rule described above, whence CLuw. 1 sg. pret. aha and Lye. aga ‘I made’ < 
PAnat. *y£eya < *yeh 1 -h,e versus regular CLuw. -hha and Lye. -xa < PAnat. *-xa. As 
shown by Kloekhorst (2006: 97-101), a sequence of *h 2 w developed into a PAnat. unitary 
labialized fricative /x w /: e.g., Hitt. /tarx w -/ ‘conquer’ spelled tar-hu- and tar-uh- (parallel 
to e-ku- and e-uk- for /eg w -/ ‘drink’). In Lycian the labialized fricative turned into a labio- 
velar stop, represented by q (contra Melchert 1994a: 305-307), as shown by the name of 
the Storm-god, trqqht- < *trh,wnt-. PIE *h 2 was lost in word-final position and before *v 
and underwent assimilation in certain environments (cf. Melchert 1994a: 68-71). 

PIE *h 3 is reflected in word-initial position by h- or zero in Hittite and Luwian, under 
conditions that remain much debated: compare Hitt, happar ‘transaction’ < *h 3 epr or *h 3 opr 
(cognate with Lat. opus ‘work’ etc.) but Hitt, arta ‘stands (up)’ < *h 3 (e)rtor. Contrary to 
previous opinion, Kloekhorst (2008: 838 and 946) has demonstrated that medial *h 3 is 
preserved in the immediate environment of sonorants: CLuw. tatarh- ‘break’ < *terh 3 - (Gr. 
rpcbco ‘wound) and Hitt, walh- ‘strike’ < *welh 3 - (Gr. saAcov ‘perished’). For the suggestion 
that a sequence *h,w is preserved in Hittite as -hw- (/y w /) parallel to *h,w > -hhw- = /x w / in 
lahw- ‘pour’ (cognate with Gr. Aoeco ‘bathe’ etc.) see Melchert 2011b. 

Aside from secondary effects such as compensatory lengthening or assimilation, there 
is no compelling evidence for any direct reflex of PIE *h, in Anatolian, pace Kloekhorst 
2004 and 2006: 77-81 (see Melchert 2010: 152-153 and Weeden 2011: 61-68). 

Accent 

The results of the PAnat. “lenition” or voicing of stops and *h, cited above suggest that the 
position of the inherited PIE accent was mostly retained in Proto-Anatolian. For some excep- 
tions see Melchert 1994a: 89. For some instances of mobile accent in Hittite noun paradigms 
see Melchert 2012c: 180. Eichner (1986) established the synchronic accent of Lydian. 
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NOMINAL MORPHOLOGY 
Nominal inflection 

The attested Anatolian languages show only two numbers, singular and plural, the latter 
distinguishing in Old Hittite a count plural versus a collective (Melchert 2000 after Eichner 
1985). Possible traces of a dual are sparse and limited to the nominative-accusative. As is 
well known, Anatolian also has only two attested grammatical genders, animate (usually 
labeled common) and neuter. It is a matter of great controversy whether Anatolian inher- 
ited the feminine gender from Proto-Indo-European and lost it (as, e.g., Annenian did) or 
whether its absence reflects an archaism not shared by the rest of Indo-European. One thing 
that is now clear is that, contrary to previous claims (Oettinger 1987, Starke 1990: 85-89, 
Melchert 1994b), the phenomenon of “/-mutation” in the western Anatolian languages is 
not a reflex of the PIE feminine gender. As first identified and described by Starke (1990: 
54—85), in Luwian, Lycian, and to a lesser extent Lydian many (though by no means all!) 
nominal stem classes show an inserted -/- between the inherent stem and the endings of just 
the common gender nominative and accusative. In the case of stems in -a- (which mostly 
but not exclusively continue PIE o-stems) the -i- replaces the stem-final -a-. The paradigm 
of the Cuneifonn Luwian synchronic /-stem adduwal- ‘evil’ is nom. sg. comm, adduwalis, 
ace. sg. comm, adduwalin, nom.-acc. sg. n. adduwal(=za), nom. pi. comm, adduwalinzi, 
ace. pi. comm, adduwalinz(a), nom.-acc. pi. n. adduwala, dat. pi. adduwalanz(a), abl.-instr. 
adduwalati. As argued by Rieken (2005b), this pattern arose due to particular formal changes 
in the paradigms of some ablauting /-stems and was then analogically extended. This feature 
tells us nothing about the status of the feminine grammatical gender in Anatolian. 

Old Hittite has at least eight cases: nominative, vocative, accusative, genitive, 
dative-locative, allative, instrumental, and ablative. The vocative is replaced by the nomi- 
native, the allative by the dative-locative, and the instrumental by the ablative by the time 
of New Hittite. The common gender nominative and accusative plural also merge, mostly 
in favor of the original accusative form. The allative (distinct from the dative-locative 
only in the singular) expresses goal with verbs expressing translocation (see Hoffner & 
Melchert 2008: 263 for examples). Its existence already in Proto-Anatolian is made likely 
by its use to form infinitives in multiple languages. It is indeterminate whether it is an 
inheritance from Proto-Indo-European or an innovation of Proto-Anatolian. Likewise, 
we cannot at present determine whether the merger of the dative and locative singular 
that took place in Hittite had already happened in Proto-Anatolian. It is also a matter of 
debate whether the “ergative” was a fully grammaticalized case in Proto-Anatolian (see 
most recently Goedegebuure 2012). 


TABLE 3.1 ANATOLIAN NOMINAL ENDINGS 



Proto-Anatolian 

Hittite 

Palaic 

Cuneiform 

Luwian 

Hieroglyphic 

Luwian 

Lycian 

Lydian 

Carian 

nom. sg. comm. 

*-s 

-s 

-s 

-s 

-s 

zero 

-s 

zero 

acc. sg. comm. 

*-m 

-n 

-n 

-n 

-n 

-V 

-V 

-n 

voc. sg. 

zero 

zero 

9 

zero 

zero 

? 

? 

? 

nom.-acc. sg. n. 

zero 

zero 

zero 

zero 

zero 

zero 

-d 

? 

?erg. sg. 

*-nts 

*-nts 

? 

-ntis 

-ntis 

-Vti 

? 

? 

gen. sg. 

*-os 

-as 

-as 

— 

-as 

-Vhe 

— 

-s 

dat.-loc. sg. 

? 

-i 

-il-a 

-it -a 

-i/-a 

-H-e 1 


-s 1 

all. sg. 

*-a 

-a 

-a 

-a 

-a 

-a 

? 

? 
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Proto- Anatolian 

Hittite 

Palaic 

Cuneiform 

Luwian 

Hieroglyphic 

Luwian 

Lycian 

Lydian 

Carian 

nom. pi. comm. 

*-es 

-es 

-es 

-nzi 

-nzi 

-Vi 

? 

-s’ 

acc. pi. comm. 

*-ms 

-us 

? 

-nz(a) 

-nzi 

-Vs 

? 

-if 

nom.-acc. pi. n. 

*-h 2 

zero l-a 

-a 

-a 

-a 

-a 

-a 

? 

?erg. pi. 

*-ntes 

-ntes 

? 

-ntinzi 

-ntinzi 

? 

? 

? 

gen. pi. 

*-om 

-an 

? 

— 

— 

-e 

-av 

? 

dat.-loc. pi. 

*-os 

-as 

-as 

-anz(a) 

-anz(a) 

-e 

-av 

? 

abl.-instr. 

*-ti/-di 

-z(a) 

? 

-ati 

-adil-ari 

-Vdi 

? 

? 

nom. sg. comm. 

*-os 

-as 

-as 

-as 

-as 

-e 

-as 

? 

acc. sg. comm. 

*-om 

-an 

-an 

-an 

-an 

-e 

-evl-av 

-n 

nom.-acc. sg. n. 

*-om 

-an 

-an 

-an 

-an 

-e 

-ad 

-n 1 

gen. sg. 

*-os/-oso/-osyo 

-as 

-as 

— 

-asl-asi 

-ehe 

— 

-s 

dat.-loc. sg. 

? 

-i 

- Ha ? 

- Ha 1 

-Ha 1 

-He 1 

-i 

? 

all. sg. 

*-0 

-a 

? 

? 

? 

-e 

? 

? 

nom. pi. comm. 

*-os 

-es 

-as 

-anzi 

-anzi 

-ei 

? 

? 

acc. pi. comm. 

*-oms 

-us 

? 

-anz(a) 

-anz(a) 

-es 

? 

? 

nom.-acc. pi. n. 

*-a 

-a 

-a 

-a 

-a 

-a 

-a 

? 

gen. pi. 

*-om 

-an 

? 

— 

— 

-e 

[- 0V ] 

? 

dat.-loc. pi. 

*-os 

-as 

[~as\ 

-as 

-as 

-e 

-av 

? 

abl.-instr. 

*-oti/-'odi 

-az(zi) 

? 

-ati 

-adil-ari 

-edi 

? 

? 


Table 3.1 summarizes the nominal case endings of Anatolian, athematic and thematic (since Anatolian 
has very few oxtyone thematic stems, the thematic endings are given in Proto-Anatolian form, with unac- 
cented long *6 already shortened). Endings in boldface represent innovations, — indicates that the cate- 
gory has been lost, [] marks an assured ending accidentally not attested, while a question mark means that 
we have reason to believe the form existed but do not yet have any evidence for it. 


The direct cases call for only minor comment. In addition to the expected animate nomina- 
tive singular -s, there are a few reflexes of sonorant stems reflecting *-e/oR: OHitt. kessar=sis 
‘his hand’ remarkably continues PIE *g h eysor, but in most cases the PIE fonn has been 
renewed with a secondary -s (e.g., Hitt, and Pal. haras ‘eagle’ < *h 3 ero+s, Hitt, hasterz(a) 
‘star’ < *h 2 ster+s). Lydian has generalized the pronominal neuter nominative-accusative sin- 
gular ending *-(o)d to all nominal stem classes. Only Palaic preserves the animate nomina- 
tive plural endings *-es and *-5s, while Hittite has generalized a prehistoric *-es < *-eyes, 
the /-stem ending, and at least Luwian and Lycian, and probably Carian, have renewed the 
animate nominative plural ending based on the accusative plural (for details see Melchert 
2009b). Old Hittite still shows some traces of the athematic nominative -accusative neuter 
plural ending *-h 2 (e.g., assii ‘goods’ = /a:sso:/ < *-uh 2 ), but elsewhere it has been renewed 
by the thematic ending -a < *-eh 2 . Hittite and Luwian also preserve some examples of 
lengthened-grade nominative -accusative neuter plurals such as Hitt, widar ‘waters’ < *wedor. 

The history of the genitive case in Anatolian is complex. For further details of the pic- 
ture presented here and references to differing views see Melchert 2012b. Hittite, Palaic, 
and Hieroglyphic Luwian all preserve the PIE athematic ending in the form *-os and also 
a thematic ending *-o-s. The PIE thematic genitive singular ending *-osyo is continued 
by at least HLuw. /-as(s)i/ and by Carian -s in personal names, the latter generalized to all 
stem classes. The PIE thematic genitive singular *-e/oso appears in Lye. -Vhe, likewise 
spread to all stem classes but restricted to personal names, and probably in the Carian 
synchronic dative singular ending -5. Whether Cuneiform Luwian attests either *-e/oso 
or *-osyo is uncertain. Old Hittite, Lycian, and Lydian show reflexes of the PIE genitive 
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plural *-om, but in Lydian the ending mostly functions as the dative-locative plural. The 
western Anatolian languages largely replace the genitive case with inflected forms of 
possessive adjectives, mostly reflecting a secondary inflection of the ending *-osyo, but 
Lydian employs rather an adjective in -la/i- (< PIE *-lo- with /-mutation), and Hiero- 
glyphic Luwian also uses adjectives in -iya- < PIE *-iyo- in this function. 

As per Hajnal (1995: 98), Lycian infinitives in -Vna reflect the expected athematic 
allative ending *-eh, (Hajnal) or less likely *-h 2 e (Melchert 1994a: 324), while - Vne 
continues the analogically spread thematic ending *-o-h, (as in Lat. quo ‘whither?’). It 
is unclear to what extent Proto-Anatolian distinguished the dative and locative singu- 
lar and, if so, athematic and thematic inflection. The dative-locative plural ending *-os 
attested in Hittite, Palaic, and Lycian represents an archaic form of the dative plural not 
yet reinforced as *-b h (y)os or *-mos. For more on this ending and the history of the num- 
ber-indifferent ablative and instrumental in Anatolian see Melchert and Oettinger 2009. 


Nominal derivation 

This topic can be treated here only in the most summary fashion. For a still largely valid 
overview of the presence and productivity of various PIE suffixes in Anatolian see Oet- 
tinger 1986. More focused treatments of particular stem classes are found in Weitenberg 
1984, Starke 1990, and Rieken 1999a. In brief, Anatolian shows the usual mixture of 
archaisms and innovations and its own patterns of productivity. Root nouns are present 
but recessive (Rieken 1999a: 18-82). PIE *-e/ont- is attested throughout Anatolian in a 
variety of functions (Melchert 2000: 58-70) but notably not in that of an active participle 
(see further below under non-finite verbal forms). Neuter 5-stems of the type *klew-os, 
*klew-es- ‘fame’ (Skr. sravas- etc.) are barely attested, if at all, but there are examples of 
other types (Rieken 1999a: 185-237). 

Stems in -/- (e.g., Hitt, arkis ‘testicle’ < PIE *h,6r%, d(a)lugastis ‘length’) are produc- 
tive in Hittite but mostly recast in Luwo-Lycian in terms of /-mutation (see again Rieken 
2005b). Stems in -u- (e.g., Hitt, genu ‘knee’ < *genu, CLuw. maddu ‘wine’ < *med h u) 
are likewise reasonably productive in Hittite (Weitenberg 1984) but extremely recessive 
elsewhere. The well-known PIE suffix *-ti- is modestly productive (e.g., Hitt, ispanduzzi 
‘libation vessel’, CLuw. arut(i)- ‘wing’), while *-tu- is virtually unattested. Most syn- 
chronic /-stems (e.g., Hitt, ishiul ‘binding; obligation; treaty’) are secondary from original 
thematic stems in *-lo- (thus with Rieken 2008). There are relatively few reflexes of 
neuter men - stems in Hittite, but the class is very productive in Luwian, e.g., tatariyaman- 
‘ curse’ (Starke 1990: 243-299) and likely was in Lycian as well. Perhaps most notorious 
is the abundance in Anatolian of neuter heteroclite stems in -rt-n-\ see Oettinger 1986: 
11-15, Rieken 1999a: 290-417, and Starke 1990: 435-525. 

Primary barytone action nouns with suffix *-o- are moderately productive, but oxytone 
agent nouns in *-6- are virtually unattested (Oettinger 1986: 19). Secondary derivatives in 
*-o- are plentiful. There is a modest number of denominative stems in *-no- and *-to-, but 
exceedingly few verbal adjectives in *-to- and none in *-no- (cf. Oettinger 1986: 22-23). The 
thematic suffixes *-iyo-, *-lo-, and *-ro- all enjoy some degree of productivity in Anatolian. 


PRONOUNS 
Personal pronouns 

The Anatolian accented personal pronouns all continue PIE preforms, and Old Hittite 
maintains the PIE suppletive pattern in the first person singular and plural and second 
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person singular. For the second person plural we find only forms based on the PIE 
non-subject stem *us- (see Katz 1998: 138-141 for competing accounts of their deriva- 
tion). In New Hittite (and all the other languages for which we have evidence) the first 
persons and the second person plural generalize the dative-accusative, but the second 
person singular maintains the contrast of subject and non-subject forms (Table 3. 2). 2 

There is considerable mutual influence between the first and second person pronouns. 
One defining innovation of Anatolian is the spread of the u-vocalism from the second per- 
son singular to the first person singular. However, the spread of the final dorsal stop from 
the subject form of the first person singular to the non-subject form and to both forms of 
the second person singular is a specifically Hittite innovation, since it is not seen in Palaic 
(which does not lose final stops). Likewise, reshaping of the second person plural *us- to 
*unz- based on the first person plural *anz- < *ns- is thus far attested only for Luwian. 
The attested enclitic non-subject forms of the first persons and the second person singular 
are also clearly based on PIE preforms, while the second person plural is based on the 
accented form, but there are signs that the Proto-Anatolian situation was more compli- 
cated, with a distinction of enclitic dative and accusative fonns and subsequent differing 
developments of enclitic reflexives (see Yakubovich 2010: Chapter 3). 

TABLE 3.2 ANATOLIAN PERSONAL PRONOUNS 


First person singular First person plural 



Hittite 

Palaic 

Hieroglyphic 

Luwian 

Lycian 

Lydian 

Hittite 

Cuneiform 

Luwian 

Hieroglyphic 

Luwian 

nom. 

uk 

? 

amu 

emu 1 amu 

amu 

wes 

? 

anzanz(a) 

dat.-acc. 

ammuk 

? 

amu 

emu 

amu 

anzas 

anz(as) 

anzanz(a) 

enclitic 

-mu 

-mu 

-mu 

? 

? 

-nnas 

? 

-anz(a) 

gen. 

ammel 

? 

9 

? 

? 

anzel 

? 

? 

abl. 

ammedaz 

? 

? 

? 

? 

anzedaz 

? 

? 


Second person singular Second person plural 



Hittite 

Palaic 

Hieroglyphic 

Luwian 

Hittite 

Cuneiform 

Luwian 

Hieroglyphic 

Luwian 

nom. 

zik 

tT 

ti 

sumes 

? 

unzanz(a) 

dat.-acc. 

tuk 

tu 

tu 

sumas 

u(n)z(as) 

unzanz(a) 

dat.-acc. 

enclitic 

-tta 

? 

-tu 

-smas 

? 

-mmanz(a) 

gen. 

tuel 

? 

? 

sumenzan! sumel 

? 

? 

abl. 

tuedaz 

? 

tuwari 

sumedaz 

? 

unzari 


Demonstrative and anaphoric pronouns 

As noted above, one defining innovation of Proto-Anatolian is the creation of a demon- 
strative stem *ob h o/i-. This is attested as the accented third person anaphoric pronoun in 
Hittite, Palaic, both forms of Luwian, Lycian, and Lydian and surely had this function 
in Proto-Anatolian. The inflection is that of an o-stem (Table 3.3) in all the languages 
except Lydian, where it is consistently an /-stem. This pattern certainly has nothing to 
do with /-mutation; whether it gives a clue to the origin of the pronoun is unclear (cf. 
Melchert 2009a: 159 with references). Not only the Hittite nom. pi. comm, -e but also 
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the -i- of CLuw. zinzi and zinz(a) reflects PIE nom. pi. animate *-oy (Melchert 2009b). 
For the Hitt. nom. -ace. pi. n. -e as also continuing PIE *-oy see Jasanoff 2008: 144-149. 
The Hittite genitive singular ending -el reflects a suffix *-elo- (thus with Rieken 2008: 
250-251 against all others). See Goedegebuure 2007 for apin as the Hieroglyphic Luwian 
synchronic ablative-instrumental ending < PIE *-im (vindicating the claim of Dunkel 
(1997) for such an ending) and Goedegebuure 2010 for the contrast between Luw. dat.- 
loc. sg. apati and original abl.-instr. apadi (attested as an adverb ‘thus’). 

The enclitic anaphoric pronouns of Anatolian reflect a mixture of reflexes of PIE *e/o- 
and *se/o-, subject to various innovations, again marked in boldface. On these develop- 
ments and the creation of enclitic reflexive pronouns (and the spread of reflexive *-ti from 
Luwian to Hittite, Lycian, and Lydian) see Yakubovich 2010: Chapter 3. 

TABLE 3.3 ANATOLIAN ANAPHORIC PRONOUNS 


Accented third person anaphoric pronouns 



Hittite 

Cuneiform Luwian 

Hieroglyphic Luwian 

nom. sg. comm. 

apas 

apas 

apas 

acc. sg. comm. 

apun 

apan 

apan 

nom.-acc. sg. n. 

apat 

[apa] 

apa 

gen. sg. 

apel 

— 

apas 

dat.-loc. sg. 

aped (an) i 

apatti 

apati 

nom. pi. comm. 

ape 

[apinzi] 

apanzi 

acc. pi. comm. 

apiis 

apinz(a) 

[apanzi] 

nom.-acc. pi. n. 

ape 

[apa] 

apaya 

gen. pi. 

apenzan 

— 

— 

dat.-loc. pi. 

apedas 

[apattanz(a)] 

apatanz(a) 

abl. 

instr. 

apez 

apedand(a)!apet 

apati/apatln 

apin 


Enclitic third person anaphoric pronouns 


Proto-Anatolian Hittite 

Palaic 

Cuneiform 

Luwian 

Hieroglyphic 

Luwian 

Lycian 

Lydian 

nom. sg. comm. 

*-os 

-as 

-as 

-as 

-as 

— 

-as 

acc. sg. comm. 

*-om 

-an 

-an 

-an 

-an 

-e(ne) 

-av 

nom.-acc. sg. n. 

*-od 

-at 

-at 

-ata 

-ada 

-ede 

-ad 

dat. sg. 

*-soy 

-ssel-ssi 

-tu 

-tu 

-tu 

-i 

-m/. 

nom. pi. comm. 

*-oy 

-el -at 

-as 

-ata 

-ada 

? 

9 

acc. pi. comm. 

*-oms 

-us /-as 

? 

-as 

-ada 

? 

? 

nom.-acc. pi. n. 

*-oy 

-el-at 

-e 

-ata 

-ada 

? 

? 

dat. pi. 

*-sm-os 

-smas 

? 

-mmas 

-mmanz(a) 

-fine 

-ms 


Determining the Proto-Anatolian system of deictic pronouns is made difficult by our 
lack of evidence for the languages other than Hittite: we have almost no data for Cunei- 
form Luwian, or for anything but near deixis in Lycian, Lydian, and Carian. Furthermore, 
some cognate stems have very different deictic values in the various languages. The over- 
view here reflects Melchert 2009a, but much remains indeterminate. The only thing that 
seems clear is that the Proto-Anatolian marker of near deixis was *ko- (compare Lat. 
citra ‘on this side’ etc. < *ki-), reflected in Hitt, and Pal. ka-, CLuw. and HLuw. za-, and 
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probably Carian s(a)-. The inflection is entirely that of an a-stem, except for the aberrant 
and as yet unexplained Hitt, nom.-acc. sg. n. la. As shown by Goedegebuure (2002/2003), 
Hittite has a three-way deictic contrast like that of Lat. hie, iste, and ille : ka- for near the 
speaker, apa- for near the addressee, and asi/uni (< PIE *e/o- plus particle *-i) for far 
deixis. In attested Hieroglyphic Luwian, apa- covers the range of both iste and ille, but 
there is indirect evidence that at least Cuneiform Luwian once also had far-deictic *e/o- 
(Melchert 2009a: 152). Very confusingly, however, Lyd. es- < *e/o- marks near deixis, 
and Lye. ehe- does as well. It is quite unclear just which stem(s) marked far deixis in 
Proto- Anatolian and whether it had a system with a two- or three-way contrast. 


Enclitic possessive pronouns 

Emphatic pronominal possession is expressed throughout the history of Hittite by the 
genitive of the personal pronouns and the accented anaphoric pronoun, preceding their 
head noun. In Old Hittite, unemphatic pronominal possession is marked by enclitic pro- 
nouns (more properly pronominal adjectives) attached to the head noun and agreeing 
with it in gender, number, and case: e.g., nom. sg. comm. attas=mis ‘my father’, nom.- 
acc. sg. n. sahessar=summet ‘our fortress’, dat.-loc. pi. hassas=sas ‘for his offspring’ 
(see Hoffner & Melchert 2008: 137-141). However, there were no specific forms for the 
nominative-accusative plural neuter or the ablative, and the nominative -accusative singu- 
lar neuter and the instrumental forms were used respectively by suppletion: sakuwa=smet 
‘their eyes’ and issaz=(s)mit ‘from their mouths’. By New Hittite, use of the enclitic pos- 
sessives was no longer part of the synchronic grammar, as shown by the fact that copyists 
misunderstood the forms in -et and -it and misused them for cases like the vocative and 
dative-locative singular. Unemphatic possession in New Hittite was conveyed by use of 
the appropriate enclitic dative pronoun: nu=ssi KUR=.St/ GUL-un (conj. + ‘him’ (dat.) 
‘land’ (obj.) ‘I struck’) = ‘I struck his land’. It is likely that Cuneiform Luwian attests a 
few examples of enclitic possessives (Carruba 1986), and the category is probably to be 
reconstructed for Proto-Anatolian (thus Garrett 1991-1993: 160-161). 


TABLE 3.4 ANATOLIAN INTERROGATIVE-RELATIVE PRONOUN 



Hittite 

Cuneiform Luwian 

Hieroglyphic Luwian 

nom. sg. comm. 

kuis 

kuis 

REL-j's 

acc. sg. comm. 

kuin 

kuin 

REL-m 

nom.-acc. sg. n. 

kuit 

kui 

REL-anz(a) 

gen. sg. 

kuel 

— 

— 

dat.-loc. sg. 

kuedani 

kuwatti 

REL -ati 

nom. pi. comm. 

kues/kuies 

kuinzi 

REL -inzi 

acc. pi. comm. 

kuius 

[kuinz(a)] 

REL -inzi 

nom.-acc. pi. n. 

kue 

? 

REL-jfl 

gen. pi. 

[ kuenzan ] 

— 

— 

dat.-loc. pi. 

kuedas 

? 

? 

abl. 

kuez 

kuwati(n) 

REL-adi 


Interrogative, relative, and indefinite pronouns 

Anatolian uses the PIE stem *k w o/i- as both the interrogative and relative pronoun - there 
is no trace of PIE relative *Hyo-. The interrogative -relative was clearly inflected as an 
/-stem in the nominative and accusative, subject to very minor innovations: besides the 
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Hittite and Luwian forms in Table 3.4 note also Pal. kui-. Lye. ti-, Lyd. qi-, and Carian 
xi-. In the other cases the interrogative shows the same endings as the anaphoric demon- 
strative stems in Hittite, and our very limited evidence suggests the same for the other 
languages. Hittite attests the occasional use of the bare interrogative stem kui- as an 
indefinite, but the latter is usually expressed by a form with an added particle: nom. sg. 
comm, kuiski, ace. sg. comm, kuinki, gen. sg. kuelka, etc., and the other languages display 
similar creations: CLuw. kuisha, Lye. tike and tise. 

VERBAL MORPHOLOGY 
Verbal categories 

The Anatolian verb system is markedly different in certain basic features from that of the 
oldest stages of Indo-Iranian and Greek and thus from that traditionally reconstructed 
for PIE. As with the absence of the feminine gender, this glaring discrepancy has been 
the object of long and heated controversy: is it to be explained by loss of categories in 
Anatolian, by shared innovations in the rest of the Indo-European languages, or by a 
combination of the two? There is not remotely a consensus on this important issue, and 
what follows necessarily represents a particular viewpoint. One should not, however, 
overstate the matter: many aspects of the Anatolian verb are reassuringly familiar and not 
subject to serious dispute. One should also distinguish between the presence/absence of 
categories and the presence/absence or relative productivity of particular formal means 
of expressing a given category. 

The Anatolian verb is inflected for the expected three persons and for two numbers, 
singular and plural. There is a contrast between active and medio-passive voice, and 
the latter attests the functions seen in other older Indo-European languages. Opposi- 
tional middles indicate “subject affectedness”: (1) reciprocal (Hitt, med.-pass. zahhanta 
‘they strike each other’ vs. act. zahhanzi ‘they strike’); (2) self-interest (Hitt, med.-pass. 
usneskatta ‘offers for sale’ < *‘pledges in return for payment’ vs. act. usneskemi ‘I 
pledge’; and (3) passive (Hitt, med.-pass. hullattati ‘was struck’ vs. act. hullet ‘struck’). 
The passive use is rare in Old Hittite but becomes highly productive by New Hittite. 
Delimiting the functions of media tantum is notoriously more difficult, but Hittite shows 
most of the well-known types: (1) body care/grooming (Hitt, westa ‘wears’), (2) change 
of body posture (Hitt, kitta(ri). Pal. kftar, CLuw. zivar(i), Lye. sijenilsiteni Ties (down)’), 
(3) non-translational motion (Hitt, wehatta(ri) ‘spins (intr.)’), (4) translational motion 
(Hitt, ivatta(ri) ‘walks’), (5) mental states (Hitt, lelanivatta ‘becomes angry’), and (6) 
“spontaneous” events/changes of state (Hitt. kTsa(ri) ‘happens; becomes’). 

The Anatolian verb displays only two moods: indicative and imperative. There are no 
obvious reflexes of the PIE optative or subjunctive. In Hittite the function of the optative 
to express the non-factual (what is possible, contrary to fact, or desired) is filled by the 
particle man/ man, a secondary use of the subordinating conjunction man ‘when(ever), 
if’, itself derived from a rare PIE interrogative-relative stem *mo- (compare Hitt, masi- 
‘how/as many’, Toch. B maksu ‘who, which’). For a likely indirect trace of the PIE sub- 
junctive in Hittite see Jasanoff 2012, but compare Oettinger 2007. 

The Anatolian verb has two tenses, present (which also serves for the future) and pret- 
erite. Hittite does famously develop a periphrastic perfect with hark- ‘hold, have’ and es- 
‘be ’ and the past participle, comparable to similar formations in modem western European 
languages; see Boley 1984 and, on the correlation of auxiliary selection and unaccusa- 
tivity, Garrett 1996: 102-106. As described in summary fashion below, Anatolian attests 
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virtually all of the suffixes used to form the “present”, that is, imperfective, stem in “Core 
Indo-European”. It is quite uncertain, however, whether Anatolian inherited a true aspec- 
tual contrast of imperfective (“present”) vs. perfective (“aorist”). The evidence presented 
in Melchert 1997 for reflexes of such a contrast is suggestive, but far from compelling, 
and one may also suppose that Anatolian inherited the various present- forming suffixes 
merely as markers of Aktionsart and that the development of a true aspectual system is a 
common innovation of the non-Anatolian languages; for a well-reasoned presentation of 
this viewpoint see Strunk 1994. 

Synchronically, most, if not all verbs are “monothematic”, showing a single stem that 
may be understood as perfective or imperfective according to context. However, as shown 
by evidence that is plentiful in Hittite and limited but clear in the other languages, Anato- 
lian developed the means to optionally mark a finite verb as explicitly imperfective, most 
often by addition of a reflex of the PIE suffix *-ske/o-, but also for some verbs of *-enh 2 i- 
or *-s(e/o)- (whose PIE antecedents have not been fully elucidated). This “marked” imper- 
fectivity may be realized as various Aktionsarten, depending on the context and the lexical 
semantics of the base verb: progressive, iterative, durative, habitual, inceptive, or distribu- 
tive (see Hoffner & Melchert 2008: 318-323 for illustrations from Hittite). 

There is no Anatolian category corresponding to the PIE perfect, generally agreed to 
have expressed an attained state, whether one defines it as an aspect (e.g., Meier-Brugger 
2000: 155) or an Aktionsart (e.g., DiGiovine 1996: 273). Once again, explanations for 
this absence differ radically, as discussed further in the next section. 

Verbal inflection: active 

The Hittite finite verb follows in the active voice one of two inflectional patterns, labeled 
after the respective present first-person singular endings the “m/-conjugation” and the 
“/^-conjugation” (Table 3.5). Evidence from Luwian (see Morpurgo Davies 1979) sug- 
gests that this system is Proto-Anatolian, although examples from Lycian and Palaic are 
extremely sparse and not beyond question. Even in Hittite there was much mutual influ- 
ence between the two conjugations, and in the other attested languages the difference 
may have been reduced to just the present third singular indicative. 


TABLE 3.5 HITTITE VERBAL INFLECTION 


Active indicative 


7?7 /-conjugation 


///-conjugation 

Present 


Singular 

Plural 


Singular Plural 

1 

-mi 

-wenil-wani 


-hhel-hhi -wenil-wani 

2 

-si 

-ttenil -ttani 


-ttH-sti -(s)teni/-(s)tani 

3 

-zzi 

-anzi 


-el-il-ai -anzi 

Preterite 


Singular 

Plural 


Singular Plural 

1 

-unl-anun 

-wen 


-hhun -wen 

2 

-sl-tta 

-ten 


-tta/-sta -(s)ten 

3 

-t(a) 

-ert-ar 


-s/-sta -er 



186 H. CRAIG MELCHERT 


Active imperative 


mi-conjugation fe'-conjugation 



Singular 

Plural 

Singular 

Plural 

1 

-allu 

-weni 

-allu 

-weni 

2 

zero l-t 

-tten 

zero /-i 

-tten 

3 

-ttu 

-antu 

-u 

-antu 


Medio-passive indicative 


Present 


Preterite 



Singular 

Plural 

Singular 

Plural 

i 

-(h)ha(ri), -(h)hahari 

-wasta(ti) 

-(h)hat(i), -(h)hahat(i) 

-wastat(i) 

2 

-(t)ta(ri), -(t)tati 

-t(t)uma(ri) 

-at, -(t)tat(i) 

~(d)dumat 

3 

-a(ri), -(t)ta(ri) 

-anta(ri) 

-at(i), -(t)tat(i) 

-antat(i) 


Medio-passive imperative 


Singular 

Plural 

i 

-hharu, -haharu 

-wastati 

2 

-hut(i) 

-(d)dumat(i) 

3 

-aru, -(t)taru 

-antaru 


The singular endings of the mi-conjugation manifestly match the primary and second- 
ary endings standardly reconstructed for the present-aorist system in PIE. While only 
Hittite has 1 sg. pres, -mi (see below on the other languages), Palaic, Luwian, and Lydian 
also attest to 2 sg. pres, -si, and these plus Lycian attest to 3 sg. pres, -ti (only Hittite assi- 
bilates *t before syllabic *i). Hitt. 1 sg. pret. -un continues athematic *-m, while -anun 
reflects thematic *-om renewed by the athematic ending. Lyd. 1 sg. pret. -v continues at 
least *-om (with syncope) and perhaps also *-m. Older Hittite still preserves -s as the 2 
sg. pret. ending of vocalic stems (e.g., ies ‘you did’), but consonant stems have already 
adopted -to from the ///-conjugation (e.g., epta ‘you took’). Hittite and Palaic show 3 sg. 
pret. -V-t but -C-to, the latter with a genuine “prop-vowel” (Hitt. 3 sg. pret. e-ku-ut-ta 
‘drank’ = /eg w ta/, not f e-ku-uf). On the other hand, the consistent 3 sg. pret. ending -t(t)a 
of Luwian and -tet-de of Lycian after a vowel as well as a consonant reflects renewal of 
the third person preterite active endings by those of the medio-passive (thus with Yoshida 
1993). 

The Anatolian first person plural active endings show a -w-, recalling the PIE first 
person dual endings, but also an -n matching the Greek dialectal 1 pi. act. -psv (cf. Eich- 
ner 1975: 79). Hitt. 1 pi. pret. -wen and Lyd. -wv reflect a virtual *-wen, to which was 
added the usual mark of the present *-i to form 1 pi. pres. *-weni continued by Hitt, and 
Pal. -weni and CLuw. -wannil-unni. On the model of the first person plural in *-wen(i), 
Anatolian created a matching second person plural *-ten(i), extending PIE *-te, attested 
in Hittite, Palaic, and Luwian. In posttonic position *-weni and *-teni regularly became 
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Hitt, -wani and -t(t)ani (Melchert 1994a: 137-138 after Cowgill). The ending -t(t)ani 
has no connection whatever to the *-th,ene seen in Skr. -thana, contra Eichner 1975: 79 
and others. The present third person plural ending of Hittite -anzi and -anti elsewhere 
probably reflect PIE athematic *-enti and *-nti as well as thematic *-onti, but it is dif- 
ficult to establish this beyond doubt. Only Pal. 3 pi. pret. -anta may preserve a trace of 
the expected secondary ending *-ent/-nt/-ont (but it cannot be excluded that Pal. -anta 
reflects medio-passive *-onto, as in Luwian and Lycian; see Yoshida 1993: 34, note 21). 
In Hittite, the third person plural preterite ending of the /?/-conjugation has been gener- 
alized to all verbs, usually in the form -er < *-er, but there are a few examples of -ar < 
*-r. Much less certain is whether Lyd. 3 pi. pret. -rsl-ris continues PIE *-rs, matching Skr. 
-ur (see Melchert 2004a: 147, note 14, on the synchronic ending). The Anatolian preterite 
shows no credible traces of the PIE “augment” *e-. 

The present singular endings of the ///-conjugation strongly resemble those of the PIE 
perfect and can in fact be straightforwardly derived from endings matching the latter plus 
the *-i particle that characterizes the present indicative: PIE *-h 2 ey > OHitt. -hhe and 
PIE *-ey > OHitt. -e, already mostly leveled to -hhi and -/ after the majority of present 
endings ending in -/. The leveling has already occurred in Hitt. 2 sg. pres, -tti for *-tte < 
*-th,ey. The third singular ending is renewed partially in Hittite and totally elsewhere by 
an ending -ai spread from monosyllabic stems where it results from contractions (thus 
Luw. -ai and probably Pal. -ai and Lye. -e). An entirely unsolved problem is the gener- 
alized 1 sg. pres, -wi of Luwian, which is surely matched by Lyd. -Cu/-Vw (with regular 
apocope) and probably also by Lye. -u. 

Endings matching the perfect also appear in 1 sg. pret. Pal. -hha, CLuw. -hha, HLuw. 
-ha, and Lye. -xa/-ga < *-h,e (Hitt, -hhun has been reshaped after m/-conjugation -uri). 
However, Hitt. 3 sg. pret. -s obviously cannot be connected with the PIE perfect, and the 
ending -tta in Luwian and -te in Lycian is again a renewal by the medio-passive ending 
*-to (Yoshida 1 993: 3 1-33) that tells us nothing about the Proto-Anatolian state of affairs. 

It is extremely unlikely that the Anatolian ^/-conjugation is to be derived from the 
PIE perfect, or vice versa (for trenchant criticism of attempts at the former see Jasanoff 
2003: 7-17), but just what the true relationship of the two categories is remains to be 
determined. Jasanoff (2003), elaborating earlier publications, has made a strong case for 
both a “/? , e-present” and a “ */?_,e-aorist” in PIE, but many issues remain unresolved. On 
the status of *o/e and *o/zero ablaut in root *h 2 e-verbs see Melchert 2013 versus Kloek- 
horst 2012. Regarding the ablaut pattern of *h 2 e-presents in -/- (Hitt. 3 sg. pres, dai, 3 pi. 
pres, tianzi ‘put(s)’) compare the differing analyses of Kimball (1998), Jasanoff (2003: 
98-107), and Kloekhorst (2008: 807 and passim). For the source of the 3 sg. pret. ending 
-s of the ^/-conjugation and of the -s- that appears in the second person plural of /?/-verbs 
in -/-, one may contrast the proposals of Jasanoff (2003: 174—197 and 2012: 118-119), 
Kloekhorst (2007), and Melchert (2015). 

The status of the perfect in Anatolian also remains disputed. Jasanoff (2003: 1 1 and 37 
and 117-118) argues that Hitt, wewakk- ‘demand’ and memi/a- ‘speak’ continue redupli- 
cated perfects. See also Forssman 1994: 103 for Hitt, sipand- Tibate; sacrifice’ < *spe- 
spond- (vs. ispant- ‘idem’ < *spend/spnd-). But even if one or more of these verbs reflects 
a perfect in formal terms (which is far from assured), none demonstrably shows the 
“attained state” meaning assumed for “Core Indo-European”. Oettinger (2001 : 80-83 and 
2006: 37 — 42) views the perfect (along with the present type of Skr. dadhati and the inten- 
sive type of Skr. varvarti) as a post-Anatolian development of a PIE reduplicated pres- 
ent that appears in mostly de-reduplicated form in the Anatolian ^/-conjugation. Hajnal 
(1999: 8-25) sees rather the Anatolian ^/-conjugation and the post- Anatolian perfect as 
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separate developments of a PIE “proto-perfect”, the perfect having been formally influ- 
enced by inherited intensive presents. A reasoned choice between these competing sce- 
narios is not yet possible. 

The imperative is marked in the third person by replacing the -i of the indicative end- 
ings with -u, a clearly Proto-Anatolian feature, as shown by the equation of 3 sg. imp. 
Hitt, esdu, Pal. asdu, CLuw. asdu, HLuw. /a:stu/, and Lye. esu ‘let be’. In Hittite this -u 
was spread to the first person singular voluntative in -allu, whose prehistory remains 
unclear. The exhortative first person plural in -weni is formally identical to the indicative 
and can be identified only by context; whether it continues a PIE subjunctive is impossi- 
ble to determine. The second person singular imperative is, as expected, usually the bare 
verbal stem with a zero ending. One finds the reflex of the PIE ending *-d h i in the Hitt. 
2 sg. imp. it ‘go!’ (cognate with Skr. ihi and Gr. lOi) and in verbs with the suffix -mm-: 2 
sg. imp. warnut ‘burn!’. The second person plural imperative continues PIE augmentless 
forms with secondary endings. Since the preterite indicative also lacks the “augment”, 
one can distinguish second person plural preterites and imperatives only by context. 

Verbal inflection: medio-passive 

The endings of the PIE medio-passive are also ultimately related to those of the per- 
fect and the Anatolian /?/-conjugation, but this relationship is a pre-PIE matter. Anato- 
lian incontrovertibly inherited a formally and functionally distinct medio-passive voice. 
As demonstrated by Yoshida (1990), the present medio-passive was marked in Proto- 
Anatolian at least in the third person by endings in *-r (a feature shared with Italo-Celtic 
and Tocharian). This ending was lost by regular sound change, totally in Lycian, but 
only partially in Hittite, Luwian, and Palaic (the situation in Lydian is unclear). In the 
course of attested Hittite, the 7-endings were restored, reinforced by the particle -i of the 
present active (Table 3.5). The details of the developments in Palaic and Luwian remain 
unclear due to the paucity of evidence (see Yoshida 1990: 115-117). In Lycian we find a 
few medio-passive forms marked by a nasal ending (1 sg. pres, sixani ‘I lie’, 3 sg. pres. 
sijeni/siteni Ties’, 1 sg. pret. axaga ‘I became’, for which see Melchert 1992b). The 
source of this inflection is quite uncertain. 

In the preterite and also in the second person singular and first person plural of the pres- 
ent, the Hittite medio-passive shows inherited endings enlarged by a particle -ti (reduced 
to -t in the preterite after Old Hittite, surely because verbal endings in -i were strongly 
associated with the present tense). As already seen by Neu (1968: 145), this element -ti 
([-di]) must be a variant of the reflexive particle *-ti, which in the latter function appears 
assibilated as Hitt. -z(a). See for details of the derivation Yakubovich 2010: 199-205 
and Yoshida 2010: 237-238. Late Hieroglyphic Luwian attests preterite medio-passives 
ending in -si (Oshiro 1993: 53-54 and Rieken 2004), which must somehow continue a 
reflexive use of *-soy (attested as such in Palaic), but the appearance of HLuw. -si in 
this function so late raises serious issues of relative chronology (cf. Yakubovich 2010: 
201 - 202 ). 

As in the active, the imperative is marked in the third persons by replacing -i with -u 
(cf. also CLuw. 3 sg. imp. dyaru and HLuw. 3 sg. imp. /ifiyaru/ ‘let it become’), and 
at least in Hittite this was also spread to the first person singular. Again as in the active 
the first person plural exhortative in Hittite is formally identical to the indicative, and 
the second person plural is identical with the preterite indicative. The Hittite 2 sg. imp. 
ending -hut(i) is of uncertain origin, though it is likely that the -t(i) is again the reflexive 
particle [-di]. 
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Verbal stem formation 

As indicated above, whether or not Anatolian inherited a fully developed aspectual con- 
trast of imperfective versus perfective, it certainly does preserve evidence for virtually 
every type of PIE “present”-fonning stem. The following summary agrees on most 
substantial points with the presentation in Oettinger 1979, with the important updates 
in Oettinger 2002. We find the following familiar primary formations: (1) root pres- 
ents (PIE *h[esti, *h,s-enti > Hitt, eszi, asanzi, CLuw. asti, *asanti, Lyc. esi, *ahnti ‘is, 
are’), (2) aerostatic “Narten” presents (PIE *wekti, *weknti > Hitt, wekzi, wek(k)anzi 
‘demand(s)’), (3) barytone and oxytone media tantum (PIE *keyor > CLuw. zivar(i), 
Lyc. sijeni Ties’ and renewed *keytor > Hitt, kitta(ri), Pal. kitar, and Lyc. sitenh PIE *t / 
dukor > Hitt, tuqqari ‘is visible; important’), (4) nasal infix presents (e.g., PIE *h,u-ne- 
g-ti, *h 2 u-n-g-enti > Hitt, hunikzi, huninkanzi ‘wound(s)’- see Shatskov 2006 on the Hit- 
tite stem formation - and PIE *d h wr-ne-h[-ti, *d h wr-n-h,-enti > Hitt. d(u)warnezzi, d(u) 
warnanzi ‘breaks’ - see Kloekhorst 2008: 907 on the Hittite stem formation), (5) verbs 
with the suffix *-n(e)u- (extremely productive, as in Hitt, huinu- and CLuw. huinu(wa)- 
‘run’ and Lyc. qanuwe- ‘cause to be slain’), (6) presents in *-ye/o- with a full-grade root 
(PIE *kerp-ye-ti, *kerp-yonti > Hitt, karpiezzi, karpiyanzi ‘lift(s)’, Lyd .fa-korfid ‘under- 
takes’), (7) iterative presents in *-eye/o- and a zero-grade root (virtual *(s)tubh-eye/o- > 
CLuw. dupiti, dupainti and Lyc. tubidi, tubeiti ‘strike(s)’), and (8) causative presents in 
*-eye/o- and an o-grade root (PIE *lowkeye/o- > Hitt, lukkezzi, lukkanzi ‘kindle(s)’ - see 
Oettinger 2002: xx). 

Simple thematic presents existed in Anatolian: HLuw. tama- (/damma-/) ‘build’ with 
3 sg. pres. AEDIFIC ARE +M/-n'+z' = /dammari/ < *demh 2 -eti - where the rhotacized 
ending reflects *-di voiced from *-ti between unaccented morae - and Hitt, suwezzi, 
suwanzi ‘push(es), reject(s)’ < *suh r eti, *suh r onti (Oettinger 1979: 297). They are, 
however, startlingly rare vis-a-vis “Core Indo-European’’: for one account of this situ- 
ation see Jasanoff 1998. As already discussed above, “z-presents” in Hittite belong to 
the /zz-conjugation, with an ablaut pattern that is disputed. The status of “zz-presents” 
is unclear, with Hittite attesting one clear case in the zzzz'-conjugation ( tarhuzzi/taruhzi 
‘conquers’ < *terh 2 u-ti - see Kloekhorst 2006: 98-101 and 2008: 835-838) and one in 
the /zz-conjugation ( lahui ‘pours’ < *loh,w-ey - see Melchert 2011b). There are numerous 
reduplicated stems in Anatolian, but they have not yet received a systematic treatment, 
and the possible relationships of the various types to those of PIE remain to be elucidated. 

PIE denominative formations are also well attested in Anatolian. Those in *-ye/o- are 
very productive in all the languages and hardly need illustration. We also find factitives 
in *-eh 2 , which as shown by Hittite originally belonged to /zz-conjugation (Jasanoff 
2003: 139): OHitt. 3 sg. pres, suppiyahhi ‘purifies’. In New Hittite there is transfer to the 
zzzz'-conjugation ( suppivahzi ), and this shift is complete in the other languages (e.g., Lyc. 
prnnawati ‘builds’), but the secondary nature of this inflection is betrayed by the lack 
of voicing of the -t- of the ending (correct Hajnal 1995: 162, note 82, contra Melchert 
1994a: 69). Anatolian also attests the extended factitive suffix *-eh 2 -ye/o- represented by 
the Hittite type of armaizzi, armanzi ‘impregnate(s)’ (beside armahh-) and Lyc. xttadi, 
xttaiti ‘harm(s)’. 

Continuants of PIE root aorists are assured (Eichner 1975: 82): e.g., Hitt, kuerzi, 
kuranzi = CLuw. k(u)warti, *kuranti ‘cut’ (Skr. akar ‘made’) and Hitt, tezzi ‘says’ = Lyc. 
tadi ‘puts’ (Skr. adhat ‘put’). Whether there are any attested reflexes of the PIE sigmatic 
aorist is uncertain. Anatolian tells us nothing about the PIE status of the thematic or 
reduplicated aorist. 
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Non-finite verbal forms 

Proto- Anatolian clearly formed productive neuter verbal nouns with a suffix *-wer/-wen-. 
Hittite evidence suggests that these originally were inflected according to the “proteroki- 
netic” pattern, in idealized fonn nom.-acc. R(e)-wr, gen. R(zero)-wen-s. Hittite productively 
generalizes the post-vocalic fonn -war of the nom.-acc. and the gen. sg. -was < *-wen-s. 
There are, however, some traces of post-consonantal *-wr and a generalized zero grade of 
the suffix: Hitt, henkur, henkunas ‘offering’. Hittite no longer has a full paradigm of this 
verbal noun, but the old abl.-instr. *-wen-ti functions as the infinitive in -wanzi, and the 
endingless locative in *-wen appears as the “supine” in -wan (see below for its use). The 
other languages appear to have generalized the zero-grade form of the suffix, based on the 
use of the allative of the paradigm as the infinitive in Palaic, Luwian, and Lycian: Pal. -una, 
Luw. -una, and Lyc. -na < PAnat. *-una < *-uneh, (as per Hajnal 1995: 98, the more com- 
mon Lyc. -ne reflects analogical influence of the thematic allative ending *-oh 2 ). The other 
attested Hittite infinitive in -anna is a specific Hittite innovation, representing the allative of 
the Hittite verbal abstract in *-eh 2 -tr, *-eh,-tn-: virtual *-eh 2 -tn-eh 2 . Lydian infinitives have 
not yet been identified with confidence. Brief mention should also be made of Hieroglyphic 
Luwian gerundives in -min(a), thus far attested only in predicatival function in nominal 
sentences with the meaning ‘(X is) to be ed’ (Melchert 2004b). 

One of the great surprises of Anatolian from the viewpoint of other older Indo-European 
languages is that, as intimated earlier, there is hardly any trace of verbal adjectives in 
*-to- and none in *-no-. Instead, one finds in the function of a past participle (that is, one 
expressing an attained state) derivatives in *-e/ont-. This is the productive form in Hittite, 
predictably usually with a passive sense for transitive verbs (e.g., kunant- ‘killed’), but 
merely resultative for intransitives (e.g., uwant- ‘come, arrived’). Also unsurprising is 
that in generic use (i.e., without an expressed object) the participles of transitive verbs 
can have an active sense (e.g., adant- ‘having eaten’, pah(ha)snuwant- ‘protecting, on 
guard’). How one is to explain this use versus the active processual meaning of *-e/ont- 
in “Core Indo-European” has yet to be decided. 

In Luwian and Lycian there are mere relics of the past participle in *-e/ont- (e.g., 
CLuw. walant-l ulant- and Lyc. lata- ‘dead’). The productive past participle is of the form 
-mma/i- = -me/i- (with the ubiquitous /-mutation). Despite its enduring popularity, the 
derivation of this suffix from a PIE medio-passive participle cannot possibly be correct. 
One must insist on the fact that the function of this participle is precisely the same as that 
of Hitt, -ant-: it does not express an ongoing activity “being Xed”, but rather an attained 
state, predictably usually passive for transitive verbs: e.g., CLuw. dupaimma/i- ‘struck’, 
awimma/i- ‘come, arrived’. The usual exceptions apply to transitive verbs, and we find 
HLuw. /adamma/i-/ ‘having eaten’ and /uwamma/i-/ ‘having drunk’. Given this meaning 
and the rampant productivity of neuter verbal nouns in *-men precisely in Luwo-Lycian, 
there should be no doubt that these past participles originate in secondary thematic deriv- 
atives with possessive sense: virtual *-o-mn-o- (Melchert 2014: 206-207). 


ADVERBS AND ADPOSITIONS 

There is no truly productive means of forming manner adverbs in Anatolian. Both Hittite 
and Luwian show use of the nominative-accusative plural neuter of adjectives in this 
function: e.g., Hitt, karsi (and renewed karsava) ‘frankly’ < karsi- ‘bare, unadulterated; 
frank’ and CLuw. wasu ‘well’. Hittite does show a limited productive use of adverbs in -Hi 
(e.g., duddumili ‘silently’ < duddumili- ‘silent, mute’), almost certainly the nominative- 
accusative plural neuter of adjectives in *-ili- (compare Lat. -ilis). 
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What calls for extended comment is the Anatolian system of “local adverbs”, attested 
as free-standing adverbs, adpositions, and preverbs. Starke (1977: 127-180) established 
that in Old Hittite there were several pairs of such adverbs, one expressing location and 
the other direction. However, there are signs that the Old Hittite situation is in certain 
respects innovative vis-a-vis Proto-Anatolian. E.g., the form of the Hittite /w'-verb appai, 
appianzi ‘step back; be finished’ argues that the Pro to- Anatolian form of the adverb 
‘back’ was *opi, as attested in CLuw. appi/ HLuw. api, and that Hitt, appa ‘back; again’ 
has been reshaped after other directional adverbs that came to end in -a in Hittite (such 
as anda ‘into’ < *endo). 

Despite repeated claims to the contrary, there is no evidence that most Anatolian local 
adverbs represent frozen case forms of original substantives. As described in Melchert 
2009c, the use of enclitic possessives with local adverbs in Old Hittite is in most cases 
merely analogical to new postpositions that manifestly are case forms of nouns, such as 
tapusz(a) ‘beside’ < tapus- ‘side’. The one notable exception is the family of Hitt, ser 
‘above; over’ and sard ‘up’ (with plentiful cognates elsewhere in Anatolian). Here we 
truly are dealing with an archaic noun referring to a height: compare Gr. piov ‘mountain 
peak’. Much work remains to be done in establishing the Proto-Anatolian antecedents of 
the respective systems of the attested languages. 

SYNTAX 

Morphosyntax 

The use of the nominal cases in Anatolian is mostly unremarkable in Indo-European 
terms. We find the “dative of disadvantage” with persons: e.g., OHitt. LU SAGI (subj.) 
LUGAL-/ (dat. sg.) NINDA.GUR 4 .RA (obj.) epzi ‘takes’ = ‘The drink-server takes the 
leavened bread from the king’. Due to the formal merger of the dative and locative, the 
dative-locative could analogically also be used to express ‘place from which’ with inan- 
imate objects, in competition with the ablative: OHitt. kardi=smi=at ‘your heart’ (dat. 
sg.) + ‘it’ (nom.-acc. sg.) dahhun ‘I have taken’ = ‘I have taken it (the sickness) from 
your heart’. Hieroglyphic Luwian shows the same use of the dative-locative beside the 
ablative-instrumental (see KARKAMIS A6, §§27-28, Hawkins 2000: 125). Due to a 
reanalysis of a dative of goal beside an infinitive expressing purpose with motion verbs, 
Anatolian came to use the dative-locative for objects of infinitives besides the accusative: 
see Melchert 2012 (2014). On the controversial issue of “split ergativity” see Goedege- 
buure 2012 with references to previous analyses. 

To express ‘begin to ’, Hittite uses either a combination of the reflexive particle 

-z(a), the finite verb ep- ‘take’, and the infinitive, or the finite verb dai- ‘put’ (second- 
arily also tiva- ‘step’) plus the so-called supine in -wan (see Hoffner & Melchert 2008: 
§§25.20 and 25.37-38). Hittite also famously develops a “serial” or “phraseological” 
construction that combines a finite form of pai- ‘go’ or uwa- ‘come’ with another finite 
verb in a single clause. The precise meaning, synchronic structure, and historical source 
of this construction have elicited a variety of analyses. For a recent discussion see Roller 
2013 with ample references to previous treatments. 

Configurational syntax: word order 

The unmarked, functionally neutral word order of all the second-millennium languages 
is clearly SOV. Lydian also is unmistakably SOV. Lycian has manifestly innovated, but 
the paucity of diagnostic examples has made it hard to determine just what the “basic” 
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Lycian word order is; for a tentative claim of VSO see Garrett 1994: 30-32. One must 
emphasize the notion of “unmarked” or “basic” word order, because other orders of 
major constituents are by no means rare, and recent intensified study of Hittite order 
has revealed even more variation than previously acknowledged. In trying to make 
sense of this variety, one must distinguish two separate though interrelated issues: 
(1) how do various configurations correlate with discourse structure (topic, focus, 
anaphora); and (2) which orders are licensed by the syntax, and which by the phonol- 
ogy? Significant progress has been made in addressing these questions, but much work 
remains to be done. One complicating factor is that it is now clear that for Hittite one 
cannot use as primary data material from mythological texts or ritual incantations, 
since word order in these may show influence from Hattie or Hurrian (see Rizza 2007 
and Rieken 2011). A systematic overview is impossible here and would be almost 
immediately outdated in any case. In addition to the summary in Hoffner and Melchert 
2008: 406-409 (flawed by the uncritical use of material from translation literature), 
one may get a sense of what has been achieved and how much remains to be done by 
consulting Bauer 2011, Goedegebuure 2013 (with references to previous analyses), 
and Sideltsev forthcoming. 

One famous feature of Anatolian syntax is that most clauses in connected dis- 
course are linked to a preceding clause by “sentence-initial” conjunctions to which 
are attached sequences of enclitics (unemphatic anaphoric pronouns plus a variety of 
“particles”) by what is known as “Wackernagel’s Law”. For persuasive arguments that 
said conjunctions are actually interclausal see Agbayani and Golston 2010. The begin- 
nings of this system are certainly Proto-Anatolian, but several facts argue that it was 
there only embryonic and that each of the Anatolian languages has elaborated it in its 
own fashion. First, there are differences in the order of the clitics: in Hittite the singular 
dative clitics follow third person nominative or accusative clitics ( n=at=mu/si conj. + 
nom.-acc. sg. n. ‘it’ + ‘to me’/‘to him’), but in Luwian the order is the reverse (HLuw. 
/a/=/wa/=/m(u)/=/an/ conj. + quotative marker + ‘to me’ + ace. sg. comm. ‘it’). Second, 
different morphemes are used for the same function: Hittite uses =ma to mark con- 
trastive focus, while Palaic and Luwian employ =ppa in this function. Third, the same 
morpheme appears in different functions: PIE *nu ‘now’ still appears as such in Pal. nu, 
but it has begun to be grammaticalized as a clause-linking conjunction (Carruba 1970: 
65), a development completed in Hittite, while the Lydian cognate -in (with apocope 
of the original vowel and then anaptyxis) comes at the end of the clitic sequence and 
likely has an asseverative value (that it is cognate cannot be doubted, contra Gusmani 
1964: 133). 

Nevertheless, some features can tentatively be attributed to Proto-Anatolian. The only 
clause-linking conjunction that may be Proto-Anatolian is *oh[ (cf. Dunkel 2007: 57), the 
frozen instrumental of the PIE *e/o- anaphoric pronoun, attested as a- in Hittite (rare), 
Palaic, and both forms of Luwian, and in extended Lyd. ak, in all cases with a prosecutive 
sense, roughly ‘and then’. Contrary to the claim of Watkins (1963: 13-16) and others, the 
Old Hittite set of clause-linking conjunctions su, ta, and nu cannot be compared with OIr. 
se, to, and no. As shown by Weitenberg (1992: 327), OHitt. su is used only with the pret- 
erite, while ta is used with the present-future. This distribution supports the etymology 
of su < PIE *h,su (Zimmer apud Dunkel 2007: 56-57) and of ta < instr. *toh, (Rieken 
1999b: 86). In any case, all three of the cited conjunctions are creations of Hittite. Of the 
various “local particles” that follow the anaphoric pronouns in the clitic sequence, that 
most likely to have existed already in Proto-Anatolian is *-te, which probably indicated 
movement from one spatial domain across a boundary into another (Josephson 1995: 
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171), attested in Pal. and Luw. -tta, Lyd. -(i)t-, and Hitt, -(a)sta, the last via metanalysis 
(Josephson 1972: 419). 

Configurational syntax: subordinate clauses 

Anatolian has conditional and temporal subordinating clauses, using a variety of conjunc- 
tions built on interrogative-relative stems (see Hoffner & Melchert 2008: 414-423 for an 
overview of these in Hittite). Differences in detail among the various languages leave in 
doubt the status of such clauses in Proto-Anatolian. Anatolian lacks “final clauses” and 
uses parataxis (Hitt, nu, Lye. me): e.g., Hitt, takku LU-an pahhueni kuiski pessiezzi n=as 
aki ‘if anyone makes a man fall into a fire, so that he dies’. 

Our understanding of Hittite and Anatolian relative clauses has changed dramatically 
in the last decade, and ongoing research promises to bring further changes in the next 
few years. Contrary to a long-standing consensus (Garrett 1994, following Hale 1987), 
Goedegebuure (2009) and Huggard (2011) have shown that Hittite does not have overt 
“wh-movement”; the surface order of wh-interrogative and relative clauses is governed by 
considerations of focus. Furthermore, there is also counterevidence to the claim that Hit- 
tite distinguishes “determinate” from “indeterminate” preposed relative clauses in that the 
former must be preceded by at least one full constituent, while the latter occur only clause- 
initially (not counting clause-linking conjunctions and any attached clitics); see Held 1957, 
Hale 1987: 46-49, and Garrett 1994: 45-49. Note the two indeterminate relatives preceded 
by focused constituents in the following New Hittite text (KBo 5.4 Ro 33-34): 

namma ANA - UTU-5/ kids LU.KUR \n=as=tta\ LU.KUR esdu tuk=ma kuis LU.KUR A/VT 
further to His Majesty who enemy conj.=he=you enemy let be you=focus who enemy to 
d U7U-$[=ya=as LU.KUR 
His Majesty=also=he enemy 

‘Furthermore, whoever is an enemy to His Majesty shall be an enemy to you, while- 
whoever is an enemy to you is an enemy to His Majesty.’ 

While preposed correlative relative clauses of the type just shown are statistically by far 
the most common type in Hittite, Probert (2006) showed that Old Hittite also has a type 
of embedded relative clause. Current research reveals that Hittite also has relative clauses 
embedded between constituents of the main clause, as well as “free relatives”, while 
postposed restrictive and non-restrictive relatives are far more common than previously 
acknowledged. 

LEXICON 

Tischler (1979) demonstrated that contrary to a widespread perception (see, e.g., Kronasser 
1956: 219), at least two-thirds of Hittite’s core vocabulary that is known is either inherited 
from PIE or built on inherited material. The misperception that a majority of the Hittite 
lexicon is non-Indo-European is due to the nature of our textual evidence, much of which 
deals with the state cult and rituals. Since Hittite religion and ritual were heavily influenced 
by Hattie and Hurrian, unsurprisingly much of the vocabulary for the relevant personnel 
and apparatus is borrowed. Nor is it unusual that a significant portion of the words for Ana- 
tolian flora and fauna are loanwords. This has nothing to do with core vocabulary, which 
is securely Indo-European. Our knowledge of the core vocabulary of the other Anatolian 
languages is woefully inadequate, but what is known suggests the same for them. 
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THE POSITION OF ANATOLIAN 

The notable differences between the Anatolian languages and the grammar reconstructed 
for PIE on the basis of the other subgroups has elicited a variety of responses. Some 
scholars have attributed them nearly entirely to changes in Anatolian (mostly losses, 
hence the overly simplistic label “Schwundhypothese”): see Pedersen 1938 and Eichner 
1975. Others have argued that Anatolian is in Stammbaum terms a “sister” of recon- 
structed PIE, one that preserves many archaic features of a much earlier proto-language 
labeled “Indo-Hittite”, from which PIE had markedly diverged: thus Sturtevant 1933 and 
Lehrman 1998. In the 1960s a number of scholars proposed that the reconstruction of PIE 
itself, especially the verbal system, needed to be drastically revised, on the basis of data 
not only from Anatolian but also from other subgroups: see the very diverse views of 
Adrados 1963, Meid 1963, Neu 1968, and Watkins 1969. 

After more than two decades during which a compromise between these diametrically 
opposed views seemed unlikely, there has since developed a growing consensus that the 
non-Anatolian languages did participate in some common innovations setting them apart 
from Anatolian, but that these innovations are neither particularly numerous nor for the 
most part profound. Nor do they preclude that Anatolian shared in certain post-PIE inno- 
vations with particular subgroups (cf. Puhvel 1994). Debate now centers on just how many 
revisions must be made to PIE to account for the features of Anatolian - or, to phrase it dif- 
ferently, just which features of “Core Indo-European” (= “Restindogermanisch”) actually 
represent common innovations not shared by Anatolian. For three individual viewpoints 
one may compare Rieken 2009, Oettinger 2013-14, and Melchert forthcoming, but one 
should not conclude that these by any means represent the full spectrum of opinion. 

FURTHER READING 

In addition to the works already cited, one may consult the following: Popko (2008) 
offers a useful overall summary; see for an overview of Hittite Rieken 2005a and for 
grammatical sketches of all the languages the relevant chapters in Woodard 2004. Hoff- 
ner and Melchert (2008) describe Hittite synchronic grammar in extenso. Tischler (2001) 
furnishes a reliable basic dictionary for Hittite. The two major dictionaries for Hittite 
(Friedrich-Kammenhuber-Hoffmann [1975-] and the CHD [1980-]) will take decades to 
complete. Tischler (1977-2016), Puhvel ( 1 984 — ), and Kloekhorst (2008) provide three 
different viewpoints on Hittite etymology. For Hittite texts, editions, bibliography, ono- 
mastics, and much more, one should consult the invaluable online “Hethitologie Portal 
Mainz”: http://www.hethport.uni-wuerzburg.de/HPM/index.html. 

NOTES 

1 In the special case of word-final accented *-om(s), PIE *o does appear as Hitt, /o:/, thus 
ace. sg. comm, ku-u-un = /ko:n/< *kom and comm. ace. pi. ku-u-us = /ko:s/ < *koms: 
cf. Kloekhorst 2008: 54—57. 

2 For the Hieroglyphic Luwian first and second person plural as ending in /-ants/, not 
/-unts/, see Yakubovich 2010: 65-68. 
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CHAPTER 1 


INDO-IRANIAN 

Leonid Kulikov 


INDO-IRANIANS AND THEIR ORIGINS 

Indo-Iranian languages (another, outdated and quite misleading term is “Aryan”), which 
form a major branch of the Indo-European language family, are spoken by more than a 
billion of speakers occupying an immense territory from the Caucasus and southeastern 
Anatolia in the West to Northeast India and Bengal in the East and the Maidive Islands in 
the South. This grouping includes the Indo-Aryan, Iranian, NuristanT (in earlier scholar- 
ship also called Kafir), and Dardic branches. The latter is considered by many scholars as 
part of the Indo-Aryan branch (e.g., Morgenstieme 1961, 1973, Strand 1973, Southworth 
2005: 127, 149). Convincing argumentation for treating the Dardic languages as a sepa- 
rate genetic group, a fourth sub-branch of Indo-Iranian, alongside Indo-Aryan, Iranian, 
and NuristanT, can be found in Kogan 2005. 

All Indo-Iranian languages derive from a common ancestor, the reconstructed 
Proto-Indo-Iranian language that was spoken around the end of the 3rd millennium BCE. 
The speakers of this proto-language, Proto-Indo-Iranians, are commonly identified with 
the Sintashta archaeological culture and its successor, the (early) Andronovo culture, 
located in the vast area between the Southern Ural and the Tian Shan mountains and 
dated to the end of the 3rd millennium BCE to the 1st half of the 2nd millennium BCE. 

The oldest preserved linguistic evidence comes from the Mitanni kingdom, which 
existed between ca. 1500 BCE and 1300 BCE on the territory that is now northern Syria 
and southeastern Anatolia (and thus almost precisely matching with the easternmost end of 
the territory populated by the speakers of the modem Indo-Iranian languages). The main 
language of the kingdom was Hurrian (belonging to the Hurro-Urartian language family, 
probably related to the Northeast Caucasian, or Nakh-Daghestanian, family). However, 
the names of the ruling elite (such as Artassumara [~ Ilr. Arta-smara ‘who thinks of Arta/ 
Rta’] or Biridashva (biridasua, biriiasua) [~ Pntasva ‘whose horse is dear’]), and, particu- 
larly, the names of the gods (Mitra, Varuna, Indra), as well as a number of technical terms 
preserved in a Hittite treatise on horse training (such as aika ‘one’ [cf. Ved. Skr. eka\, 
panza (~ panca) ‘five’), vartana ‘round’), preserve traces of an Indo-Iranian language (or 
perhaps, more precisely, Indo-Aryan, as a few features may point to an Indo-Aryan, rather 
than common Indo-Iranian, character for this hypothetical idiom; see Mayrhofer 1974). 

The actual text documentation starts from the 2nd half of the 2nd millennium BCE (ca. 
1200-1000 BCE) for Indo-Aryan, with the hymns of the Rgveda (RV), the main sacred 
text of the Hinduism, while the text documentation of Iranian begins in the 1st half of the 
1st millennium BCE (Avesta). 

Due to the essentially uncontroversial character of the Proto-Indo-Iranian reconstruc- 
tion, Indo-Iranian remains probably the best-established larger subgrouping within the 
Indo-European family and the only one that is adopted by all Indo-Europeanists (even 
another very likely subgrouping of approximately the same level, Balto-Slavic, is doubted 
by a number of scholars). 
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INDO-IRANIAN PHONOLOGY 

The most important features characterizing Proto-Indo-Iranian as opposed to all or most 

other branches of Indo-European include: 

(i) The merger of the two PIE vowels, e and o, in a, with the preservation of the 
quantity distinctions, i.e., PIE *e, *o > Pllr. *a; PIE *e, *5 > Pllr. *a; cf. Ved. data, 
Av. dasa ‘ten’ (~ Gr. Seica, Lat. decent ) < PIE *dekm; Ved. vSk, Av. vaxs ‘speech’ 
(~ Lat. uox ‘voice’) < PIE *wok w s. The same holds for the vocalic part of the PIE 
diphthongs, which invariably changes to a, i.e., PIE *ey, *oy > Pllr. *ai; etc. The 
only phonological context where the original opposition between two PIE timbres 
leaves traces in the form of a new opposition (in length) is the open non-final syl- 
lable, where the original short *o was lengthened (*o | _CV > *a), while *e was 
not (Brugmann’s Law); cf. the ace. sg. form Ved. dataram ‘giver’, Av. dataram 
‘creator’ (~ Gr. Scoxopa) < PIE *-tor-m as opposed to ace. sg. Ved. pitdmm, Av. 
pitaram ‘father’ (~ Gr. Jiaispa) < PIE *-ter-m. 

(ii) Another source of Pllr. *a is the vocalic allophone of the two nasal sonants, i.e., 
PIE *n, *m > Pllr. *a; cf. Ved. mad- ‘thought’ (~ Lat. mens, gen. sg. mentis ‘mind’) 
< PIE *mn-ti- [*mn-ti] and the words for ‘ten’ mentioned above. 

(iii) The phonological status of *1 was weakened in Indo-Iranian. / changed to r in Ira- 
nian, and in many, but not all, cases in Indo-Aryan. In particular, / always changed 
to r when surfacing as a vocalic allophone; in the language of the RV, words con- 
taining / are very rare, but from the end of the early Vedic period, more forms with / 
start to appear (perhaps originating in one or several hypothetical /-dialects, where 
the original opposition was preserved); cf. RV reh- as opposed to the younger 
Atharvavedic (Paippalada) root variant left- Tick’ < PIE *leyg h -. 

(iv) Proto-Indo-Iranian is very conservative as far as the system of stop consonants 
is concerned, preserving the PIE system virtually intact: the opposition of three 
series - voiceless T, voiced non-aspirate D, and voiced aspirate D h (or glottalized ’D 
and simple voiced D, if we accept the glottalic theory for the PIE reconstruction) - 
remains without changes. Voiceless aspirates (and, within the glottalic theory, 
voiced aspirates, too) must represent Indo-Aryan innovation. 

(v) Palatalizations: Indo-Iranian belongs to the satam languages; that is, the simple 
velar (*k *g *g h ) and labiovelar (*k w *g w *g wh ) PIE series of consonants merged 
together, while the palatovelars were palatalized: *k *g *g h > *c *j *j h ; cf. Ved. 
dasa, Av. dasa ‘ten’ < Pllr. *daca < PIE *dekm (~ Gr. 8 £ku etc.). Another series of 
palatal consonants, Pllr. *c *) *J\ reflects PIE simple velars and labiovelars before 
/e/ and /i/; cf. Ved. ca, Av. -ca ‘and’ < PIE *k w e (~ Lat. que etc.). 

(vi) The only PIE sibilant, *s, is retroflexivized in Pllr. after i, u, r, and A (= all velars); 
cf. Ved. tfsna-, Av. tarsna- ‘thirst’ < PIE *trs-n- (~ Eng. thirsty)', Ved. visa-, Av. 
visa- ‘poison’ < PIE *wis-o- (~ Lat. virus). This phonetic law, known also as the 
RUKI rule, is an isogloss (partly) shared with the Balto-Slavic, Armenian, and 
Albanian branches. 

(vii) Indo-Iranian differs from all other branches in the timbre of the vocalization of the 
PIE laryngeals (in the interconsonantal position): *H > *i (unconditionally in the 
final syllable and occasionally also in other positions); cf. PIE *ph,-ter- ‘father’ > 
Pllr. *p(i)tar- > Ved. pitar-, Av. dat. sg. pi9re,fadroi, OPers. pitar-. 

(viii) Indo-Iranian is the most conservative branch of Indo-European as far as the accen- 
tuation system is concerned: Vedic is the main source of our knowledge about PIE 
accentuation, preserving it, presumably, virtually intact. 
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THE INDO-IRANIAN MORPHOLOGICAL SYSTEM 

From the morphological point of view, Indo-Iranian is also one of the most archaic branches 
of Indo-European, probably preserving the original system of inflectional and derivational 
categories as reconstructable for the proto-language better than any other branch. 

In particular, within the nominal system Indo-Iranian perfectly preserves the three 
numbers (singular, dual, and plural) and eight cases (nominative, accusative, instrumen- 
tal, dative, ablative, genitive, locative, and vocative) as well as the three genders of PIE. 
Only some regularizations in the system of endings in nominal paradigms must represent 
Indo-Iranian innovations, though. These include, for instance, introducing y after b h in the 
dative -ablative dual and plural endings *-b h ya(m) and *-b h yas, respectively (for *-b h 5 and 
*-b h os reconstructable on the basis of evidence from other branches), apparently from the 
instrumental plural *-b h i-s. 

Another probable innovation in the Indo-Iranian pronominal system was adding (pre- 
fixing or suffixing) the particle of remoteness *u/*au to certain forms derived from the 
pronominal stem *a- (cf. Iranian *au-a- and Indo-Aryan *ad-u — » *ad-au by analogy 
with *sau; *am-u-), in order to distinguish between proximal and distal demonstratives, 
presumably due to the fact that some originally distinct pronouns fell together after the 
merger of the two PIE vowels *e and *o in Indo-Iranian (Klein 1977). 

The Indo-Iranian verbal system was also quite archaic. It had five moods: indicative, 
injunctive (very productive in the earliest periods, where it was used as an “ ‘extratem- 
poral’ tense category rather than a special mood” (M. Kiimmel forthcoming/20 17), men- 
tioning an event or activity as a “known fact”), imperative, optative, and subjunctive; 
and it distinguished between the present tense and three past tenses (aorist, imperfect, 
and perfect) in the indicative. The function of the future was largely taken over by the 
subjunctive mood, which denoted the expected action (cf. Tichy 2006). 

Indo-Iranian had two voices, active and middle; at least in the 3 sg. and 3 pi. forms, 
many verbs could distinguish between “(medio-)passive” and “non-passive”. The 
“(medio-)passive” paradigm was differentiated from the non-passive forms partly by 
means of special stem formative morphemes (such as the present passive suffix *-ia-, the 
use of which in this function was an important Indo-Iranian innovation), partly by means 
of specific inflection (see Kiimmel 1996, Kulikov 2006). 

THE INDO-IRANIAN LEXICON 
The inherited IE lexicon 

A large part of the Common Indo-Iranian vocabulary is inherited from PIE, being, at the 
same time, an important source for its reconstruction. This inherited part of the lexicon 
covers all important domains of lexica (see numerous examples above and in the Indo- 
Aryan and Iranian chapters) and, like other parts of the linguistic system, testifies to a 
considerable conservatism of Indo-Iranian. 

Loan-words in the Indo-Iranian lexicon and 
substrate/adstrate influence 

Alongside with the well-established inherited part of the lexicon, there are a consid- 
erable number of words that cannot be traced to PIE but are safely reconstructable for 
Proto-Indo-Iranian on the basis of evidence from both Old Indo-Aryan and (Old) Iranian 
(evidence from Middle and New Iranian may be relevant, as the expected cognates may 
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be lacking from the Old Iranian corpus, which is somewhat more limited as compared to 
Old Indo-Aryan), thus representing loan-words into Proto-Indo-Iranian before its split. 
Such borrowings encompass a variety of cultural domains, such as agricultural and fann- 
ing tenns, animal names (cf. Ved. ustra-, Av. ustra-, OPers. usa- < Pllr. *ustra- ‘camel’; 
Ved. kasyapa-, Av. kasiiapa- < Pllr. *kaciapa- ‘tortoise’), household and building (cf. 
Ved. istaka-, istika-, Av. istiia-, OPers. isti- < Pllr. *ista-, *isti- ‘brick’), irrigation tenns, 
religious tenns, and mythology (Ved. amsu-, Av. qsu- < Pllr. *ancu- ‘Soma plant’; Ved. 
atharvan-, Av. aOrauuan- < Pllr. *at h aruan- ‘(a particular type of) priest’ (? see Pinault 
2006)). 

Candidates for the sources of these bonowings may be cultural languages that were 
spoken in and around the area populated by the Proto-Indo-Iranians, that is, from Mes- 
opotamia to the South Ural, around the 3rd millennium BCE, as well as some isolated 
languages spoken in remote mountain areas, such as Burushaski (see Tikkanen 1988). 
Proto-Uralic could also bonow loan-words from some Indo-Iranian dialects; see, e.g., 
Katz 2003. However, the origin of most of these loan-words is unclear. Quite attractive 
is the theory about the bonowing of the bulk of these Common Indo-Iranian loan-words 
from the unidentified language spoken by the population of the Bactria-Margiana Archae- 
ological Complex that was located immediately to the south of the Andronovo culture’s 
area and had a well-developed urban civilization (see Lubotsky 2001). 

INDO-IRANIAN LANGUAGES IN A DIACHRONIC 
TYPOLOGICAL PERSPECTIVE 

The relevant shared features of the languages belonging to the Indo-Iranian branch are 
not confined to the peculiarities inherited by Proto-Indo-Iranian from PIE and thus are 
shared, foremost, by the oldest representatives of this subgrouping, i.e., by Old Indo- 
Aryan (Vedic/Sanskrit) and Old Iranian (Avestan and Old Persian), but they are only 
partly (if at all) preserved in the modem members of the branch. Of no less importance 
are much more recent features of the group that emerged after the split of Proto-Indo- 
Iranian (or even of Proto-Indo-Aryan and Proto-Iranian) and must be due to the fact that 
many members of the Indo-Iranian branch belong to the same linguistic area (together 
with languages of non-Indo-European affiliation). Specifically, virtually all Indo-Aryan 
and NuristanT languages as well as a few neighboring Iranian languages such as Pashto 
(together with the non-Indo-European Dravidian, Munda, and the isolate Burushaski) 
form the South Asian linguistic area (which is notorious as one of the handbook exam- 
ples of a linguistic area; see Masica 1976). There are also some good reasons to extend 
this area further to the North and East, including such languages or families as Tocharian 
(Indo-European) and Turkic/Altaic into the large Central-South Asian linguistic area 
(see, e.g., Hock 2007). 

The features briefly summarized below thus reveal the same basic type of evolution, 
representing shared innovations due to parallel developments and/or triggered by shared 
influences of neighboring (substrate or adstrate) languages. The list of linguistic features 
given in this section by no means claims to be exhaustive. 

Diachronic trends within the system of transitivity oppositions: 
voice and valency-changing categories 

Considering the system of transitivity oppositions in Indo-European in a diachronic 
typological perspective, one may posit two major evolutionary types attested for the 
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members of this linguistic family (briefly outlined in Comrie 2006; for details, see 
Kulikov 2009; 2012). One major type is located in the West of the Indo-European area 
and includes such branches as Germanic, Romance, Greek, and Slavic. These branches 
share a number of parallel tendencies in the system of encoding transitivity oppositions 
and valency-changing categories that can be qualified, by and large, as instantiating a 
“syncretic” type of evolution: (i) The old syncretic marker encoding a number of valency- 
reducing (intransitivizing) derivations (such as passive, anticausative, and reflexive), the 
middle voice (type of inflection), is either preserved (as in Greek) or replaced with a new 
middle, mostly going back to the PIE reflexive pronoun *s(w)e-. (ii) The PIE morpholog- 
ical causatives with the suffix *-eye/o-, reflexes of which are still relatively well attested 
in Old Germanic (e.g., in Gothic y'an-verbs) and (early) Slavic (/-causatives), virtually 
disappear, (iii) Many languages of this “Western” area attest the emergence and expan- 
sion of labile verbs (i.e., verbs or verbal forms that can show changes in syntactic pattern, 
with no formal change in the verb) of the type John opens the door / The door opens (this 
trend is particularly well attested in the history of English; see, e.g., McWhorter 2002, 
Gelderen 2011). 

By contrast, several other daughter languages, mostly those belonging to some Eastern 
branches of Indo-European, such as, first of all, Indo-Iranian, radically abandon the syn- 
cretic strategy in the system of encoding transitivity oppositions and valency-changing 
categories, developing, by and large, in the opposite, “antisyncretic”, direction. The most 
important diachronic typological features shared by the representatives of the Eastern 
type include the following: (i) The middle voice declines, which amounts to the loss of 
most original (intransitivizing) functions of the PIE middle and the lexicalization of many 
middle forms; subsequently, the active/middle opposition, albeit physically preserved in 
the paradigm, loses a large part of its functional content, thus being degrammaticalized. 
(ii) Instead, most languages of this area develop special markers for several intransitiv- 
izing voices and valency-changing categories. These include, in particular, Indo-Iranian 
and Armenian markers of a morphological passive going back to the PIE suffix *-ye/o- 
and Indo-Iranian reflexive and reciprocal pronouns (cf. Ved. tanu-, Av. tanii- ‘self’ <— 
‘body’; Ved. anvo . . . anya- ( anyo-’nva -, anvonva-), Av. aniio. ainun ‘one another, each 
other’; see Kulikov 2014b). Morphological causatives (in particular, the reflexes of the 
PIE *-eye/o-causative - cf. Vedic and Avestan morphological causatives with the suffixes 
-ava- and -aiia-, respectively - but also other morphological causatives, especially with 
nasal affixes) demonstrate a considerable increase in productivity. This increase in the 
productivity of morphological causatives can be observed during the entire history of 
Indo-Iranian languages and eventually gives rise to the complex systems with two or 
even more causatives (including double causatives with a reiterated causative morpheme, 
as in Marathi and Tajik; see examples in Masica 1976: Ch. 3) in some members of this 
branch, which distinguishes them from other Indo-European groups, (iii) Labile verbs, 
still common in (early) Old Indo-Aryan and Avestan, become rarer; in particular, in Vedic 
this tendency is clearly observable as early as in the post-Rgvedic language; see Kulikov 
2014a. 

Typologically, the antisyncretic (Eastern) type, as attested in Indo-Iranian, shares 
more features with the languages of some non-Indo-European families, such as Turkic 
(or, more generally, Altaic) or Dravidian, than with the languages of several Western 
Indo-European branches. This is demonstrated in Table 4.1, which compares a few 
related features characterizing the main trends in the development of the systems of 
transitivity oppositions for a variety of both Indo-European and non-Indo-European 
language groups. 
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TABLE 4.1 ENCODING TRANSITIVITY OPPOSITIONS: DIACHRONIC 
FEATURES OF SOME LANGUAGE FAMILIES 

TYPOLOGICAL 


middle voice 

morphological valency-changing categories 

labile syntax 

East Caucasian 

0 

+/71 

+ 

Kartvelian 

+ 

+ 

0 

Egyptian/Coptic 

+ 

+/7I 

(+) 

Indo-European 

Germanic 

71 (new) 

^1 

71 

Romance 

71 (new) 

^1 

+/71 

Slavic 

71 (new) 


o/(+) 

Greek 

+ 

o/(+) 

++ 

Indo-Iranian 


71 

SI 

Turkic 

0/^1 

+/71 

0/SI 


0 lacking 

(+) present but rare or non-productive 

+ present 

71 increasing (in productivity, frequency, etc.) 

SI decreasing/disappearing 

Diachronic changes within the case system 

As mentioned above, the Proto-Indo-Iranian morphological system was very conserva- 
tive, perfectly preserving the eight PIE cases, which further remained intact in both main 
branches, Indo-Aryan and Iranian, for more than 1000 years after the split of Proto-Indo- 
Iranian, as documented in Old Indo-Aryan (Vedic) and Old Iranian (Avestan). After the 
end of the Old Indo-Aryan and Old Iranian periods, we observe a dramatic decline of 
the case systems in both subgroups, which thus represents another shared innovation. 
By the end of the Middle Indo-Aryan/Iranian period, the Indo-Iranian languages had lost 
the majority of the conservative Old Indo-Aryan/Iranian case system. Generally, only 
two cases survive, direct (resulting from the merger of nominative and accusative) and 
oblique (mostly going back to the Old Indo-Aryan/Iranian genitive); in some languages 
traces of some other oblique cases, such as the instrumental, locative, or ablative, can still 
be found, such as the Sinhala instrumental case in -ent-in and the Assamese ergative -e, 
both probably reflecting the OIA instrumental singular ending of the -fl-declension, -ena. 
The functions of the lost cases are largely taken over by postpositions of different origin. 

In several New Indo-Aryan and New Iranian languages we observe the grammaticaliza- 
tion of such new postpositions, which are normally added to the oblique case form. Very 
often this grammaticalization results in the amalgamation of a postposition with the oblique 
and, ultimately, in the rise of a new case. Such is, for instance, the origin of the two new case 
endings in Sinhala, viz. dat. -to (< Skt. artha ‘goal, purpose’, used as the final member of 
a nominal compound) and gen. -ge (< Skr. grhe ‘in the house’ = loc. sg. of grha- ‘house’), 
and several case morphemes in Ossetic, such as dat. -cen (< *ana, cf. Av. ana ‘upon, over, 
across’, or *anu, cf. OPers. anuv, Av. anu ‘along, after, according to’) and adessive -bcel 
(Digoron) (< *upari ‘above, upon, on’); for details and discussion, see Kulikov 2011. 

Other shared syntactic innovations 

There are a number of other syntactic innovations shared by many or most Indo-Iranian 
languages of both major subgroups: 

(i) The emergence of the ergative construction, first of all, on the basis of the syn- 
tactic reanalysis of constructions with resultative verbal adjectives (traditionally 
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also called “perfect passive participles”) with the suffix -ta-l-na-, is a syntactic 
innovation shared by several Indo-Aryan and Iranian (foremost Western Iranian) 
languages. This development started during the Middle Indo -Aryan/Iranian period, 
when the agentive noun (the passive agent) of constructions with -ta-/-na-adjectives 
acquired some subject properties (such as the referential control of converbs). In 
fact, however, the origins of the ergative alignment can presumably be dated as 
early as Old Indo-Aryan/Iranian or perhaps Proto-Indo-Iranian. In particular, there 
are some reasons to take Vedic constructions with -ta-/-na-adjectives and the geni- 
tive marking of the agent separately from canonical passives with the instrumental. 
According to P. K. Andersen ( 1 986), the genitive noun displays a number of subject 
properties (usually animate; definite and/or refers to old information) in such con- 
structions, and therefore they should be qualified as ergative rather than passive, 
properly speaking. 

(ii) Constructions with oblique (non-canonical) subjects, quite common in many 
Indo-European branches (in particular, in the Western part of the Indo-European 
area; see, e.g., Barddal et al. 2012), had become relatively rare in the easternmost 
branches (in particular, in Indo-Iranian), being replaced by the canonical (nomina- 
tive) subject pattern. This innovation appears to be shared with some other Eastern 
(antisyncretic) branches of Indo-European, such as Tocharian; see Viti (forthcom- 
ing). Nevertheless, many New Indo-Aryan languages have ultimately developed 
new oblique (non-canonical) subject constructions (see Masica 1991: 346ffi). 

(iii) A remarkable feature of the syntax of many modem Indo-Iranian languages is the 
existence of long chains of converbs in narrative texts (see, e.g., Masica 1976). This 
feature is shared with a number of non-Indo-European languages of South, East, and 
Central Asia (cf. Bickel’s (1998) “Asian type”). 

The origin of the cluster of (diachronic) typological features described above, such as, 
above all, the antisyncretic evolutionary type of encoding transitivity oppositions, the rise 
of new agglutinating case systems, and the decline of constructions with non-canonical 
(oblique) subjects in Old Indo-Aryan, is a difficult problem on its own. It may be partly 
due to the influence of substrate (or adstrate) languages of the Altaic or Dravidian type. 
Further study of these areal features in typological context is required for a better under- 
standing of the mechanisms of these changes in the historical period. 


FURTHER READING 

There is no comprehensive monographic description of Proto-Indo-Iranian and its his- 
torical grammar. However, the relevant chapters of de Gruyter’s forthcoming Handbook 
of Comparative and Historical Indo-European Linguistics, Lubotsky forthcoming/2017, 
Kiimmel forthcoming/2017, and Schmitt forthcoming/2017, give excellent brief surveys 
of the corresponding parts (phonology, morphology, and the lexicon, respectively) of 
the Proto-Indo-Iranian linguistic system. In addition, Goto 2013 offers a comprehensive 
overview of the Indo-Aryan morphological system in a diachronic perspective, tracing 
most of its aspects back to the Indo-Iranian. The reconstructable Indo-Iranian vocab- 
ulary is fairly well represented in the Indo-Aryan etymological dictionary Mayrhofer 
1992-1996. 

There are several works on the South Asian linguistic area that discuss the features 
shared by its members, i.e., virtually all Indo-Aryan, NuristanT, and some Iranian lan- 
guages, starting with such seminal works as Masica 1976 (see also Masica 1991); see 
also, more recently, Southworth 2005 and Hock 2007. 
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CHAPTER 2 


INDO-ARYAN 

Leonid Kulikov 


INTRODUCTION 
General information 

The Indo-Aryan languages (sometimes also referred to, misleadingly and not quite correctly, 
as Indie) represent the largest group of the Indo-European both by the total number of speak- 
ers of the present-day Indo-Aryan languages (approx, one and a half billion of the total 
three billion speakers of Indo-European languages) and by the number of languages (ca. 
225 languages recognized, for instance, by Ethnologue, thus making up more than half of all 
Indo-European languages listed by this source). The largest Indo-Aryan languages include 
Hindi and Urdu (about 240 million speakers), Bengali (about 230 million), Punjabi (about 
110 million), Marathi (about 70 million), Rajasthani (about 50 million) and Gujarati (about 
45 million). At present, Indo-Aryan languages are spoken, above all, on the Indian subconti- 
nent, also referred to as South Asia. This region includes India proper (officially the Republic 
of India), Bangladesh, Pakistan, Nepal, Bhutan, Sri Lanka and the Maldives (Republic of the 
Maldives). Due to labor migration, large groups of speakers of Indo-Aryan languages have 
settled all over the world, and these languages are now spoken in a number of diasporas, in 
particular, in such regions as South-East Asia, Africa, South America (in particular, Guyana 
and Surinam), Australia and Melanesia (in particular Fiji), as well as in the USA and some 
European countries, foremost in the United Kingdom. 

The Indo-Aryan group is also the easternmost branch of the Indo-European languages 
(except for the extinct Tocharian languages, which were spoken still further to the East). 
The Indo-Aryan-speaking peoples probably came to the Indian subcontinent from the 
North-West, entering the greater Punjab (where the oldest Vedic text, the Rgveda, was 
composed) by the end of the first half of the second millennium BC. In that epoch, Pro- 
to-Indo-Aryan was a single language (still very close to Proto-Iranian), which started to 
split into dialects essentially after the arrival in India. 


Chronology of the Indo-Aryan languages and the 
sociolinguistic situation in ancient India 

The most ancient Indo-Aryan language is Vedic (Vedic Sanskrit), characterized as the main 
representative of the Old Indo-Aryan (OIA) linguistic period in the development of the Indo- 
Aryan languages, which is attested from the second millennium BC onwards. Chronologi- 
cally, Vedic can be divided into at least two main periods, early Vedic (also called “mantra 
language”, i.e. the language of the hymns addressed to the Vedic gods, mantras and magic 
spells, and middle/late Vedic (also called “Vedic prose”). The oldest layer of Vedic is attested 
in the language of the Rgveda (RV), which remains the main sacral text of the Hindus and 
which can approximately be dated to the second half of the second millennium BC. Within 
the RV, we can distinguish between the early RV (“family books”, or mandalas, which 
include books II-VII) and the late RV (encompassing, above all, mandalas I and X, as well 
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as a part of book VIII, Valakhilya); books VIII and IX are chronologically rather heteroge- 
neous. The language of the second most ancient text, the Atharvaveda (AV), resembles in 
many respects (and is essentially synchronic with) the language of the late RV. Early Vedic 
is followed by middle and late Vedic (= the language attested in the Brahmanas, Aranyakas, 
oldest Upanisads and Sutras); the post- Vedic period covers younger Upanisads and Sutras, 
as well as Epic and Classical Sanskrit. The term “Sanskrit” is sometimes used as a cover 
term encompassing the idioms of both the Vedic (= Vedic Sanskrit) and post- Vedic periods. 

The absolute chronology of these periods poses serious problems (see, e.g., Witzel 
1995: 97f.), so that we can only afford very rough approximations. Thus, the early Vedic 
period cannot be dated earlier than 1500 BC (and it hardly begins much later than 1200 
BC); the middle Vedic period probably starts after 800 BC; and the post- Vedic period 
must have started somewhere in the second half of the first millennium BC, hardly much 
earlier than 300 BC. 

The Middle Indo-Aryan epoque lasts approximately from 600 BC (the time to which 
the oldest Middle Indie, Pali, texts may reach back) till the end of the first millennium 
AD. From 1000 AD onward, the earliest forms of New Indo-Aryan (NIA) languages, 
such as Old Hindi, Old Bengali etc., are attested. 

The general chronology of the Indo-Aryan languages as well as the main subdivisions 
within the Old and Middle Indo-Aryan periods is shown in Figure 4. 1 . 
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FIGURE 4.1 CHRONOLOGY OF INDO-ARYAN LANGUAGES AND TEXTS 

The sociolinguistic situation in ancient India is a particularly complex issue and 
requires at least a short commentary. Already by the Middle Vedic period, Sanskrit was 
no longer a spoken language but co-existed, as a sacral language, alongside the Middle 
Indo-Aryan (MIA) vernaculars. During the middle and late MIA period, a number of 
languages (or, to be more precise, “forms of speech”) were used in India. In fact, we are 
dealing with triglossia, or even polyglossia: Sanskrit was used in the Hindu sacral con- 
text, in scientific treatises and some literary works; MIA languages (Prakrits) were used 
in poetry and dramatic works as well as in religious (Buddhist and Jainist) texts and in 
epigraphy. Late MIA vernaculars (Apabhramsa Prakrits) found their place in the literary 
tradition as well, while, finally, the colloquial vernaculars, which represented the earliest 
forms of the NIA languages, were employed in everyday life. 

It is important to emphasize that in the course of these developments Sanskrit and 
Prakrits were not replaced and ousted by later varieties (i.e. Sanskrit by Prakrits, Prakrits 
by Apabhramsas, etc.), but moved up vertically into the position of the high/prestigious 
form of speech (as indicated by simple arrows in Figure 4.2), 1 to be imitated by the low 
varieties of speech. 
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FIGURE 4.2 POLYGLOSSIA IN ANCIENT INDIA ( adapted from Bubenik 1998) 


All these languages (or fonns of speech) co-existed with each other. Most importantly, 
while the phonological systems and inventories of morphological forms of OIA and MIA 
languages have been preserved basically intact over the centuries, we can observe numer- 
ous traces of the influence of the spoken MIA and NIA vernaculars in the syntax and 
semantics of forms in the languages of higher rank. In a way, their grammatical systems, 
albeit morphologically stable, were open for syntactic “infection” from below, as indi- 
cated by the vertical dotted arrows in Figure 4.2. This fact is of crucial importance for 
understanding the syntactic developments in the late OIA and MIA texts. 

Another peculiar feature of the sociolinguistic situation in Ancient India is the enor- 
mous authority of the Ancient Indian linguistic tradition, differing in many respects from 
the younger traditions of Europe and, particularly, that of Panini, the author of the famous 
grammatical treatise AstadhyavT (lit. ‘consisting of eight chapters’), dating to the fifth to 
sixth century BC. By now, it has become the communis opinio that the language described 
by Panini (Panini ’s object language) can be roughly identified with middle Vedic Sanskrit 
(also known as the language of the Vedic prose), attested in the Brahmanas, Aranyakas, 
early Upanisads and Sutras. The oldest of these prose texts can probably be dated to the 
middle of the first millennium BC (see Figure 4.1). Flowever, this scheme is imprecise in 
some respects. In particular, we find linguistic phenomena (forms, constructions etc.) that 
are prescribed by Panini’s grammar but are not found in the Vedic corpus. The most plau- 
sible explanation for this mismatch can be found in the peculiar sociolinguistic situation 
in Ancient India, briefly outlined above. Specifically, a number of linguistic phenomena 
described by grammarians did not appear in Vedic texts but existed within the semi-col- 
loquial scholarly discourse of the learned community of Sanskrit scholars (comparable 
to Latin scholarly discourse in medieval Europe); for a discussion of the linguistic status 
of the Paninian Sanskrit, see Kulikov 2013 and Flock 2016. Some such phenomena may 
result from the influence of Middle Indie dialects spoken by Ancient Indian scholars, 
thus representing syntactic and morphological caiques from their native dialects onto the 
Sanskrit grammatical system. 
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Furthermore, we even have some reasons to assume that the rise and rapid development of 
the Paninian prescriptive grammatical tradition was due, foremost, to the fact that Vedic had 
ceased to be a living language and the necessity of its codification had been clearly formu- 
lated by the contemporary scholarly community. This task was particularly pressing in view 
of increasing variation within the (semi-colloquial) idiom - essentially based on Middle 
Vedic Sanskrit, but heavily influenced by Middle Indie dialects - that was used by Ancient 
Indian panditas in their scientific and, to some extent, infonnal discourse. 

Remarks on the genetic classification of (new) Indo-Aryan languages 

The genetic classification of Indo-Aryan languages is a much debated and fairly con- 
troversial issue. Due to a variety of circumstances, such as the peculiar sociolinguis- 
tic situation in Ancient and Medieval India, strong influence from the languages of the 
sacral and cultural tradition (above all, Sanskrit), and strong convergent processes that 
never stopped during the more than three thousand year history of the Indo-Aryan lan- 
guages on the Indian subcontinent, a consistent genetic classification of the modern Indo- 
Aryan languages is virtually impossible. All classifications are mainly based on structural 
(grammatical) features, rather than on phonetic developments, and typically posit four or 
five major subgroups. Thus, according to the notorious Grierson’s division, the eastern 
languages (foremost, Bengali, Oriya and Assamese) share more features with the south- 
ern (Marathi and Sinhala with the closely related Maldivian) and northwestern (Sindhi, 
Lahnda) languages than with two other groups, the “inner”, or central (Western Hindi), 
and the “intermediate” (Eastern Hindi, Panjabi, Nepali etc.), which implies a larger sub- 
division into “outer” (= eastern + southern + northwestern) and “inner” languages. Some 
scholars attempted to trace at least this macro-division back to the dialectal variation 
within late Old Indo-Aryan and MIA periods and consider such features as, in particular, 
the merger of the three Old Indo-Aryan sibilants (s, s, s ) in s as “East- West” (= “outer”) 
innovation. The three- or two-fold macro-division of the Indo-Aryan languages is often 
connected with the hypothesis about two waves of Indo-Aryan migration, where the first 
wave is said to correspond to the outer group. 

This chapter will concentrate on the earliest attested Indo-Aryan language, Vedic 
Sanskrit, which can be roughly identified with Old Indo-Aryan and provides the most 
valuable evidence for Indo-European comparative studies and Indo-European recon- 
struction. Vedic is one of the most ancient attested Indo-European languages, and from 
the structural point of view this languages is considered in many respects as one of the 
closest approximations to the reconstructed Proto-Indo-European language. Vedic and 
post- Vedic Sanskrit are documented with an enormous corpus of literature - larger than 
the corpus of any other Indo-European literary tradition. 

The Vedic corpus includes four main categories of texts, or Vedas, which fulfill dif- 
ferent roles in the Hindu ritual: reciting hymns (re-), pronouncing sacrificial fonnulae 
(yajus-) that accompany performing sacrifices, chanting songs (saman-); the fourth Veda, 
which mainly contains magic spells and is called Atharvaveda was added later to the 
Vedic canon. Each Veda is transmitted in a number of schools, or “branches” (sakhas), 
and is subdivided into four genres of texts, depending on their function within the ritual: 
Samhitas (mantras addressed directly to the gods), Brahmanas and Aranyaka (explana- 
tory texts and commentaries on mantras) and Upanisads (philosophical texts). These four 
categories are followed by a variety of Sutras related to the rituals as well as scientific 
and law texts (sastras). The content of the Vedic and early post- Vedic corpus of texts is 
summarized in Table 4.2. 
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TABLE 4.2 VEDIC TEXTS AND SCHOOLS 



yc- (hymns) 

saman- 

(chants) 

yajus- 

Sakala 

Baskala 

Kauthuma 

JaiminTya 

White YV 

Madhyandina Kanva 

Samhitas 


RV 

sv 

SVJ 

VS(M) 

VSK 

(mantras) 


RVKh. 





Brahmanas 

AB 

KB 

PB 

JB 

SB(M) 

SBK 




SB 




Aranyakas 

AA 

SA 


JUB 

SBM 14.1-8 







SBK 17 


Upanisads 

AiU 

KausU 

ChU 

KenaU 

BAUM 

BAU(K) 



(=AA 


(=JUB 

(= SB 14) 




2.4-6) 


4.18-21) 

IsU(M) 







(= VS 40) 


Srauta- Sutras 

AsSS 

SSS 

LatySS 

JSS 

KatySS 



DrahySS 


Grhya-Sutras AsGS SGS GobhGS JGS ParGS 

KausGS DrahyGS 


ManB, 

SVB 

Pitrmedha- GautPS 

Sutras 

Vyakarana [grammatical treatises]: Panini’s AstadhyayT; Nirukta [etymology]; Jyotisa [astronomy/ 
astrology]; etc. 

Dharma- Sutras 


(Dharma-Sastras) 


GautDhS YajnSmr. 
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(sacrificial formulae) 


atharvan- (magic spells) 



Black YV 


Saunakiya Paippalada 

Taittiriya 

Maitrayani 

Kathaka 

Kapisthala 

TS lm) 

MS (m) 

KS l ”) 

KpS lm> 

AV(S) AVP 

(TS P ), TB 

(MS") 

(KS P ), 

*Kath.-Br. 

(KpS p ) 

GB 

TA 


KathA 



TaiU 

(= TA 7-9) 
MNU 
(= TA 10) 
SvetU 

MaitrU 

KathU 


MundU 

BaudhSS 

VadhS 

BharSS 

ApSS 

HirSS 

VaikhSS 

ManSS 

VarSS 



VaitS 

BaudhGS 

BharGS 

ApGS 

HirGS 

VaikhGS 

AgnivGS 

ManGS 

VarGS 

KathGS 


KausS 

ApM 





BaudhPS 

HirPS 





ApDhS 

BaudhDhS 

HirDhS 

VisnuSmr. 

ManuSmr 





-a 


-a 

2 


u 

T3 


3 


T3 

£ 
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TABLE 4.3 ABBREVIATIONS OF TEXTS (TEXT SIGLA) 

aA 

Aitareya-Aranyaka 

AB 

Aitareya-Brahmana 

AgnivGS 

Agnivesya-Grhya-Sutra 

AitU 

Aitareya-Upanisad 

ApDhS 

Apastamba-Dharma-Sutra 

ApGS 

Apastamba-Grhya- Sutra 

ApM 

Apastamba-Mantrapatha 

ApSS 

Apastamba-Srauta-Sutra 

AsGS 

Asvalayana-Grhya-Sutra 

AsSS 

Asvalayana-Srauta-Sutra 

AV 

Atharvaveda 

AVP 

AV, Paippalada recension 

AVS 

AV, SaunakTya recension 

BaudhDhS 

B audhayana-Dharma- Sutra 

BaudhGS 

Baudhayana-Grhya-Sutra 

BaudhPS 

Baudhayana-Pitrmedha-Sutra 

BaudhSS 

B audhayana- Srauta- Sutra 

BAU(K) 

Brhad-Aranyaka-Upanisad (Kanva recension) 

bAum 

Brhad-Aranyaka-Upanisad, Madhyandina recension 

BharGS 

Bharadvaj a-Grhya-Sutra 

BharPS 

Bharadvaj a-Pitrmedha- Sutra 

BharSS 

Bharadvaj a- Srauta- Sutra 

Br. 

Brahmanas 

ChU 

Chandogya-Upanisad 

DrahyGS 

Drahy ay ana-Grhya- Sutra 

DrahySS 

Drahy ay ana- Srauta- Sutra 

GautDhS 

Gautama-Dharma- Sutra 

GB 

Gopatha-Brahmana 

GobhGS 

Gobhila-Grhya- Sutra 

HirDhS 

Hiranyakesi-Dharma- Sutra 

HirGS 

Hiranyakesi-Grhya-Sutra 

HirPS 

Hiranyakesi-Pitrmedha-Sutra 

HirSS 

Hiranyakesi-Srauta-Sutra 

IsU 

Isa-Upanisad 

JB 

JaiminTya-Brahmana 

JGS 

J aimirny a-Grhya- Sutra 

JSS 

J aiminly a- Srauta- Sutra 

JUB 

JaiminTya-Upanisad-Brahm. 

KathA 

Katha-Aranyaka 

KathGS 

Kathaka-Grhya- Sutra 

Kath-Samk. 

Kathaka-Samkalana 

KathU 

Katha-Upanisad 

KatySS 

Katy ay ana- Srauta- Sutra 

KausGS 

KausTtaka-Grhya- Sutra 

KausU 

Kausltaki-Upanisad 

KausS 

Kausika-Sutra 

KB 

KausTtaki-Brahmana (= Sankhayana-Brahmana) 

KenaU 

Kena-Upanisad 

KpS 

Kapisthala-Katha-Samhita 

KS 

Kathaka(- S amhita) 

LatySS 

Laty ay ana- Srauta- Sutra 

LaugGS 

Laugaksi-Grhya-Sutra 

MaitrU 

Maitri- (MaitrT-), Maitrayana-, MaitrayanTya-Upanisad 

ManB 

Mantra-Brahmana 

ManGS 

Manava-Grhya- Sutra 

ManSS 

Manava-Srauta-Sutra 

ManuSmr. 

Manu-Smrti (= Manava-Dharma-Sastra) 
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MNU 

Maha-Narayana-Upanisad 

MS 

MaitrayanT Samhita 

MundU 

Mundaka-Upanisad 

NarSmr. 

Narada-Smrti 

Pan. 

Panini (AstadhyayT) 

ParGS 

Paraskara-Grhya-Sutra 

PB 

Pancavimsa-Brahmana (= Tandy amaha-Brahmana) 

PrasU 

Prasna-Upanisad 

Ram. 

Ramayana 

RV 

Rgveda 

RVKh. 

Rgveda-Khilani 

Su. 

Sutra(s) 

SUB 

S amhitopani sad-Brahmana 

SV 

Samaveda (Kauthuma rec.) 

SVB 

Samavidhana-Brahmana 

SVJ 

Samaveda, Jaiminlya rec. 

SA 

S ankhayana- Aranyaka 

SB(M) 

Satapatha-Brahmana (Madhyandina recension) 

SBK 

Satapatha-Brahmana, Kanva recension 

SGS 

Sankhayana-Grhya-Sutra 

SSS 

S ankhayana- Srauta- Sutra 

SvetU 

Svetasvatara-Upanisad 

SB 

Sadvimsa-Brahmana 

TA 

TaittirTya- Aranyaka 

TaiU 

TaittirTya-Upanisad 

TB 

TaittirTya-Brahmana 

TS 

TaittirTya- S amhita 

VadhS 

Vadhula- Sutra 

VaikhDhS 

Vaikhanasa-Dharma-Sutra 

VaikhGS 

Vaikhanasa-Grhya-Sutra 

VaikhSS 

Vaikhanasa- Srauta- Sutra 

VaitS 

Vaitana- Sutra 

VarGS 

V araha-Grhya-Sutra 

VarSS 

V araha- Srauta- Sutra 

VasDhS 

Vasistha-Dharma-Sutra 

VisnuSmr. 

Visnu-Smrti 

VSK 

Vajasaneyi- Samhita, Kanva recension 

VS(M) 

Vajasaneyi- Samhita (Madhyandina recension) 

Y ajnSmr. 

Y ajnavalkya-Smrti 

YV 

Yajurveda(- Samhita) (= VS(K), MS, KS, KpS, TS) 


PHONOLOGY 

The phonological system of Old Indo- Aryan 

The phonological system of Vedic includes only two vowels proper, a and a, which dif- 
fered not only in length but also in timbre (the long vowel was a back vowel). Other 
monophthong items represent either vocalic allophones of sonants (y/i, rlr etc.; the con- 
trast between v and u is phonemic, not allophonic, however) or former diphthongs ( e , o, 
phonetically long and probably still preserving the diphthong pronunciation in early Old 
Indo-Aryan). The vowel system can be summarized as follows: 


Vowels proper: 


a a 
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Diphthongs: 

e o 
ai au 


Sonants that have vocalic variants: 

v ~ u u 
y / i I 

r / r (r) 

1 /( 1 ) 

The consonant system includes plosives (voiceless, voiceless aspirates, voiced and voiced 
aspirates) and nasals (some of which have weak or no phonemic status), organized by five 
places of articulation, as well as three sibilants and the voiced (pharyngeal) h\ 

TABLE 4.4 OLD INDO-ARYAN CONSONANTISM 


Place of articulation 


pharyngeal 

(morpho- 

phonological 

symbols) 

labial 

dental 

retroflex 

palatal 

velar 

laryngeal 

Voiceless stops 

(T) 

p 

t 

t 

c 

k 


Voiceless aspirates 

(Th) 

P h 

th 

th 

eh 

kh 


Voiced stops 

(D) 

b 

d 

d/J 

j 

g 


Voiced aspirates 

(Dh) 

bh 

dh 

dh 

Gh) 

gh 


Nasals 

(N) 

m 

n 

n 


(A) 


Voiceless fricatives 

(S) 


s 

s 

s 



Voiced fricatives 







h 


Accentuation 

A number of the early and middle Vedic texts mark accents. In particular, both early Vedic 
texts, RV and AV (in the SaunakTya recension), as well as a number of early Samhita 
prose texts (TaittirTya and Maitrayam) and early Brahmanas (TaittirTya, SB), are pre- 
served in accented form. 

Phonetically, three tones (degrees of pitch) can be distinguished, high (udatta') [= 
principal accent], middle (svarita') and low (anudatta). Phonologically, we only have 
an opposition between the vowels (or syllables) that bear the principal accent, or the 
high pitch, and those which bear no principal accent (= unaccented), i.e. middle or low 
pitch. A word-form can have only one principal accent (or high pitch), except for some 
compounds and tavdi-infinitives. The placement of middle and low pitches is determined 
automatically within the sequence of words forming metrical units (pada): (i) all vowels/ 
syllables that have no principal accent bear the low pitch (anudatta); and (ii) the vowel/ 
syllable that immediately follows udatta receives the middle pitch (svarita), unless the 
following syllable bears high pitch (udatta). 

The European tradition of the transcription of Vedic texts marks only the principal 
accent (high pitch), 2 using the acute accent mark ( a etc.). Svarita is marked, by using the 
gravis accent mark (a etc.), only if the vowel that bears the preceding udatta is not writ- 
ten as the full vowel as well as in the case of “independent svarita” (due to a particular 
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external sandhi). By contrast, the DevanagarT notation accepted for the RV, AV and Tait- 
tirlya texts marks only udatta and svarita. 

There are certain historically conditioned (essentially going back to the Proto-Indo- 
European morphophonological system) correlations between the ablaut grade and the 
placement of the accent. In particular, the first morpheme that shows a non-zero grade 
typically bears the accent. Yet numerous exceptions and analogical developments make 
these two parameters essentially independent from the synchronic point of view. 

A number of word-forms bear no principal accent. These include (i) finite verbal forms, 
unless they are employed at the beginning of a sentence and/or pada (= minimal metrical 
unit), or in a subordinate clause; (ii) vocatives, except at the beginning of a sentence and / 
or pada; and (iii) clitics. 

History of common Indo-Aryan phonemes 
Vowels and diphthongs 

The vowel system of Common Indo-Aryan is almost identical to that of Proto-Indo- 
Iranian and reproduces its main distinctive features: the loss of most quality distinctions 
(e/o), as opposed to the preservation of most quantity (length) distinctions. The main 
innovations in Indo-Aryan (partly shared with Iranian) include (i) the merger of all proper 
Proto-Indo-European vowels (that is, e and 6) as well as the vocalic allophones of the 
nasal sonants in a; the same holds for its long pendants and vocalic elements of diph- 
thongs; (ii) monophthongization of diphthongs (Vi > e, Vu> o ); and (iii) merger of the 
vocalic variants of Hy and of all laryngeals in i. 

The history of the Common/Old Indo-Aryan vowels and diphthongs can be illustrated 
with the following examples: 

a < *e, *o, *m, *n 

dasa ‘ten’ < *dekm, cf. Gr. Sera, Lat. decern 
marta- ‘mortal, man’ < *mor-to-, cf. Gr. (Kallimachos) popxol pi. 
ajra- ‘field’ < *h 2 eg-ro-, cf. Gr. dypo<;, Lat. ager, Goth, ah's 
asthi- n. < *h,estH- ‘bone’, cf. Hitt. Ihastai-I, Gr. ooieov, YAv. ast- n. ‘bone, body’ 
mad- f. ‘thought’ < *mn-ti-, cf. Lat. mens, mends f. ‘mind’, Lith. minds f. 
a-(putra-) ‘without (a son)’ < *n- adj., cf. YAv. a-pu&ra-, Gr. a-fiso q adj. ‘without a 
god’, Lat. in-, Goth, un- 
it < *e, *6, *o (| _CV, _#), *eH, *oH, *mH, *nH 

mata ‘mother’ < *meh 2 te(r), cf. Av. mata, Gr. parqp, Lat. mater 
vdk ‘speech, voice’ < *wok w s, cf. OAv. vaxs, Lat. udx 
janu- ‘knee’ < *gonu-, cf. YAv. zanu °, Gr. yovu 

The only trace of the original Proto-Indo-European *e/o distinction can be 
found in the open non-final syllable, where PIE *o is lengthened, in accor- 
dance with Brugmann’s Law, which operated as early as Indo-Iranian, cf. 1 sg. 
perf. jagama < *g w e-g w om-h 2 e (cf. Gr. pspova) vs. long vowel in 3 sg. perf. 
jagdma < *g w e-g w om-e (cf. Gr. pepovs); or in causatives lik ejanayati ‘to beget’ 

(< *gonh 2 -eye-), not **jdnayati\) 
jata-‘ bom’ < *gnH-to-, cf. Lat. natus 
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i < *i, *H/a (ir < rll | _V) 

i-hi ‘to go’, 2 sg. imp. < *h 1 i-d h i, cf. OCS idi, Gk. !0i 

pitar- ‘father’ < *ph 2 te(r), gen. *ph 2 tr-es, *-os, cf. Gr. 3iaif|p, Jiaipot;; Lat .pater, -tris; 
Goth .fadar 

-mahi (sec. ending 1 pi. mid.) < *-med h h 2 , cf. Gr. -peSa 

-tira ‘to pass’ 2 sg. imp. act. < *-trh,-e-, cf. LHitt. tar-ra-at-ta (<*terh r o-to-) ‘to be 
able’; Lat. trails (<*trh,-nt-) 


T < *iH, *IJ/o (Tr < *fH, *JH | C) 

jiva- ‘alive, living’ < *g w ih 3 -wo-, cf. Lat. uTuus, OCS zivb, Lith. gyvas 
pu-n-T-mas ‘to cleanse’ 1 pi. pres. < *pu-n-H-mes 
a-tant ‘to pass’ 3 sg. sigm. aor. < *e-terh 2 -s- 
tirna- ‘to pass’ perf. ptcp. < *-trh,-no- 


u <* u (ur < *rH | _V) 

vasmi, usmasi ‘to wish’, 1 sg., 1 pi. pres. < *wek-mi, *uk-mes, cf. OAv. vasami, 
usSmahl /vasmi, usmahi/, Hitt. 1 sg. u-e-ek-mi 
sphurati ‘kicks’ < *sp <h) rH-e-, cf. Lat. sperno ‘to dissociate, reject, spurn’, Ole. sperna 
‘to kick out with the feet’ 


u < Gill (ur < rll, *IH | C) 

mus- ‘mouse’ < *muHs-, cf. Gr. puq; Lat. mils', OHG mus; Russ, mvsb 
purna- ‘full’ < *plH-no-, cf. Goth .fulls, Lith . pi bias, OCS plbin, Av. psrana- 

r < *r, *1 

mrta- ‘died, dead’ < *mr-to-, cf. Lat. mortuus 
vrka- ‘wolf’ < *wlk w o-, cf. Goth, wulfs, Lith. vilkas, ORuss. vblkb 
r arises from r analogically, cf. pitrn ‘father’, ace. pi. (*ph 2 tr-ns) in analogy with deva- 
‘god’ : ace. pi. devan 

e < *ey, *oy 

eti ‘to go’, 3 sg. pres. < *h[eyti, cf. Gr. elai 

cetavati ‘to reveal’ < *k w oyt-eye-, cf. Goth, haidu- (< *k ,wl oyt-u- or *k <w) oy-tu-) 
‘manner’ 


ai < *ey, *oy 

(vfk-)aih ending instr. pi. o-decl. < *-oys, cf. Lith. vilk-ais, Lat. lup-Ts 
o < *ew, *ow 

djas- ‘strength, power’ < *h,ewg-es-, cf. Lat. augustus ‘elevated’; Lith. augestis 
‘growth’ 
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bodhati ‘to perceive’ < *b h ewd h -e-, cf. Gr. reeuSopai ‘to learn, to hear’ 
bodhayati ‘to make awake’ (caus.) < *b h owd h -eye-ti, cf. OCS buditi 

au < *ew, *ow 

dyauh ‘heaven, Father Sky’ nom. sg. < *dyew-s, cf. Gr. Zebq, Lat. dius 
gauh nom. sg. ‘cow’ < *g w 5w-s, cf. Gr. (3ouq 


Sonants 

The system of sonants is quite conservative, except for the weakening of the phonemic 
status of / (completely lost in Iranian, but partly preserved in Indo-Aryan), which merges 
with r in many cases. 


y < *i/y | _V (allophonic variant of i) 

yuvan- ‘young, young hero’ < *h 2 yu-h,on-, cf. Arm. yavanak/vovanak ‘young (of ani- 
mal)’; Lat. iuvenis 


r < *r, *1 

bhru- ‘(eye)brow’ < *h 3 b h ruH-, cf. Gr. 6(ppu<;; OEng. bru; Lith. bruvis; OCS bnv b 
mriya-te ‘die’ < *mr-ye-, cf. Lat. morior 

sravas- ‘fame’ < *klewes-, cf. Gr. Kkioq ‘call, fame’, OCS slovo ‘word’ 

/ < *1, (*r) 

lubhyati ‘to be in disorder’ < *lub h -, cf. OCS ljubiti ‘to love’ 
reh-Heh- (RV)/(AVP+) ‘to lick’ < *leyg h -, cf. OCS lizati 


v < *u/w |_V 

vak ‘speech, voice’ < *wok w s, cf. OAv. vaxs, Lat. udx 


Plosives 

The system of sonants and plosive consonants is both conservative, preserving the main 
contrasts of the Proto-Indo-European consonantism, and innovative, exhibiting a number 
of additions to the original system. The most important innovations include (i) the rise 
of voiceless aspirates from the combination of voiceless stops and laryngeals (*TH > 
T /?); (ii) the rise of retroflex consonants, foremost on the basis of retroflexivization of 5 
after r, k and vowels distinct from a (see also the section on sandhi below), which further 
retroflexivizes adjacent dental stops; retroflexivization of n after r; and the development 
of final *-ks > t; (iii) two palatalizations, which result, in particular, in the merger of the 
Proto-Indo-European voiced palatal gutturals (*g and *g h ) with simple voiced gutturals 
before front vowels (e, f) in j and h, respectively; notice that these two types of sources 
can still be distinguished morphophonologically; that is, they yield distinct reflexes (y 2 
and h 2 as opposed to j\ and h { ) in sandhi. 
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The history of the Common/Old Indo-Aryan consonants is summarized and illustrated 
with examples below: 


VELARS 

k < *k < " ) | 

kad nom./acc. sg. n. interrog. pron. < *k w od, cf. Lat. quod, OHG hvaz 

loka- m. ‘free/light space, world’ < *lewk-o-, cf. Gr. ksuKoq Tight, white, bright’, Lat. 
lux f. Tight’ 

kh < ' k "'H | _V r ,^, 1 

sakha- ‘branch, twig’ < *kok["]-h,-, cf. Goth, hoha ‘plough’, Lith. saka ‘twig’, ORuss. 
soxa ‘(wooden) plough’ 

«< 1 g w L x /.Ui 

gam ace. sg. ‘cow’ < *g w om, cf. OAv. gqm, Gr. (3djv, Umbr. bum 

g h < * g <")h | 

ghnanti ‘to slay’ 3 pi. pres. < *g wh nenti, cf. Hitt, ku-na-an-zi 

« < *!» I _C ve lar 

The velar nasal emerges only through sandhi before velar plosives, and the following 
example represents the only grammatical context where this sound has phonemic 
status: pran nom. sg. m. of the adjective pranc- ‘directed forwards’ <— prank-s <— 
pranc-s (sandhi) 


PALATALS 

c < *k (w) | _ *e, *i 

ca ‘and’ < *k w e, cf. Gr. te, Lat. que 

cakra- ‘wheel’ < *k w e-k w l-o-, cf. Gr. kukAo^ ‘circle, ring, wheel’; Toch. A kukal ‘cart’ 

ch < *sk | _ *5, *i/y 

chid- ‘to split, break’ < *skid-, cf. YAv. sic Gr. ax^co ‘to split, cut’, Lat. scindo ‘to 
cut open’ 

gacchati pres, stem ‘to go’ < *g w m-ske-, cf. YA v.jasaiti 3 sg. pres., Gr. Poigks 2 sg. 
imp. act. ‘go!’ 

y ( i) < *g <w) I _ *e, *i (/', = \jlglk}) 

jiva- ‘alive, living’ < *g w ih r wo-, cf. Lat. uTuus, OCS zivb, Lith. gyvas 

vuj- ‘to connect, yoke’, vuj- ‘connected, yoked; companion’ < *Hyug- ‘to yoke, join’ 
(e.g. in dat.sg./inf. yuj-e < *Hyug-ei, ins.sg. vuj-a < *Hyug-eh,) (: yuk-ta- perf. 
ptcp. < *Hyug-to-), cf. Gr. ^euyvuiii, Lat. iungere, Lith .jiingti, OCS igo ‘yoke’ 

7(2) < *g Hi = {//■«■}) 

janu- ‘knee’ < *gonu-, cf. YAv. zanu °, Gr. yovu 

yaj- ‘to worship, sacrifice’ < *Hyag- (cf. ptcp. perf. is-td- < *Hig-to-), cf. Gr. a^opai 
‘to honour’, Gr. ayioq ‘sacred, consecrated’ 



INDO-ARYAN 227 


jh emerges only through borrowing or in onomatopoeia, as in jajjhat- pres. ptcp. 
‘laughing’ 

[»] < n \_ _Cp a i ata i, ii is a non-phonemic (allophonic) variant of n, cf. yu-h-j-anti ‘to yoke, 
join’ 3 pi. pres. < *Hyu-n-g-nti 


RETROFLEX 

s < *s | *i, *u, *r, *k_ 
s < *E, *g |_*t 

araiksam ‘to leave’ lsg. aor. < *(-leyk w -)s-, cf. Gr. £Leuv|/cx 

visa- ‘venom, poison’ < *wis-, cf. Gr. loq, Lat. ulrus 

jus- ‘to enjoy’ < *gus-, cf. Lat. gusto ‘to taste’, Goth, ga-kiusan ‘to test’ 

varsman- ‘height, peak, top’ < *wers-men-, cf. Lith. virsiis, OCS vrbXb 

vas-ti ‘to wish, desire’ 3 sg. pres. < *wek-ti, cf. Gr. ekcov ‘voluntary’, Hitt, /uektsi/ 

t<t | s_, *Ks | _# 

justa- < *gus-to- ‘enjoy’ perf. ptcp. 

vit (vis-) ‘settlement, community’, nom.sg. < *wik-s, cf. OCS vbSb ‘village’, Gr. oiko<; 
‘house, dwelling’ 

th < *tH | s_V 

tistha- ‘stand’, pres. < *sti-sth 2 -e-, cf. Gr. icnr|pi ‘I place’ 
d < *z < *s | _D (h) 

viprud-bhih < *-prus-b h -. . . ‘drop’ ( vi-prus -) 

[/] is an allophonic variant of d that emerges only in the intervocalic position (| V_V) 
in the dialect of the RV, cf. RV lie = ( Tde in other texts) ‘I worship’ 1 sg. pres. 

(V)dh < *(V)zdh < *(V)sdh 

udha- ‘carry’ perf. ptcp. (V vah) *ug h -to- (> *ug h -d h o-) 

>u da- < *ni-sdo- ‘nest’, cf. Lat. nidus, Eng. nest 

n < *n | s, r (. . .)_ 

Notice that the retroflexivization of n is blocked by dental, retroflex and palatal con- 
sonants in (. . .), 

cf. tlrna- ‘to pass’ perf. ptcp. < *trh,-no- 
varsman-am ‘height, peak, top’ < *wers-men- 

grhyamdna- (*grb h -ye-mh,no-) ‘to seize’ pres. pass. ptcp. but not rajandm ‘king’ ace. 
sg.; prsthena ‘back’ instr. sg.; etc. 


DENTALS 

t<*t 

tri- ‘three’ < *tri-, cf. Gr. ipfiiq, Lat. tres (nom. pi.), Goth .prins (ace. pi. m./fl), OCS 
tri (f.) 

th < til | V 

prthu- ‘broad, wide’ < *plth,-u-, cf. Gr. jiL(xt6<;, Lith . platiis 
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d<* d 

pad- ‘foot’ < *ped-/*pod-, cf. Gr. (Dor.) tic bq, noSoc; (gen. sg.), OEng .fot (nom. pi), 
Lat. ped-is (gen. sg.) 

dlt < *d h 

dadha-ti ‘to put, place’ 3 sg. pres. < *d h e-d h eh r ti, cf. Gr. t!9t|(-|.u) 
n < *n 

nava- ‘new, young’ < *newo-, cf. Hitt, neua-; Gr. veoq; Gr. (Myc.) ne-wo; Lat. nouus; 
OCS novb 


LABIALS 
P < *P 

pitar- ‘father’< *ph,ter-, nom. sg. pita < *ph 2 te(r), cf. Gr. natfip, Lat. pater, Goth .fadar 
ph < *pH | V 

phena- ‘foam’ < *pHoyno- (?), cf. OCS pena 

b<* b 

bala- ‘power, strength’ < *bel-o-, cf. Lat. de-bilis ‘without strength’, Gr. Pe^xepoi; 
‘better’; OCS bolijb ‘bigger’ 

bit < *b h 

bhuj- ‘to enjoy, consume’ < *b h ewg-, cf. OAv. buj- f. ‘expiation’, Lat./«;?g07' ‘to enjoy, 
suffer, get rid of’ 

bhrii- ‘(eye)brow’ < *h,b h ruH-, cf. Gr. ocpput;; OEng. brii; Lith. bruvis; OCS bnvb 

tit < *m 

mats ‘mother’ < *meh 2 te(r), cf. Av. mat a, Gr. (Dor.) pdxr|p, Lat. mater 


SIBILANTS and h 

s<* s 

as, asti 3 sg. act. ‘to be’ < *h,es-, cf. Gr. eon, eaxi; Hitt. e-es-zi\ Lat. est; Goth, isl 


s< *k 

dasa ‘ten’ < *deitm, cf. Gr. SeKa, Lat. decent 

svan- ‘dog’ < *itwon-/*Kun-, cf. Ann. sown, Gr. kucov, icuvoq, OIr. con (gen. sg.), Lith. 
suo, Hitt, kuuan/kun- ‘dog-man’, Ole. hun-d-r 

h m < *g (w)h | _ *e, *i 

hand 3 pi. pres, ‘to slay’ < *g wh en-ti 

dahati ‘to burn’ 3 sg. pres. < *d h eg wh -e-ti (: dagdha- ptcp. perf. < *d h eg wh -to-), cf. Lith. 
degii, Lat. foued ‘to make warm’ 

lt(D < *g h (also < d h , mostly in | V^^V; rarely < b h ) 
hasta ‘hand’ < *g h es-to-, cf. Av. zasta-; Lith. pa-zastls ‘annpit’ 
aham ‘I’ < *h,egH-om, cf. Av. azSm, OPers. adam, OCS azb, cf. Gr. eyco, Lat. egd < 
*h,eg-oH 
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vahati ‘to move, carry, drive’ 3 sg. pres. < *weg h -e-ti (: udhd- ptcp. perf. < *uz h -d h o- < 
*ug h -d h o- < *ug h -to-), cf. Av. vazaiti, Lat. uehd, -ere. Ole. vega, OHG wegan, OCS vezg 
iha ‘here’ < *id h a, cf. Pali idha, LAv. iSa; cf. also ihi ‘to go’, 2 sg. imp. discussed on 
p. 224. 

Note that the voiced sibilants z (existing in Avestan) and *z disappeared in 
Proto-Indo-Aryan. 

To conclude this section, one should mention two important phonetic laws that are 
important for the development of aspirates in Old Indian: 

Bartholomae’s Law (D h +T > D h D\ D h +s > D h z h , also where the consonants are sep- 
arated by s or a laryngeal) is responsible for the shift of aspiration to the second 
member of a consonant cluster and its subsequent voicing, cf. buddha- ‘awaken’ 
< *b h ud h -to-; dpi gdha 3 sg. inj. mid. ‘to devour’ < Pllr. *-g h zd h a < PIE *g h s-to; 
duhitar- ‘daughter’ < *d h ug h Htar- < PIE *d h ugh 2 ter- (OAv. dugddar-). 

Grassmann’s Law (D h . . . D h > D . . . D h | . . . < 1 syllable) accounts for the loss of the 
aspiration of the first aspirate in case of a sequence of aspirates separated by no more 
than one syllable, cf. dugh- ‘to give milk’ < PIE *d h ewg h - (cf. Goth, daug ‘to be 
good for smth., fit’); da-dha-ti ‘to put, place’ 3 sg. pres. < PIE *d h e-d h eh 1 -ti (cf. Gr. 
t(9t|(-|.u)); bodh-a-ti ‘to perceive’ 3 sg. pres. < PIE *b h ewd h -e-ti (cf. Gr. nsuHopai ‘to 
learn, to hear’). 


MORPHOPHONOLOGY 

Morphotactics 

Sanskrit is notorious for its extremely rich system of processes that occur at morpheme 
or word boundaries that are called, in accordance with Indian tradition, sandhi (a Sanskrit 
term, literally meaning ‘together-posing, joining’, which has been borrowed into Euro- 
pean linguistic terminology). Some of these processes have an allophonic nature (such as, 
for instance, palatalization of the dental nasal before palatals) and thus can be described 
in terms of phonological rules, but many others represent phonological, non-automatic 
phenomena and thus belong to the domain of morphophonology. The main two types of 
sandhi include internal and external sandhi. External sandhis operate on the boundaries 
between words (= free morphemes), as well as, normally, (i) between members of com- 
pounds, (ii) between preverbs/prefixes and roots, and (iii) between nominal stems and the 
following endings of the dual and plural: three plural/dual case endings beginning with 
bh-, i.e. -bhis, -bhvas and -bhyam and the loc. pi. ending -su. Internal sandhis operate on 
the boundaries between (bound) morphemes. 

Typical examples of internal sandhi include, for instance: 

i) CjC, . . . C n — > C, |_# (all members of the final consonant cluster are dropped except 
for the first one), cf. pacant-s — » paean (act. ptcp. (nom. sg. m.) of pac ‘cook’); 
a-chand-s-t — > achan (3 sg. aor. act. of chand ‘appear’); 

ii) devoicing and deaspiration of stops before .? or in auslaut: T(h), D(h) — > T |_#, 
[except for palatals: T(h), D(h) ^ c,j], cf. labh-svate — > lapsvate (3 sg. fut. mid. of 
labh ‘obtain’); vudh-s — > vut ‘fighter’ (nom. sg ,);yudh — > yut ‘O fighter!’ (voc. sg.); 

iii) retroflexivization of s after r, k and vowels distinct from a (see 2.3.2 above), also 
known as the “RUKI rule”: s — > s | k, r, V f =_ (not _#, r = not in auslaut or before r), 
cf. a-raik-s-am — > araiksam, a-kar-s-am — > dkdrsam,juho-si — > juho-si'. 
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iv) assimilation to the following plosive in the way of articulation (i.e. T(h), D + 1 —> Tt; 
T(h), D + th — > Tth; T(h), D + dh — > D dh), accompanied by progressive retroflexiv- 
ization (retroflex consonants retroflexivize the following stop), which can be conve- 
niently presented in table fonn (Table 4.5; adopted from Zaliznjak 1978/1987): 


TABLE 4.5 ASSIMILATION IN CONSONANT CLUSTERS 


BEFORE DENTAL STOPS 



+ 1 

+ th 

+ dh 

K kh, g, c,j I + 

kt 

kth 

gdh 

/, th,d + 

tt 

Uh 

ddh 

t, th, d + 

tt 

tth 

ddh 

p,ph, b + 

pt 

pth 

bdh 

s, s, ch,j 2 + 

sf 

sfh 

ddh 

5 + 

St 

sth 

dh 


Cf. yoj\-tum — > yoktum (inf. of yuj ‘yoke, join’); vac-tum — » vaktum (inf. of vac ‘say’); 
prach-tum — > prastum (inf. of prach ‘ask’); mrj 2 -tha — > mrstha (2 pi. pres. act. of mrj ‘adorn, 
anoint’); mrj 2 -dhvam — » mrddhvam (2 pi. pres. act. of mrj ‘adorn, anoint’); as-dhve — > adhve 
(2 pi. pres. mid. of as ‘to sit’); but cf. also as-dhi — » edhi (2 pi. imp. of as ‘to be’). 

External sandhi (many of which - especially the vowel sandhi - are optional in early 
Vedic) can be illustrated, for instance, by the following rules: 

i) merger (contraction) of two homorganic vowels (optional in early Vedic, which is 
shown by a bracketed arrow): -V + V — » -V-, cf. iha asti — » ihasti ‘here is’; su uktam 
— > suktam ‘well-said’; 

ii) monophthongization: 


-a + T — > -e- (cf. iha iha — > iheha, pitaiva — » piteva Tike father’) 
and 

-a + ii — > -o- (cf. a uhha — > obha ‘to both’); 

-as + a — > -o cf. devils atha — » devo ’tha ‘and the god’. 

Ablaut alternation 

Most morphemes may appear in one of the three alternation grades (traditionally called 
“ablaut” grades): (i) zero, or weak; (ii) full, or normal (guna “quality” in Indian tradi- 
tion); and (iii) long (vrddhi ‘increasing’). Historically, this alternation goes back to the 
Proto-Indo-European ablaut, *zero/*e/*o/*e/*5. Thus, for roots with the structure CaC, 
the alternation between the zero, full and long grades will manifest as 0 laid (e.g. pt-lpat- 
/pat- ‘fly’). The alternating vowel can be followed by a sonant (i/y, u/v etc.), which can be 
realized in different ways, depending on the phonological context. Altogether, this results 
in a rich allomorphy, such as i- el ay -ail ay (e.g . ji-lje-ljay-ljai-ljay- ‘win’) etc. 

The choice of the alternation grade is determined by the grammatical form. Next to 
the fonns that require one of the three “traditional” grades listed above, there are a num- 
ber of grammatical positions (causative stem, 3 sg. perf. act.), where some morphemes 
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show the full grade, while others have the long grade. In Indo-Iranian, e and o merge 
in a, but o changed to a in open syllables, according to Brugmann’s Law. Thus, the 
original ablaut is reflected in Vedic as the zero lala + ala opposition. Accordingly, for 
descriptive purposes it seems appropriate to posit (iv) the fourth (“Brugmann’s”) grade 
for such grammatical contexts as causative stem or 3 sg. perf. act. 

Table 4.6 presents the main alternation grades (in a somewhat simplified form; for 
detailed rules that describe the choice of the alternation grade depending on the phono- 
logical context, see Zaliznjak 1975): 


TABLE 4.6 THE MAIN ALTERNATION GRADES 



a(C) a( CC) 

>(V ) 

K O 

«(V) 

u(C) . 

■ ■ T ;■( C) . 

. . an 

an( C) 

am(W) . . . 

zero 

- 

- 

i(y) 

i 

u(v) 

U 

. . r r 

. . aln 

aln 

a(m)/m . . . 

(guna) full 

a 

A 

av 

e 

av 

0 

. . ar ar 

. . an 

an 

am 

(Brugmann’s grade) 

a 

A 

ay 

e 

av 

0 

. . dr ar 

. . an 

an 

dm 

(vrddhi) long 

a 

a 

ay 

ai 

av 

au 

.. dr dr 

. . an 

an 

dm 



«(C) 




id C) 


u 



am 


zero 

a-pa-pt-at 

rik-ta- 

sru-ta-, sruv-at 

ya-ta-, yam-et 

full 

pat-ati 

a-ri-rec-Tt 

sro-syati 

ydm-itavai 

(Brugmann’s) 

pat-ayati 

ri-rec-a 

srav-ayati 

yam-ayati 

long 

- 

d-raik 

a-srau-slt 

a-yam-s-am 


pat ‘fly’ 

ric ‘leave’ 

sru ‘hear’ 

yam ‘stretch’ 






r 

r(Q 

-an- 

-ant- 


zero 

kf-td- 

dfs-ta- 

raj-a-su, raj-h-e 

car-at-su 

full 

a-kar-at 

a-dars-am 

raj-an-i 

car-an(t) 

(Brugmann’s) 

ca-kdr-a 

da-dars-a 

raj-dn-am 

car-ant-am 

long 

a-kar-sam 
kr ‘make’ 

(a-drdk) 

drs ‘see’ 

-an- nom.suff. 

-ant- ptcp.suff. 


Illustrations 

A: 

pat ‘fall, fly’: 3 sg. redupl. (non-caus.) aor. act. a-pa-pt-at, 3 sg. pres. I act. pat-ati, 3 
sg. caus. pres. act. pat-ayati 

tap ‘heat’: 3 sg. pres. I act. tap-ati, 3 sg. caus. pres. act. tap-avati, 3 sg. sigm. aor. act. 
a-tap-sTt 

yaj ‘sacrifice’: ptcp. perf. pass, is-ta-, 3 sg. pres. I act .yaj-ati, 3 sg. sigm. aor. inj. act. 
yat 


A: 


stha ‘stand’: ptcp. perf. pass, sthi-ta-, 3 sg. rootaor. act. a-stha-t, 3 sg. pf. act. ta-sth-au 
[full grade before a vowel], 3 sg. caus. pres. act. stha-p-avati 
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I: 


ric ‘leave’: ptep. perf. pass, rik-ta-, 3 sg. pluperf. act. a-ri-rec-it, 3 sg. perf. act. 
ri-rec-a, 3 sg. sigm. aor. act. a-raik 


I: 

m ‘lead’: ptep. perf. pass, ni-ta-, 3 sg. pres. I act. nay-ati, 3 sg. sigm. aor. subj. act. 
ne-s-at, 2 pi. sigm. aor. inj. act. nai-s-ta, 3 sg. perf. act. ni-nay-a 


kr ‘make’: ptep. perf. pass, kr-ta-, ptep. root aor. mid. kr-ana-, 3 pi. root aor. act. 
a-kr-an, 3 sg. them. aor. act. a-kar-at AV+, 3 sg. perf. act. ca-kdr-a, 1 sg. sigm. 
aor. act. a-kar-s-am 

drs ‘see’: ptep. perf. pass, drs-ta-, 1 sg. root aor. act. a-dars-am, 3 sg. perf. act. 
da-ddrs-a (3 sg. sigm. aor. act. a-drak Br., a-drak-s-Tt) 


R: 

tf ‘pass’: ptep. perf. pass, ffr-na-, 2 sg. pres. VI act. tir-asi, 3 sg. pres. I act. tar-ati, 3 
sg. sigm. aor. act. a-tdr-Tt 


N: 

-an- (nominal stem suffix), stem raj-an- ‘king’: loc. pi. raj-a-su, dat. sg. raj-n-e, loc. 
sg. raj-an-i, ace. sg. raj-an-am 

-ant- (act. ptep. suffix), act. ptep. car-an- ‘wandering’: loc. pi. car-at-su, nom. sg. car- 
an(t), ace. sg. car-ant-am 


N: 

jan‘ ‘be bom, generate’: ptep. perf. pass .ja-ta-, 3 sg. pres. IV mid .ja-ya-te, 3 sg. perf. 
mid .ja-jn-e, 1 sg. pres. I act .jan-ami, 3 sg. perf. act .ja-jdn-a 


Morphophonological classification of (root) morphemes 

In accordance with the ability of the root to appear in all or some of the above- 
listed ablaut grades, all Vedic roots can be divided into the following three main 
classes: 

(i) Fully alternating roots (hereafter called “alternating” for short) attest all three 
morphophonological grades; ct.ji-ljy- : je-l jay- : jai-l jav-, etc. 

(ii) Non-zeroing roots show the full grade in the forms where the zero grade is 
expected; cf. pad ‘fall’: pres. IV pad-va-te ‘falls’ (instead of **pd-ya-te) etc. In 
Vedic, a number of roots that in later (Classical) Sanskrit belong to the non-zeroing 
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type (in particular, most of the CaC roots) still attest isolated forms with the zero 
grade, thus vacillating between the type (i) and type (ii) behavior; cf. the themat- 
icized reduplicated present (I<— III in the notation of Table 4.18) pibdamana- {pi- 
pd-} made from this root. 

(iii) Non-alternating roots appear only either in the full or (for only a few roots) in the 
long grade; cf. vid ‘be firm’ etc. 

Another important division distinguishes between set and anit roots, set roots require 
insertion of -i- before certain suffixes (inf. -turn, ptcp. perf. pass, -ta-, fut. -sva-, sigm. 
aor. -s-); cf. e.g. vad ‘say’: ptcp. perf. pass, ud-i-ta- (not *ut-ta-) ‘said’; bhu ‘become’: inf. 
bhav-i-tum. Historically, this i goes back to the vocalic realization of a laryngeal in the 
interconsonantal position, but in Sanskrit the class of formations with the inserted i was 
considerably expanded by analogy. 

MORPHOLOGY 
Nominal morphology 

The grammatical categories of the noun include (i) the case (nominative, vocative, accu- 
sative, instrumental, dative, ablative, genitive and locative); (ii) the number (singular, 
dual and plural); and (iii) the gender (masculine, feminine and neuter). 

Alongside inflection proper, the forms of the paradigm may differ in their stems. The 
two main types of stem alternations include (i) ablaut in the root (only in a few noun 
stems) or, much more often, in stem suffixes; and (ii) accentuation. The ablauting mor- 
pheme may exhibit as many as three grades within the paradigm: (i) long or “Brug- 
mann’s”, (ii) full and (iii) zero; the zero grade may be represented by two variants in 
some declension types (e.g. in the an-declension). Accordingly, in terms of the ablaut 
type, all declension types can be divided into “three stem”, “two stem” and non-ablauting 
declensions. Within those types that distinguish between at least two stems, i.e. with the 
long or full grade, on the one hand, and with the zero grade, on the other, all forms of 
the paradigm are traditionally divided into “strong” and “weak” forms, respectively. The 
strong forms include nom., ace. and voc. of the masculine and feminine genders, except 
for the ace. pi., as well as nom./acc./voc. pi. of the neuter. The weak forms encompass all 
other members of the paradigm. Among the strong forms, the voc. sg. may differ from the 
others as the only form that has the full grade (shown with the underline font) as opposed 
to the long grade of other strong forms. In some declension types, the loc. sg. may have 
the full grade and, accordingly, be grouped with strong forms. 

The endings attested in the declension paradigm are summarized in Table 4.7. The right- 
most column shows the pronominal endings that do not occur in the substantive declension. 


TABLE 4.7 OLD INDO-ARYAN CASE ENDINGS 



Substantive 

Pronominal 

Singular 

NOM. 

-s, -0 (in a-, T-, n-, r-declensions) 

-dm 

VOC. 

-0 


ACC. 

-am, -m (in vocalic non-root 



declensions) 



( Continued ) 
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TABLE 4.7 (CONTINUED) 



Substantive 

Pronominal 

Singular 

INSTR. 

- a , - ,-ena (in a-declension, 
pron. m.) 


DAT. 

-e; -ai (in a-, T-, w-declensions, 
pron. f.) 

-smai (m./n.), -syai (f.) 

ABL. 

= gen. (except for a-type); -at 
(in a-declension) 

-smdt (m./n.), -syds (f.) 

GEN. 

- as , -sya (in a-declension, 
pron. m.); -as (in a-, F-, 
w-declensions, pron. f.) 

-syas (f.) 

LOC. 

- i , - 0 ; -dm (in a-, T-, 
w-declensions, pron. f.) 

-smin (m./n.), -syam (f.) 

nom./acc./voc. n. 

- 0 , -m (in a-declension) 

-d 

Dual 

NOM./ACC./VOC. 

-a, - , -au 


instr./dat./abl. 

-bhyam 


oen./loc. 

-os 


NOM./ACC./VOC. n. 

-l 


Plural 

NOM./VOC. 

-as 

e (m.) («— . . .a-i) 

ACC. 

-as, - s (in vocalic declensions, f.), - n 
(in vocalic declensions, m.) 


INSTR. 

-bhis 


dat./abl. 

-bhyas 


GEN. 

-dm, - n-am (in vocalic declensions) 


LOC. 

-su 


nom./acc./voc. n. 

-i, - , - n-i (in vocalic declensions) 



V = vowel; = length of the stem vowel; boldface = strong forms; underlined = full grade of alternation. 


Below are given in table form a few important declension types: simple consonant stems 
as attested in root nouns; the nf-declension (represented, foremost, by present active par- 
ticiples and mant-lvant- adjectives); the most productive masculine a-type (type deva-); 
and the feminine /-types. The forms unattested in early Vedic but reconstructable and/or 
known from later texts are in square brackets. 

(i) Root consonant stems 

((m./n./f.) °krt- ‘doing, producing’, °jit- ‘winning, conquering’, °cvut- ‘mov- 
ing’, soma-siit- ‘pressing Soma’, dirgha-srut- ‘heard far’) ; pad- ‘foot’, 
dp- ‘water(s)’, (m./n./f.) sah- ‘overwinner’, vah- ‘carrying, drawing’ (e.g. in 
anad-vah- ‘ox’) 
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TABLE 4.8 CONSONANT DECLENSION 

Singular 


Alternating 

nouns 


NOM. 

°kft 

pat 


sat 

voc. 

°kft 

[pad] 



ACC. 

°kft-am 

pad-am 


sah-am 

INSTR. 

°kft-a 

pad-a 

ap-a 

sah-a 

DAT. 

°krt-e 

pad-e 


sah-e 

abl./qen. 

°kft-ah 

pad-ah 

ap-ah 

sah-ah 

LOC. 

°cyut-i 

pad-i 



nom./acc./voc. n. 

°jit 

°pad 



Dual 

nom./acc.(/voc.) 

°jit-a, °krt-au 

pad-a 

ap-a 

sah-a °vah-au 

instr./dat./abl. 

[. . . d-bhyam \ 

pad-bhyam 



oen./loc. 

°kft-oh 

pad-dh 



nom./acc./voc. n. 





Plural 

NOM. (/VOC.) 

°kft-ah 

pad-ah 

ap-ah 

sah-ah 

ACC. 

°kft-ah 

pad-ah 

ap-ah 

m. sah-ah, mf. sah-ah 

INSTR. 

somasud-bhih 

pad-bhih 

ad-bhih 


dat./abl. 

°kfd-bhyah 

pad-bhyah 

ad-bhyah 

sad-bhyah 

GEN. 

°kft-am 

pad-am 


sah-am 

LOC. 

°kft-su 

pat-su 

ap-su 

anal-ut-su 

nom./acc./voc. n. 

( °srut 4 ) 





(ii) (a)n/-declension 

(with the root accent: cd.ra.nt- ‘going’ [pres. I cara-" ‘go ’],ydjant- ‘sacrificing’ [pres. 
I yaja-' ‘sacrifice’], vadant- ‘speaking’ [pres. I vdda- a ‘speak’], pdsyant- ‘looking’ 
[pres. IV pasva-" ‘look patavant- ‘flying’ [AYA-pres. patava-" ‘fly’]; with the suf- 
fix accent: grrnnt- ‘praising’ [pres. IX gpjd-“, grtji Ji ‘praise’], janant- ‘knowing’ [pr. 
IX jdnd- l ',jdni- K ‘know’], yant- ‘going’ [pres. II e-“ ‘go’], sant- ‘being’ [pres. II as-" 
‘be’]; pasumant- ‘having cattle’) 


TABLE 4.9 (A ) V 7'- 1 ) E C I . E N S I O N 


Singular 


“Lengthening” type (m/vant-) 

NOM. 

car an; janan 

pasu-man 

VOC. 

[car an] 


ACC. 

carant-am; grnant-am 


INSTR. 

carat-a; janat-a 


DAT. 

carat-e; janat-e 


abl./gen. 

carat-ah; yat-ah 


LOC. 

[carat-i]; °yat-l 


nom./acc./voc. n. 

carat ; sat 



(i Continued ) 
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TABLE 4.9 (CONTINUED) 

Dual 

nom./acc.(/voc.) 

carant-a, yajant-au; sant-a, sant-au 5 

instr./dat./abl. 

[carad-bhyam] 

oen./loc. 

[carat-oh] 

nom./acc./voc. n. 

[carat-T\; °yat-i 

Plural 

N0M.(/V0C.) 

pasyant-ah 6 ; janant-ah 

ACC. 

vadat-ah; grnat-ah 

INSTR. 

patayad-bhih 

dat./abl. 

pasyad-bhyah; yad-bhyah 

GEN. 

carat-am; sat-am 

LOC. 

patayat-su; grnat-su 

nom./acc./voc. n. 

[carant-i]; sant-i 1 pasu-mant-P 

(iii) a-stems ( deva - 

‘god’, yajna- ‘sacrifice’, pada- ‘foot’, kanja- ‘ear’) 


TABLE 4.10 ^-DECLENSION (TYPE DEVA-) 

Singular 

NOM. 

deva-h 

voc. 

deva 

ACC. 

deva-m 

Instr. 

devena , (yajna) 

DAT. 

devaya 

ABL. 

yajnat 

GEN. 

devasya 

LOC. 

deve 

nom./acc./voc. n. 

pada-m 

Dual 

nom./acc.(/voc.) 

deva, devau 9 

instr./dat./abl. 

karnabhyam 

oen./loc. 

deva-y-oh 

nom./acc./voc. n. 

pade 

Plural 

NOM. (/VOC.) 

dev ah, dev as ah 

ACC. 

devan 

INSTR. 

devaih, devebhih 10 

dat./abl. 

devebhyah 

GEN. 

devanam 

LOC. 

devesu 

nom./acc./voc. n. 

pada, padani n 


(iv) (-declension 

Root steins: dht- ‘thought’, srt- ‘light, splendour’; w-Af-type ({*-///-}, consonant 
type): apart- ‘future’, nadt- ‘river’, naptt- ‘grand-daughter’, rathf- ‘charioteer’, 
vrkt- ‘she -wolf’; devt- type ( vocalic type): devt- ‘goddess’; maht- ‘earth; cow’; 
osadhT- ‘herb, plant’ 
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TABLE4.il /-DECLENSION (TYPES DEVI- AND VRKI-) 



root stems 

vyki-type 

devi-type 

Singular 

NOM. 

dhi-h 

vrkt-h 

devt 

voc. 


°nadi 

devi 

ACC. 

dhly-am 

vrk/y-dm 

devt-m 

INSTR. 

dhiy-a 

rathiy-a 

devy-a 

DAT. 

dhiy-e 

vrk/y-e 

devy-ai 

abl./gen. 

dhiy-as 

ady-ah 

devy-ah 

LOC. 


nadt 

devy-am 

Dual 

nom./acc. 

°sriy-a, -au' AVt) 

nady-a 

devi 

voc. 


rathy-a 

devi 

instr./dat./abl. 

[dhi-bhyam] [vrki-bhyam] rodasi-bhyam 


gen./loc. 

[dhiy-oh] 

napty-dh 

rodasy-oh 

Plural 

nom./acc. 

dhly-ah 

rathy-ah 

devt-h 

voc. 

dhly-ah 


devi-h 

INSTR. 

dhT-bhlh 

nadt-bhih 

osadhi-bhih 

dat./abl. 

[dhl-bhyah] 

apart-bhyah 

osadhl-bhyah 

GEN. 

dhl-n-am, (dhiy-am [x ) 

nadt-n-am 

mahi-n-am 

LOC. 

dhi-su 

nadt-su 

osadhi-su 

The history of the nominal inflexion is 

summarized in Table 4.12: 



TABLE 4.12 PIE SOURCES OF CASE ENDINGS 

Singular 

nom. -s, -0 < *-s, *-0 

voc. -0 < *-0 

acc. -(a)m < *-m 

instr. -a, - < *-eh, i-ena is borrowed from pronominal declension) 

dat. -e; -ai < *-(e)y 

(a-declension: ) -aya < *. . .o-ey + a (particle (?)) 
abl. -at < *-o-ed (?) 

gen. -as < *-(o)s, -sya (from pronominal) 

-dvas for **-as (<*... eH-os) with insertion of -y- from /“-declension 
loc. -/', -0 < * -i, *-0 

-dm < *-a + am ? 

nom./acc ./ voc. n. -0, -m (in n-declension) 


Dual 


nom./acc ./voc. 

- < *-hj, -au < a + u 

instr./dat./abl. 

-bhyam < *? 

gen./loc. 

-os < *- h,e/oHs ? 

nom./acc./ voc. n. 

-/“ < *-ihj 

Plural 

NOM./VOC. 

-as < *-es 

-as < *. . .o-es (a-declension) 

ACC. 

-as < *-ns (< *-m-s (?)) 

-an (^-declension): analogy with nom. pi. 

- n: analogically from ^-declension 


( Continued ) 
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TABLE 4.12 (CONTINUED) 


Plural 


INSTR. 

DAT./ABL. 

OEN. 

LOC. 

nom./acc./ voc. n. 


-bins < *-b h i + *-s (plural) 

-ais (pronominal) 

-bhyas : *-b h i + *-(y)os (abl. pi.) 

-dm <*. . . o-om (o-declension; then analogically to other types) 
-su < *-su 

-i, - < *-h 2 ; - n-i is borrowed from //-declension 


Verbal morphology 

Grammatical categories of the verb 

The Vedic verbal paradigm includes four main classes of forms (“tense systems”), called 
present, aorist, perfect and (rare in early Vedic) future systems. Within each of these sub- 
sets, fonns are built on the same stem, i.e. on present, aorist and perfect stems, respectively. 
There are several sets of personal endings: “primary” (used foremost in the present tense) 
and “secondary” (endings used in the imperfect, aorist and pluperfect, as well as in two non- 
indicative moods, the optative and subjunctive), perfect, imperative and subjunctive. Each 
tense system includes a number of finite forms and a pair of participles, active and middle. 

The inventory of the grammatical categories of the verb includes (i) person (1st, 2nd and 
3rd) and number (singular, dual and plural); (ii) mood (indicative, imperative, injunctive, 
subjunctive, optative and conditional); (iii) tense: present, three past tenses (imperfect, 
perfect and aorist, in Vedic typically encoding a recent past event, most often experienced 
by the speaker/author of the text) and future; and (iv) diathesis, also called “voice” (active 
and middle). The range of the functions rendered by the middle diathesis is typical for the 
ancient Indo-European linguistic type as attested in “Classical” languages (Ancient Greek, 
Latin). Here belong the self-beneficent meanings with no valency change (‘do smth. for 
oneself’, as in the textbook example yajati ‘sacrifices’ -yajate ‘sacrifices for oneself’), as 
well as a number of intransitivizing derivations, such as passive, reflexive and anticausative 
(decausative); see Gonda 1979. The choice of the function(s) idiosyncratically depends on 
the base verb. However, already in the language of the earliest text, RV, we observe the loss 
of several grammatical functions of the ancient Indo-European middle, and the intransitiv- 
izing functions are largely taken over by special productive markers, such as the passive 
suffix -yd- and the reflexive pronouns tami- and atman -; see Kulikov 2012b. 

The non-finite fonns include two participles (active and middle) for each tense, converbs 
(traditionally called “absolutives” or “gerunds”), infinitives, gerundives and some others. 

The morphemic structure of the verbal form 

The verbal form has the following maximal morphemic structure: (preverb(s)) . . . /-(aug- 
ment a-)-(reduplication syllable)-root [which may incorporate the nasal infix]-(derivational 
stem suffix)-(thematic vowel a 12 )-(mood)-personal ending. Cf. a-cu-cvuv-T-mahi (RV 8.9.9) 
= preverb + reduplication syllable + root + optative morpheme + ending of the 1st person 
plural middle optative fonn (= 1 pi. opt. mid. fonn of the reduplicated (causative) aorist of 
the verb cyu ‘move, impel’) ‘(if) we could impel [you]’; vy-a-di-dvis-a-h (AV) = preverb + 
augment + reduplication syllable + root + thematic vowel + secondary (= aorist/imperfect) 
ending of the 2nd person singular active fonn (= 2 sg. act. of the reduplicated (causative) aor- 
ist of the verb dvis ‘hate’) ‘you have made (them) hate each other’; {a-chand-s-t} — > achan 
augment + root + sigmatic aorist morpheme + secondary ending of the 3rd person singular 
active fonn (= 3 sg. aor. act. of chand ‘appear’) ‘s/he has appeared’. 



INDO-ARYAN 239 


Personal endings and ablaut alternation in the verbal stem 

The general system of personal endings is summarized in Table 4.13. Wherever possi- 
ble the morphemic border between the ending and the thematic vowel is shown with a 
hyphen; the lack of a hyphen before the endings of the thematic conjugation shows its 
coalescence with the thematic vowel (a). 


TABLE 4.13 VEDIC VERBAL PERSONAL ENDINGS 


Active diathesis 




“primary” (present) 

“secondary” 
(impf., aor., opt.) 


imperative 

subj. 

perf. 



athem. them. 

athem. them. 

athem. them. 



SG 

1 

-mi - mi 

-am -m 


(= subj.) 

-dni, -a, -m 

-a 


2 

-si 

-s 


- 0 , -tat 

-s(i) 

-tha 





-hi, u : 

- dhi , 14 - ana 15 




3 

-ti 

-t 


-tu 

-t(i) 

-a 

DU 

1 

-vas - vas 

-va - va 


(= subj.) 

- va 

[-va] 


2 

-thas 


-tam 


-thas 

-athur 


3 

-tas 


-tdm 


-tas 

-atur 

PL 

1 

-mas(i) - mas(i) 

-ma - ma 


(= subj.) 

- ma 

-ma 


2 

-tha, -thana 


ta, -tana 


-tha 

-a 


3 

-nti, -ati {6 

-an, -ur xl -n 


-(a)ntu / -atu 

-n 

-ur 


Middle diathesis 




“primary” 

“secondary” 

imperative 

subjunctive 

perfect 



athem. them. 

athem. them. 




SG 

1 

(-)e 

-i -e 

(= subj.) 

(-)ai 

-e 


2 

-se 

-thas 

-sva 

-se, -sai 18 

-se 


3 

-te (-e) 

-ta (-a-t) 

-tdm, -dm 

-te, -tai 19 

-e 

DU 

1 

-vahe - vahe 

-vahi - vahi 

(= subj.) 

- vahai 

-vahe 


2 

-athe ethe 

-atham / etham 


(-)aithe 

-athe 


3 

-ate ete 

-a tdm / etam 


(-)aite 

-ate 

PL 

1 

-make - make 

-mahi - mahi 

(= subj.) 

- mahe 

-mahe 






- mahai 



2 

-dhve 

-dhvam 


-dhve 

-dhve 






(- dhvai 20 ) 



3 

-ate, -ate -nte 

-a(n)ta -nta 

-atam -ntdm 

-nta 

-re 


- ran 21 


The two main classes of stems, athematic and thematic, differ in that the latter appear 
in the same shape in all forms of the paradigm, while the former exhibit a number of 
alternations. Like in the declension paradigm, the main types of stem alternations include 
(i) ablaut in the verbal stems, that is, in the root (in athematic root stems) or in the suffix 
(in other athematic stems), and (ii) accentuation (partly correlating with the ablaut grade 
of the morpheme). 

The ablauting morpheme may appear (i) in the full (or “Brugmann’s”) grade or (ii) in 
the zero grade. Accordingly, all forms of the paradigm are divided into “strong” and 
“weak” forms, respectively. The strong forms (the corresponding endings are in bold- 
face in the tables) include (i) the sg. act. of the present, imperfect/injunctive and perfect 
(3 sg. perf. act. has the “Brugmann’s” grade); (ii) 3 sg. imp. act.; (iii) all forms of the 
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subjunctive; (iv) 3 pi. impf. act. of the reduplicated present (class III) [before -ur]. The 
forms that often or sporadically (shown with the underline font) have the full grade are 
(iv) 2 pi. pres./imp. act. (almost always before -tana, sporadically before - ta ); and (v) 2 
sg. imp. act. (with a few roots). Fonns that have the zero grade are called “weak forms”. 
The accent is on the stem in the strong forms and on the endings in the weak fonns 
(except for some weak forms of the reduplicated present). 


Histoiy oj Old Indo- Ary an personal endings 

The rich variety of the Old Indo-Aryan personal endings must go back to the six or seven 
series of the Proto-Indo-European verbal endings as reconstructed, in particular, by Kort- 
landt (1979: 66ff.) or Beekes (2011): 


TABLE 4.14 SERIES OF THE PIE VERBAL ENDINGS 




Primary 

athematic 

Secondary 

athematic 

Primary 

thematic 

Secondary 

thematic 

Perfect 

Stative 
(Middle intr.) 

(Middle 

transitive) 



i 

ii 

IV 

hi 

V 

VI 

(VII) 


1st 

*-mi 

*-m 

*-oH 

*-om 

*-h,e 

*-h. 

*-mh. 

SG 

2nd 

*-si 

*-s 

*-eh,i 

*-es 

*-th 2 e 

*-th,o 

*-sth,o 


3rd 

*-ti 

*-t 

*-e 

*-et 

*-e 

*-0 

*-to 


1st 

*-mes 

*-me 

*-omom 

*-omo 

*-me 

*-med h h. 

*-me(s)d h h. 

PL 

2nd 

*-th,e 

*-te 

^ethje 

*-ete 

*-e 

*-d h we 

*-td h we 


3rd 

*-(e)nti 

*-(e)nt 

*-o 

*-ont 

*-(e)r 

*-ro 

*-ntro 


Using the asterisked Roman numbers to refer to Table 4.14, one may synopsize the his- 
tory of the attested Vedic personal endings in Table 4.14 (individual endings are explained 
only where their origin is not straightforwardly explained by the general correspondence 
given in the heading of the column). 


TABLE 4.15 PIE SOURCES OF VEDIC PERSONAL ENDINGS 


Active diathesis 



“primary” < *1 

“secondary” < *11 imperative perf. < *V 

SG 1 

. . . ami < *-oH + *-mi 

-am 1 -m (<— subj.) 


(= *1 + *IV) 

< *-m/-m 

2 


*-o, *-tod, *-d h i 



-ana <*. . .H-o 



(+ particle na (?)) 

3 


-tu < *-t (*II) 



+ particle u 

PL 1 

-mas(i) < *-mes(-i) (analogically) 

(<- subj.) 

2 

-tha (+ na - particle?) 

-ta ( + na - particle?) 

3 

-nti, -ati < *-nti/-nti 

-(a)n < *-(e)nt. 



-ur < *-r (from *V) 
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Middle diathesis 



“primary” 

“secondary” 

imperative 

Perfect 


< *11 + *VI + *1 

< 4 VI 


< “primary” or 
*VI + *i (from *1) 

SG 1 

e (analogically) 

-/- < *-h, (*VI) 

(<- subj.) 


2 

-se (analogically) 

-thus (*VI + *11) 



3 

-te < *-t + *-o + *-i 

-ta (*VII 

-(t)am < *-(t)o 

-e < *-o + *-i 


(*II + *VI * *1) 

or *11 + *VI) 

(*VI/VII)+ 
particle -am 

(*VI + *I) 

PL 1 

< *VI + / 

-mahi < *-med h h. 

(<- subj.) 


2 


< *VI + particle (?) m 


3 

< *-nt (*II) + *-o (from 

< *-nt (*II) + 

< *-nt (*H) + 

-re < *-ro + *-i 


sg.) + *-i (from act.) 

*-o (from sg.) 

-dm (from sg.) 

(*VI t *1) 


t = extracted from/under the influence of 


General overview of tense systems and formation of tense stems 

The system of the finite forms of the Vedic Sanskrit verb (and, particularly, its variant 
attested in the most ancient Vedic text, the RV) is generally considered extremely com- 
plicated and irregular as compared to the corresponding system of Classical Sanskrit. Yet 
this opinion seems to have been imposed by the Sanskritist tradition (essentially going 
back to the approach of Old Indian grammarians), which usually takes Classical Sanskrit 
paradigms as a starting point for a grammatical description of Vedic. Such a perspective 
presents the Vedic paradigms in terms of a list of differences from the Classical Sanskrit 
system. This approach is, in a sense, anhistorical and methodologically (as well as ped- 
agogically) misleading, since it does not reflect the evolution of the Old Indian morpho- 
logical system. As is well known, the Classical verbal system evolves from the Vedic, not 
the other way around. More specifically, the former system can be regarded as a result of 
reduction of the latter. 

To put it differently, the Vedic verbal system shows very few constraints on applying 
inflectional morphemes to various verbal stems as compared to what we observe in the 
Classical language. In fact, the Vedic system can be said to be much larger and yet much 
simpler, in terms of the number of combinatory constraints, as compared to the much 
smaller Classical system. 

The Vedic verbal system can be presented as based on two parameters (see Kulikov 
2008): (i) the type of stem and (ii) the type of inflection. There are four types of stems: 
present (pres.), aorist (aor.), perfect (perf.) and future (fut.). The types of inflection 
include (1) primary endings (1 sg. act. -mi, 2 sg. act. -si, 3 sg. act. -ti, . . ., 3 pi. act. -ntH-ati, 
etc.); (2) augment a- + secondary endings (1 sg. act. -(a)m, 2 sg. act. -5, 3 sg. act. -t, etc.); 
(3) secondary endings; (4) imperative endings; (5) subjunctive morpheme a + subjunctive 
endings (which are distinct from primary or secondary endings only for some middle 
forms); (6) optative morpheme Tlya (which coalesces with the preceding thematic vowel 
a into e) + secondary endings; and (7) perfect endings; all these types are exemplified 
in the upper row of Tables 4.16-17 by the 3 sg. act. and mid. endings. Combining these 
two sets, we obtain 4 x 7 = 28 logically possible formations. Most of them are actually 
attested in Vedic (though some are very rare). Only ten of them survive into the Classical 
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Sanskrit paradigm, including the present and aorist injunctive (which only survives with 
the prohibitive particle ma) and aorist optative (which is only preserved in the precative, 
based on the root aorist optative). Note that all formations that belong to the standard 
Classical Sanskrit paradigm (= cells bordered with a shadowed line in Table 4.16) are 
also present in Vedic, though some of them are very rare (or even exceptional) in the early 
language, as is the case with the conditional (one attestation in the RV). 

Some of these formations require special comments. Thus, “perfects with present end- 
ings” include such forms as 3 sg. act. jagar-ti ‘watches’ (V/f ‘become awake’), 3 pi. 
act. dTdv-ati ‘(they) shine’ (Vr/F ‘shine’). These formations do not represent a separate 
synchronic category. Historically, these forms are based on regular perfects, which, at 
some stage, have been reanalyzed as belonging to the present system - most often on 
the basis of non-indicative forms (in particular, subjunctives) that share endings with 
present forms. Once these forms had been transferred to the present system, they lost 
their synchronic link with the perfect system, becoming present formations with irregular 
(“perfect”) reduplication (see Kummel 2000: 59ff, 191-194, 227-230 et passim). 

The non-indicative moods of the perfect and aorist are attested only in early Vedic 
and are non-productive already in the early language (for aorist imperatives, see Baum 
2006). Still rarer, even exceptional, are “aorist injunctives with present endings” (labeled 
by Hoffmann “the type gatha”), which include a few isolated forms attested in the RV. 
Here belong, in particular, 2 du. act. kr-thas (V Ikr ‘make’) RV 1.112.8, 5.74.5, 10.39.8; 
bhu-thas (ylbhu ‘become’) RV 6.67.5, 3 du. act. bhu-tas RV 10.27.7, and 2 pi. act. ga-tha 
('Jgam ‘go’) RV 8.20.16; 1 pi. med. dhl-mahe (yldha ‘put’) RV 2.23.10 should perhaps be 
included in this group, too. These nonce formations (see Lubotsky 1997: 439, 471, 747, 
992) do not instantiate root presents (contra Whitney 1885: 21, 34 etc.). As Hoffmann 
(1952 [1957]: 128-130 [= Hoffmann 1976: 364-366]) has demonstrated, they are built on 
the basis of root aorist imperatives (such as 2 du. imp. act. kr-tam, 2 pi. imp. act. kr-ta, 2 
pi. imp. act. ga-ta etc.), and on the model of 2 pi. pres. imp. act. ha-ta : 2 pi. pres. ind. act. 
ha-tha ('than ‘kill’). Their status within the inventory of finite forms is clearly marginal. 

Future injunctives (= conditionals without augments) and optatives are extremely 
rare. A few such fonns are attested in Epic Sanskrit: 2 sg. opt. act. pari-trasyah {'itra 
‘rescue’) MBh 8.13.24, 3 sg. opt. act. bhavisvat (~ibhu) MBh 2.51.25, 3 sg. opt. act. 
dhaksvet {'idah ‘burn’) MBh 1.221.19 v.L, 3 sg. opt. med. draksveta (yldrs ‘see’) Ram. 
3. 1074*. 4; see Whitney 1889: 334; Renou 1930: 401f., 462; Oberlies 2003: 236f., 240. 

The system of present formations is the richest subsystem of the verbal paradigm, 
which is often said to abound in irregular and abnormal fonns. The traditional Old Indian 
system of ten “primary” present classes satisfactorily captures the main morphological 
types. However, it is also simplistic in some respects, grouping together some types of 
clearly different nature. In particular, it disregards the important difference between (i) 
the two main thematic types, classes I (with the accent on the root) and VI (with the 
accent on the thematic vowel), on the one hand, and (ii) secondary thematicizations, i.e. 
present types that have been transferred from some athematic types to classes I and VI or 
that can be regarded as such transfers (irrespective of their origin). 

Below I outline a somewhat different system of description based on a few parameters 
that enable us to generate virtually all present formations attested in Vedic. Combining 
the three main morphological and morphophonological features, we obtain all present 
stems, including “minor types”, which find no place in the traditional Indian nomencla- 
ture. These parameters include (i) the present stem formative (symbolized as F in Table 
4.18): no present stem morpheme (o); stem suffixes (-v(a)-, -nd-t-nu- etc.); nasal infix 
(- nd-l-n -; placed between • • in the illustrative examples); and reduplication; (ii) the pres- 
ence of the thematic vowel -a-; and, for thematic present stems, (iii) the placement of the 
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main accent: on the root vs. on the thematic vowel. Note that the stem suffix (or infix) of 
the secondary thematic classes always appears in the zero grade and contains no vocalic 
element; for that reason, it cannot bear the accent. 

For the sake of convenience, I use a complex notation that is as close as possible to 
the traditional Indian system of ten classes (symbolized by Roman characters). Each of 
the secondary thematic types (qualified by the Sanskritist tradition as belonging either to 
class I or to class VI) is related to the corresponding athematic type (I<— V, VI<— VII, etc.), 
thus being presented as the result of thematicization. This is a purely synchronic notation: 
the arrow (<— ) does not necessarily mean that the thematic type in question historically 
goes back to a (hypothetic) athematic pendant. Thus, for instance, I do not argue that, for 
the class “VI<— VII” present kr-n-t-a-ti, we have to reconstruct the athematic class VII 
present *kr-na-t-ti, etc. A few secondary thematic(ized) types are not actually attested. 
These include classes I<— IX, VI<— V 22 and VI<— III. 

In addition to the nine “primary” classes, this calculus generates one type that is tradition- 
ally not included in the system of “primary” present types, passives with the suffix -yd- (tra- 
ditionally considered as one of “secondary” fonnations, which also include -ava-causatives, 
intensives and desideratives). In fact, the only fonnal difference between class IV presents 
and -va-passives is the place of the stress (on the root vs. on the thematic vowel/suffix). 
Thus, this fonnal opposition follows the same pattern as the opposition between types 
I<— VII ( sirm-bh-a-ti ) and VI<— VII ( kr-n-t-a-ti ). Note that the -va-class also includes a few 
non-passive -va-presents (symbolized as *IV in Table 4.18) of the type mrivate (ylmr ‘die’) 
with passive accentuation (<— *mr-ye-te). On this type, see Kulikov 1997. 

There are no athematic presents with the suffix -/- in Sanskrit (= athematic counterparts of 
the - va-presents); one of the few traces of the Proto-Indo-European athematic /-type might be 
the present kseti (> Iksi ‘dwell’) < *tk-ey-ti; see Kortlandt 1989: 109; LIV 2001: 644, note 1. 

Next to the main present classes, Table 4.18 also includes two non-productive present 
types with the suffixes -cha- and -va- 23 (on which see, in particular, Goto 1987: 73). 

TABLE 4.18 THE VEDIC SYSTEM OF PRESENT STEM TYPES 


present stem formatives (suffixes and other morphemes) 



0 (no) 

-y(a>- 

-no-l-nu- 

-na-l-rii- 

infix -nd-l-n- 

pres. 

reduplic. 

-ch(a)- 

athematic: 

Vn+F- 

II 

at-ti 

(kseti) 

V, VIII 
su-no-ti, 
ta-no-ti {'•han) 

IX 

pr-na-ti 

VII 

yu-na-k-ti 

Ill 

da-da-ti 

- 

accent on the 

root: 

thematic: Vn+F+a- 

I 

bhav-a-ti 

IV 

lubh-ya-ti 

I<-V 

i-nv-a-ti 

(MX) 

I«-VII 

swm-bh-a-ti 

I<— III 
ti-sth-a-ti 

ga-cha-ti 

Vn +F +a- accent on the 

them, vowel: 

Vn+F+a- 

VI 

tud-a-ti 

pass., *IV 
han-ya-te, 
mri-ya-te 

(VI<— V) 
{*r-nv-a-ti ?) 

VI<-IX 

pr-n-a-ti 

VI<-VII 

kr-n-t-a-ti 

(VI<-III) 

Vpha 

r-cha-ti 


-v(a>- 


(VIII) 

( tar-u-te ) 


r a 


tur-va-ti 


Voice, causatives and transitivity oppositions 
Voice and passives 

There are several verbal formations in Vedic that can be employed in passive construc- 
tions. Non-finite passives are passive perfect participles with the suffix -ta-/-na- and 
gerundives, or future passive participles, with the suffixes -ya-, -tavva- and -antva-. Finite 
passive fonnations include (1) presents with the suffix -yd- (derived from the root by 
means of the suffix -v(a)-, which can only take middle endings; e.g. han ‘kill’: 1 sg. 
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han-ye, 2 sg. han-ya-se, 3 sg. han-ya-te, etc.); (2) medio-passive /-aorists (with defective 
paradigm: only 3 sg. in -i, 3 pi. in -ran/-ram and participle; e.g. vuj ‘yoke, join’: 3 sg. 
ayoji, 3 pi. ayujran, ptcp. yujana-); (3) middle perfects and statives (which supply passive 
perfects for some verbal roots; also with defective paradigm: 3 sg. in -e, 3 pi. in -re and 
participle; e.g. hi ‘impel’: 3 sg. hinve ‘(it) is impelled’, 3 pi. hinvire ‘(they) are impelled’; 
ptcp. hinvana -); 24 and (4) some (isolated) middle forms (e.g .vuj ‘yoke, join’: 3 pi. sigm. 
aor. avuksata; gf ‘praise’: 3 sg. class IX pres, grmte ‘is praised’). 

The inventory of the present passive fonns attested in the RV and AV is shown in 
Table 4.19. The members of the paradigm are mainly exemplified by fonns of the verb yuj 
‘yoke, join’ (which exhibits one of the most complete attested paradigms), supplemented by 
fonns of other verbs where those of yuj are unattested. The lacking tense-moods of the pas- 
sive paradigm (which include imperfect, injunctive, subjunctive and optative) are shown with 
dark gray shading - with the exception of a few hapaxes marked with middle gray shading. 


TABLE 4.19 THE INVENTORY OF THE PRESENT PASSIVE FORMS ATTESTED IN THE 
RVANDAV 


present 

SG 1 

2 yujya-se 

3 yujya-te 

DU 1 

2 

3 ucyete RV L lx 

PL 1 -panyamahe 

RV, 1 ' 

2 

3 yujya-nte 

PARTICIPLE 


injunctive imperative 



nya-ta RYj ' suya-ta RV, 


dhiya-sva AV 
dhlya-tam AV 



yujya-dhvam 
badhya-nlam 
yujya-mana- 


subjunctive optative 



1 x = one attestation; RV L stands for late RV. 


Only the 3rd person singular and plural fonns are well attested. Next to a dozen 2 sg. 
fonns ( yujyase ‘you are (being) yoked’, sasvase ‘you are (being) praised’ etc.), we find 
only one occurrence of a 3 du. fonn, ucyete (RV 10.90.11) ‘[the two feet] are called’, 
and one (philologically and grammatically rather unclear) form - panyamahe , which may 
represent a 1 pi. (‘we are (being) glorified’ (?); see Kulikov 2012a: 144—146). 1 sg., 1 du., 
2 du. and 2 pi. forms are unattested. 

Next to present fonns proper, participles and rare imperatives (10 forms or so in the 
RV and AV), only exceptional attestations of other tense-moods are found. These include 
as few as five forms: 


(i) 3 sg. impf. amyata ‘(she) was brought’ in the late RV (8.56.4 = Valakh. 8.4) and 3 
pi. impf. -asicvanta ‘(they) were besprinkled’ in AV 14.1.36; 

(ii) 3 sg. inj. suvata ‘(he) is consecrated’ in the late RV (10.132.4) (see Kulikov 2012a: 
284ff.) and 3 pi. inj. -aprcyanta in the late RV (1.110.4) ‘(they) united’ (non-pass, 
intr.) (see Kulikov 2012a: 154); 

(iii) 3 sg. subj. -bhriydte (RV 5.31.12) ‘(it) will be brought’. 
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Optatives of the present passive do not occur before the very end of the early Vedic 
period. 

Only from the middle Vedic period onwards, when the present passive system becomes 
well established, do we find a good many imperfects, subjunctives and optatives of 
-vd-passives. 

The early Vedic passive paradigm (for all three tense systems, present, aorist and per- 
fect) is summarized in Table 4.20. An almost complete paradigm is attested for the verbs 
su ‘press (out)’ and yuj ‘yoke, join’. (In the cases where forms of these two verbs are unat- 
tested, I put in square brackets forms made from other roots.) Different degrees of shad- 
ing show the status of the corresponding forms: dark gray = lacking and morphologically 
impossible; middle gray = morphologically possible but unattested or only exceptionally 
attested (underdeveloped part of the paradigm); light gray = morphologically possible 
but rare (perhaps primarily for pragmatic reasons; cf. the rarity of passive imperatives). 

TABLE 4.20 PASSIVE PARADIGM IN EARLY VEDIC 


present aorist perfect/stative 

indicative imperative indicative inj. indicative imper. 


SG 1 


2 

• • • ,yujyase 

[dhiyasva] 

3 

suyate , yujyate 

[dhlyatam] 

DU 1 



2 



3 

\ucyete] 


PL 1 

[-panyamahe (?)] 


2 


yujyadhvam 

3 

. . .,yujyante 

\ badhyantam \ 


PARTICIPLE suyamana-, vujydmana- 



dsavi.avoji . . ,,voji sunve,yuyuje [duham] 


. . . , ayujran . . . sunvire, yuyujre 

s“vand-, yujana- sunvana-, yuyujana- 


Furthermore, there are some reasons to assume that stative -ana-participles could have 
active counterparts, that is, participles derived from the stative stem with the active par- 
ticiple suffix -ant, cf. such forms as stavant- (active participle?) ‘praised’ (first noticed 
by Watkins 1969: 142ff.) and pepisat- ‘adorned’ (RV 10.127.7; see Schaefer 1994: 45, 
152fi), which may point to the unattested stative *pepise ‘is adorned’ of the type cekite 
(Schaefer 1994: 44); (iii) mahant- ‘great’ (probably belonging with the hapax stative 
mahe ‘is able’ (RV 7.97.2) (see Kulikov 2006b: 59ff. for details). These forms represent 
a good structural parallel to Hittite participles with the suffix -ant- that form passives for 
transitive verbs (but hitherto have generally been considered isolated phenomena, with- 
out parallels in other Indo-European branches), such as kunant- ‘killed’ ( kuen - ‘to kill’) 
or dant- ‘taken’ {da- ‘to take’). 

The hypothesized existence of active participles in the stative paradigm, which, in spite of 
their “active” morphology, were employed in passive constructions, still further supports the 
connection between the stative fonnation and the passive syntactic pattern, on the one hand, 
and, on the other hand, serves as additional evidence against the traditional assumption about 
a straightforward connection between middle morphology and passive syntax in Vedic. 
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Causative oppositions 

Vedic attests a remarkable variety of morphological types of causative oppositions and a 

rich system of morphological causatives. 

(i) The most regular and productive causative marker in the present system is the 
suffix -(p)aya- attached to the “Brugmann’s” (full/long) grade of the root, 25 cf. 
vrdh- ‘grow, increase’ - vardhdyati ‘makes grow, increases’, cit- ‘appear, per- 
ceive’ - cetavati ‘shows (= makes appear), makes perceive’ (~ citavati ‘appears’). 

(ii) In early Vedic we also find a few other types of the causative opposition within 
the present system, where the causative member is represented by another present 
type, usually by one of the presents with nasal affixes: suffixes -nd-/-nu- (pres, 
class V), -na-/-m- (pres, class IX) or infix -na-/-n- (pres, class VII); the intransitive 
member (anticausative) is often a class IV present (with the suffix ' -ya-) or a class 
I present. Generally, the intransitive (anticausative) member of the opposition is 
inflected in the middle, the transitive -causative in the active (but transitives with 
middle inflection are possible as well). Cf. i- ‘go, send’: eti (pres, class II) ‘goes’- 
inoti, invati (pres. V and its thematicization) ‘sends’; ksf- ‘perish, destroy’: ksTyate 
‘dies, perishes’ (pres. IV) - ksindti (pres, class IX) ‘destroys’;y'a«- ‘be bom, arise’: 
javate ‘is born’ (pres, class IV) - janati (pres, class I) ‘begets’; pu- ‘purify’: pdvate 
‘becomes clean, purifies oneself’ (pres, class I) - punati (pres, class IX) ‘purifies’. 

Two minor types of causative oppositions in the present system include pairs with 

(iii) reduplicated causatives (pres, class III, cf. vucchati ‘keeps away ' -yuvoti ‘makes 
keep away’) and 

(iv) present class VI causatives (cf. tarati (pres, class I) ‘crosses’ - tirati (pres, class 
VI) ‘passes over, rescues’). 

(v) Finally, for some verbs, the causative opposition is marked only by the diathesis (mid- 
dle: intransitive - active: transitive); although some middle fonns can also be used 
transitively (with their self-beneficent meaning), thus showing the labile syntax (Kulikov 
2014). This type of opposition is mostly attested for class I presents; see, for instance, 
Gotd 1987: 48ff, cf.: namate ‘bends’ (intr.) - namati (/-te) ‘bends’ (trans.); vahate 
‘drives, goes’ - vahati ( vahate ) ‘carries’; svadate ‘is sweet’ - svadati ‘makes sweet’. 

(vi) In the aorist system, the causative meaning is typically expressed by the redupli- 
cated aorist, cf. vrdh ‘grow, increase’ - avTvrdhat ‘has made grow’. 

(vii) Furthermore, the intransitive (anticausative) member of the causative opposition 
can also be expressed by the medio-passive -/-aorist, sometimes alongside the sig- 
matic aorist; cf.: jan ‘be bom, arise’: ajani, sigm. aor. ajanista ‘has been bom’ - 
ajijanat ‘has generated’. 

(viii) In the perfect system, the causative/non-causative distinction can only be ren- 
dered by the diathesis opposition, although not quite consistently: active fonns can 
be used both transitively and intransitively (thus being labile; see Kulikov 2014: 
1151-1152). Cf. 3 sg. perf. mid. paprathe ‘has spread (intr.)’ - 3 sg. perf. act. 
papratha ‘has spread (trans.), filled out’; 3 sg. perf. mid. vavrdhe, 3 sg. perf. act. 
vavardha ‘has grown (intr.)’ - 3 sg. perf. act. vavardha ‘has increased (trans.)’ (see, 
in particular, Kiimmel 2000: 319ffi, 469ff. et passim). 

(ix) Fora number of verbs, there is a correlation between transitivity and tense: fonns of 
the present system (i.e. the present proper and imperfect) are employed transitively, 
while perfect forms are intransitive; cf. 3 sg. perf. tatana ‘has stretched’ (intr.) - 3 sg. 
pres. act. tanoti, 3 sg. pres. mid. tanute ‘stretches’ (trans.); see Kulikov 1999, where 
this phenomenon (“split causativity”) is discussed in a typological perspective. 
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SYNTAX 

Within a short chapter, it is impossible to discuss in detail the main syntactic features of Old 
Indo-Aryan. Only a few remarkable syntactic peculiarities will be mentioned in this section. 

Passive and ergative(-like) constructions 

Passivization typically suggests (i) the promotion of the initial direct object to the subject 
position (= the subject of the passive construction or passive subject for short) and (ii) the 
demotion of the initial subject (usually, an agent). The demoted subject either becomes 
an oblique object (encoded by the instrumental case, as, e.g., in RV 9.86. 12d s u vayudhah 
sotrbhih puyate vrsa ‘[Soma], the well-armed bull, is being purified by pressers’; more 
rarely by the genitive case (cf. RV 1.61.15a asma id u tvad anu dayyesam ‘this very thing 
was granted to him by them’) or, more frequently, remains unexpressed (see Gonda 1951 : 
77f.), as in RV 9.97.35c somah sutah puyate ajyamanah ‘Soma, pressed out, is purified, 
being anointed.’ See Jamison 1979: 133ffi; P. K. Andersen 1986. There are some reasons 
to treat Vedic constructions with perfect passive participles in -ta-/-na- and genitive mark- 
ing of the agent separately from canonical passives with the instrumental. According to 
P. K. Andersen (1986), the genitive noun displays a number of subject properties (usually 
animate; definite and/or refers to old information) in such constructions, and therefore 
they should be qualified as ergative rather than passive properly speaking. 

Reflexive constructions 

The reflexive pronoun tanu- has developed from the substantive meaning ‘body’ and, 
like its Old Iranian (OAv.) cognate tanu-, is well attested in this grammaticalized usage 
in the RV, as in RV 1.147.2 vandarus te tan u vam vande ague ‘As your praiser, I praise 
myself, o Agni.’ In some cases it is nearly impossible to draw with accuracy the distinc- 
tion between the reflexive and non-reflexive (‘body’) meanings: both interpretations are 
perfectly appropriate in the context, as in RV 10.54.3 van mdtdram ca pitaram ca sakam / 
ajanayathas tan,yah svavah ‘ . . . since you produced (your) mother and (your) father 
together from your own body / from yourself. ’ 

Next to the reflexive usages proper, the Vedic reflexive pronouns can be employed 
in emphatic usages, i.e. as an emphatic reflexive, or intensifier, signaling the fact that 
its referent is somewhat unexpected in the role where it appears (cf. two usages of Eng. 
-self: Peter saw himself in the mirror ~ Peter drew this picture himself)', see Kulikov 
2007. In the more common adverbial case pattern we find the instrumental forms (as in 
RV 6.49. 1 3 quoted below). The nominal pattern is attested, for instance, with accusatives 
and datives, as inAV 1.13.2 = RVKh. 4.4.2 mrdaya nas tanubhvo / mavas tokebhyas krdhi 
‘Be gracious toward ourselves, make pleasure for [our] offspring.’ 

The reflexive usage of atman- becomes common after the RV but is still in competition 
with tanu- in the AV (see Kulikov 2007). In Vedic prose, atman completely ousts tanu-, 
see Delbriick 1888: 207ff, 262f, Wackemagel 1930: 489ffi, §240b; and, especially, a 
brief survey in Oertel 1926, with a rich collection of examples. 

The emphatic usage is attested for atman- from the AV onward, cf. TS 1.7. 3. 3 tato 
deva abhavan parasura yasyaivam viduso ’nvaharva ahriyate bhavatv atmana parasya 
bhratrvyo bhavati ‘Then the gods prospered, the Asuras perished. He, who, knowing 
thus, performs the Anvaharya-rite, prospers himself, his rival perishes.’ 

In contrast to atman-, the more archaic stem variant tman- already occurs in emphatic 
usage in the early RV. Its instrumental appears in the very frequent regular form tmana 
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(63 attestations in the RV) and in the form tmanya (built on the stem tmam- or tmanya-, of 
unclear origin), which occurs in the late RV (1.188.10, 10.110.10) and in the late mantras 
(VS 20.45 = TB m 2. 6. 8.4 etc.), cf. RV 10.110.10 upava srja tmanya ‘Release [the sacrifi- 
cial animal] yourself. ’ 

SOME REMARKABLE TYPOLOGICAL FEATURES OF THE LATER 
(MIDDLE AND NEW) INDO-ARYAN LANGUAGES 

Although this chapter mostly concentrates on the earliest chronological layers of Indo- 
Aryan, which furnish the richest evidence for Indo-European comparative and historical 
linguistics it will be appropriate to briefly summarize the most salient typological fea- 
tures shared by all or most of the later (Middle and New) Indo-Aryan languages, which 
continue the developments and result from the innovations and trends starting as early as 
Old Indo-Aryan. 

Replacement of the old case system with new agglutinative cases 

By the end of the MIA period, that is, at the turn of the second millennium AD, the 
Indo-Aryan languages lost most of the cases of the original Sanskrit, or Old Indo-Aryan 
(OIA), system of eight cases 26 (which, except for minor details, is nearly identical to 
the case system reconstructed for Proto-Indo-European). Generally, only two cases sur- 
vive, direct (resulting from the merger of nominative and accusative) and oblique (mostly 
going back to the OIA genitive), although in some languages isolated traces of some other 
oblique cases, such as the instrumental, locative or ablative, can still be found, sometimes 
even within the declension paradigm; cf. the Sinhala instrumental case suffix -en/-in and 
Assamese ergative -e, both reflecting the OIA instrumental singular ending of a-stems 
-ena. The functions of the lost cases are largely taken over by morphemes (bound or free, 
i.e. postfixes or postpositions) of different origin. 

These include: 

(i) Primary, or “old”, postpositions, going back to Proto-Indo-European morphemes 
that were used in the adpositional function already in the proto-language. An exam- 
ple of an old OIA postposition reflected as a case suffix in a daughter language, in 
MIA, is the MaharastrT ablative suffix -ahi < Skr. postposition adhi (constructed with 
the ablative in OIA); see Insler 1991-92; Bubenik 1998: 68f. 

Next to old postpositions, there are several markers that result from grammaticalization 
of some verbal and nominal forms: 

(ii) Postpositions descendant from 

(ii.a) non-finite verbal forms, in particular, converbs (traditionally called “abso- 
lutives” or “gerunds”, cf. Skr. adava ‘with’, lit. ‘having taken’), gerundives 
(participia necessitatis) and verbal adjectives; 

(ii.b) case forms of some nouns (such as Skr. grhe ‘in the house’; see below); 

(iii) final members of compounds, which, again, may represent 

(iii.a) non-finite verbal forms (cf. nominal compounds in - sthita -; see below) or 
(iii.b) nominal case forms (cf. nominal compounds in -artham ‘goal, purpose’; see 
below). 
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The markers of the first three types (i, ii.a and ii.b), representing free morphemes (words), 
were originally attached to (non-nominative) case forms of the noun, thus forming, after 
having become bound morphemes, the second layer of case forms (see below). In type 
(iii), the source of the new case morpheme was attached to the nominal stem, thus creat- 
ing a new case within the first layer. In fact, due to the erosion of the nominal inflection by 
the end of the MIA period, some (oblique) case forms may eventually become indistin- 
guishable from bare stems, and thus the border between types (ii) and (iii) cannot always 
be drawn with accuracy. 

A number of examples of grammaticalization of new postpositions and case suffixes 
can be found already in the MIA period, in particular, in Apabhramsa Prakrits (for details, 
see Bubenik 1998: 67, 80), cf. the ablative postfix -tthiu < OlAsthita- ‘standing’ (passive 
perfect participle of the verb stha ‘stand’) and the locative postposition majjhe < Skr. 
madhye (loc. sg. of madhya ‘middle’) in Apabhramsa Prakrit (Bubenik 1998: 67, 80; 
Bubenik & Hewson 2006: 113): 


a. hiaya-tthiu 
heart-LOC 

‘out of [my] heart’ 

b. gharaho majjhe 
house: gen in 
‘in the house’ 


In NIA languages we observe a rapid increase in use of such new postpositions, which 
are normally added to the oblique case form. This grammaticalization may result in the 
amalgamation of a postposition with the nominal stem or oblique case and, hence, in the 
rise of a new case. Such is, for instance, the origin of some new case endings, for instance, 
Sinhala dat. -to, Khowar dat. -to < Skr. -artham ‘goal, purpose’; Sinhala gen. -ge < Skr. 
grhe ‘in the house’ (loc. sg. of grha- ‘house’). 

Case markers containing k- and/or r-, which go back to nominal derivatives of the OIA 
verbal root kr- ( kar -) ‘make, do’, can be found in several NIA languages. These include, 
in particular, genitive morphemes in several NIA languages (see, in particular, Bubenik & 
Hewson 2006: 122fi), such as Hindi -ka, -ke < Apabhramsa -kera < Skr. gerundive karya- 
‘to be done’; Awadhi, Maithili -ker < Skr. ptcp. perf. pass, krta- ‘done, made’; Bhojpuri 
-kce < Skr. adj. krtya- ‘to be done’. 

Likewise, some dative ^-morphemes, such as Hindi -ko, Oriya -ku, Marathi -ke and 
Romani -ke/-ge, reveal a vestige of the same Sanskrit root kr- {kar-). 

The initial stages of the corresponding grammaticalization processes can be dated as 
early as OIA. Thus, the starting point of the grammaticalization path of Skr. -artha ‘goal, 
purpose’ toward the Sinhala dative case suffix -to is the adverbial usage of the accusative 
of the Sanskrit bahuvrlhi compounds in -artha- ( X-artham ), meaning ‘having X as a goal, 
purpose’ — > ‘for (the sake of) X’: udakartham ‘having water as a goal’ — > ‘for water’, 
sukhartham ‘having happiness, pleasure as a goal’ — > ‘for happiness, for pleasure’, tadar- 
tham ‘having that as a goal’ — > ‘for that, therefore’. 

In NIA languages, the morphological status of the resulting markers may vary from 
bound morphemes (case suffixes), tightly connected with the nominal stem (as in Sinhala, 
cf. examples above), to free morphemes (postpositions). The latter type can be illustrated 
by the Hindi dative-accusative morpheme -ko, which can be shared in some constructions 
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by several nouns (as in ram aur mohan ko ‘to Ram and Mohan’), exemplifying a “Grup- 
penflexion”, which pleads for a postposition rather than for a suffix analysis. 

The difference between these groups of case morphemes is often described in terms of 
the distinction between cases of the first, second and third layers (Zograf 1976; Masica 
1991: 230ffi; Matras 1997). The first layer corresponds to the case in the strict sense of 
the term and, in Hindi, is limited to the opposition between the direct and oblique cases. 
The third layer corresponds to clear instances of postpositional phrases, while the second 
one takes an intermediary position between cases proper and postpositional phrases. It 
is important to note that only the first-layer case can trigger agreement on adjectives. 
Although both “Gruppenflexion” and the lack of agreement with second-layer cases 
appear to distinguish these morphemes from cases proper, the high degree of grammati- 
calization makes appropriate their association with the category of case in general. 

Productive morphological causatives 

NIA languages are notorious for their system of productive morphological causatives, 
continuing the OIA causatives in -(p)ava-, which go back to going back to the Proto- 
Indo-European *-eye/o-causative. This distinguishes Indo-Aryan from other Indo- 
European branches and, in particular, from many Western Indo-European languages that 
lost the remnants of the Proto-Indo-European causatives. From the MIA period onward, 
we observe the tendency to develop double (second) causatives. Many NIA languages 
distinguish between contact (direct) and distant (indirect) causatives and may have up 
to four members of causative chains, differing both in suffixes and root vowel; cf. Hindi 
khul- ‘open’ (intr.) - khdl- ‘open’ (trans.), cal- ‘go’ - cal-a- ‘drive (make go)’. 


Ergative construction 

Many NIA languages exhibit an ergative pattern, usually alongside a nominative-accu- 
sative construction (split ergativity). Ergativity is particularly common for Central and 
Western NIA languages, where ergative pattern appears with perfective verb forms; cf. 
Punjabi: 

tarkhan-ne kursi-a bana-T-a 
carpenter-ERG chair:F-PL.DiR make-PAST.PFV-F.PL 
‘The carpenter has made chairs.’ 

Historically, the ergative construction continues the (late) OIA nominal passive con- 
struction with the perfect passive “participle” (more precisely, verbal adjective); see 
section 5.1. 

FURTHER READING 

Editions and translations of ancient Indo-Aryan texts 

Virtually all important Vedic texts have been critically edited, and for most of them there 
are European translations, with one important lacuna, some parts of the Paippalada recen- 
sion of the Atharvaveda (on which see below). The most ancient OIA text, which is also 
the most important source of information for comparative Indo-European studies, the 
RV, was edited by Aufrecht ( 1 877), and remains the standard edition; there is also a more 
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recent edition by van Nooten and Holland ( 1 994), which offers a metrically restored text. 
There are a number of complete translations of the RV, which include that of Geldner 
(1951/2003; in German, still remains standard and most often quoted); a new German 
translation by M. Witzel, T. Goto and several other scholars (2007-; work in progress; by 
now two volumes have been published, which encompass books I to V); a Russian trans- 
lation by Elizarenkova (1989-1999); and the most recent English translation, by Jamison 
and Brereton (2014). Renou’s translation (1955-1969), published in several issues of 
Etudes vediques et panineennes, has remained incomplete but nevertheless is of extreme 
value for a linguistic and philological study of the RV. 

For the second most ancient OIA (Vedic) text, the Atharvaveda in the recension 
SaunakTya, editio princeps by Shankar Pandurang Pandit (1895-1898), still preserves 
its value; the standard European edition by Roth and Whitney (2nd rev. ed. 1 924) has no 
critical apparatus, which is partly reproduced in Whitney and Lanman’s (1905) transla- 
tion. Alongside a number of anthologies, such as Bloomfield 1897, Whitney and Lanman 
1905 had remained the only complete translation of the AV (without the last, 20th, book, 
or kanda, which almost exclusively is taken from the RV, however) till recently, when 
Elizarenkova ’s (2005-2010) Russian translation appeared. Sadly, the second and third 
volumes of this last work of the great Russian Vedicist were published posthumously, 
and the last 30 hymns of kanda XIX have remained untranslated. 27 Another (and presum- 
ably more ancient) recension, Paippalada, which has survived into modem times, became 
available to Sanskritists in fairly readable Orissa manuscripts only in the middle of the 
20th century. The edition, translation and study of the Paippalada is work in progress, 
one of the biggest challenges of the contemporary Vedic (and, in general, Indo-Aryan) 
scholarship. By now, approximately half of the books (kandas) of this important text have 
been critically edited and translated; see, in particular, Bhattacharya 1997, 2008 (edition 
of kandas I-XVI); Zehnder 1999, Lubotsky 2002, Griffiths 2009 and Kim 2014 (edition 
and translation of kandas II, V, VI— VII and VIII-IX, respectively). 

Next to numerous printed editions and translations of Vedic texts, there are also a 
number of online resources, such as the website of Thesaurus Indogermanischer Text 
und Sprachmaterialien, or TITUS (http://titus.uni-frankfurt.de/indexe.htm7/texte/texte. 
htm); GRETIL - Gottingen Register of Electronic Texts in Indian Languages (see http:// 
gretil.sub.uni-goettingen.de/); a few cumulative websites providing links to existing 
resources, such as INDOLOGY (http://www.indology.info/etexts/); as well as several 
resources for MIA and (early) NIA languages, such as, for instance, the Online Pali 
Tipitaka Website (http://tipitaka.sutta.Org/#/). Note, however, that some of these online 
editions may represent mere mechanical transliteration of certain printed editions, not 
always corresponding to standard scholarly notation, 28 and therefore should be used 
with caution. 

Historical grammars 

Although the main details of the historical grammar of the Indo-Aryan branch are 
discussed at length in numerous studies and reference books on Indo-European com- 
parative linguistics, a comprehensive updated monographic study tracing the origin 
of the entire OIA linguistic system from Proto-Indo-Iranian and, further, Proto-Indo- 
European remains a desideratum. The monograph Burrow 1973 offers an overview 
of the Sanskrit grammar in a historical perspective, but many of Burrow’s analy- 
ses and claims (as, for instance, his variant of the laryngeal theory with one single 
laryngeal phoneme) are questionable and not adopted by most Indo-Europeanists. 
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Comprehensive historical surveys of the Indo-European origins of the Indo-Aryan 
phonological systems can be found in Kobayashi 2004 (which covers only the conso- 
nant system) and a short overview (chapter) of the origins of the whole phonological 
system by the same author Kobayashi (forthcoming/2017). An excellent outline of the 
origin of the OIA morphology from Proto-Indo-Iranian and Proto-Indo-European is 
Got5 2013. 

The further history of the OIA linguistic system throughout the MIA and NIA peri- 
ods is outlined in the classical studies Bloch 1934/1965 and Chatterji 1926/1970, which 
appeared nearly a half century ago and also requires an update. A brief sketch of the lin- 
guistic developments from OIA to NIA is given in Oberlies (forthcoming/2017). 

Synchronic grammatical descriptions and dictionaries of 
OIA and MIA 

The most comprehensive grammatical description of the OIA linguistic system is the 
fundamental compendium, consisting of three volumes, started by Jacob Wackemagel 
at the end of the 19th century with the first volume of his Altindische Grammatik, con- 
tinued after his death by Albert Debrunner (Wackemagel & Debrunner 1896-1957), and 
still remaining unachieved. The three published volumes encompass phonology, nominal 
inflection, pronouns, numerals, nominal compounds and nominal suffixes. The fourth 
volume that should give an exhaustive description of the verbal system is now probably 
the main desideratum and challenge of Old Indian linguistics. Although a number of 
monographic studies have appeared by now, dedicated to nearly all verbal formations 
(that is, individual present and aorist types, the perfect, so-called secondary derivatives 
and a variety of regular deverbative nominal formations), such as the classical mono- 
graphs Narten 1964 (on sigmatic aorists), Goto 1987 (on thematic root presents = Old 
Indian present class I) and Kiimmel 2000 (on perfects), a full systematic overview of the 
entire verbal system remains a task for future researchers. 

There are dozens of more condensed grammars and many reference handbooks on 
Sanskrit. A few of them have become classics, and have remained such over decades, 
even though they were written more than a century ago and require updates. These 
include Whitney 1889 (which encompasses both Vedic and Classical Sanskrit), two Vedic 
grammars by Macdonell (1910 and 1916) and a very condensed (and therefore hardly 
appropriate for beginners), yet very rich Vedic grammar by Renou (1952). An extremely 
well-organized condensed outline of Classical Sanskrit grammar is Zaliznjak 1978 - a 
masterpiece of grammatical description. For late OIA, as attested in the great Old Indian 
epics, see Oberlies 2003. 

For syntax, Delbriick 1888 still remains probably the most comprehensive mono- 
graphic study, alongside with two equally old handbooks by Speijer [Speyer] (1886 and 
1896). An overview of the most important aspects of the OIA (Vedic) syntax in modern 
linguistic perspective is given in Kulikov (forthcoming/2017). 

The most comprehensive complete dictionary of Sanskrit remains the famous Great 
Petersburg Dictionary of Sanskrit (Sanskrit -German) compiled by O. Bohtlingk and R. 
Roth (1855-75) - a true monument of the Sanskrit lexicography. Even in spite of sev- 
eral lacunae due to the fact that several Vedic texts were unavailable (or at least not yet 
critically edited) at that time, it offers the most complete coverage of the lexical mate- 
rial. Further dictionaries are largely dependent on this chef d’oeuvre, including the most 
widely used more compact one-volume Sanskrit-English dictionary, Monier- Williams 
1899. A new lexicographic project that started in India in the 1970s, An Encyclopaedic 
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Dictionaiy of Sanskrit on Historical Principles (Ghatage et al. 1976-), under the gen- 
eral guidance of A. M. Ghatage (from vol. 3 on, the supervisor of the project was S. D. 
Joshi; from vol. 8 on, V. P. Bhatta), aims to update and replace the Petersburg Dictio- 
nary. This fundamental dictionary is far from completion, however, still dwelling on 
the character a. 

There are several concordances of OIA texts. The largest among them, under the gen- 
eral editorship of Vishva Bandhu (1935-1992), covers nearly all Vedic texts, locating 
all Vedic word-forms attested in the entire Vedic corpus (with relatively few exceptions 
and lacunae). The concordances for the RV are Lubotsky 1997 (word-forms without 
translations); Grassmann 1873, out-of-date in many respects but still useful; and Krisch 
et al. 2006- (ongoing project, aiming to replace Grassmann’s dictionary), both with 
translations. 

Finally, one should mention the grammatical dictionary of the Old Indian (non- 
denominative) verbs, Whitney 1885, conveniently cumulating all verbal formations 
derived from each verbal root. This lexicon has several lacunae, and some of Whitney’s 
interpretations are out-of-date, but nevertheless it remains an indispensable tool for any 
student of Sanskrit. A dictionary of Old Indian verbs Werba 1997 (only one volume pub- 
lished so far) is very rich and informative, but uses highly cryptic notation and is more 
difficult for beginners. 

For MIA, the most comprehensive overviews are Pischel 1900 (outdated in several 
parts but still very useful) and Hinuber 2001; see also an important syntactic survey, 
Bubenik 1998. A convenient didactically oriented survey of individual Middle Indie lan- 
guages is Mylius 2013, based on the personal teaching material of the author. Grammati- 
cal descriptions of individual languages, starting with the oldest documented MIA idiom, 
Pali, include important reference grammars, such as Oberlies 2001. 

General and typological overviews of the Indo-Aryan languages 

A comprehensive survey of the Indo-Aryan branch is offered in the classic book Masica 
1991. Masica 1976 is a pioneering work that defined the major features of the South 
Asian linguistic area (which includes virtually all Indo-Aryan languages); a short over- 
view of the languages of this area can also be found in Zograf 1990. Zograf 1976 gives 
an excellent survey of the typologically relevant morphological features of the modern 
Indo-Aryan languages. Southworth 2005 is a useful overview of the issues arising at the 
crossroads of linguistics, history and archaeology, intended, above all, for non-linguists, 
focusing on the aspects of linguistic research that are relevant for those interested in the 
history and, especially, prehistory of South Asia. 

An encyclopedic survey of the major languages of the Indo-Aryan branch is given in 
the volume edited by G. Cardona and D. Jain (2003) within the Routledge language fam- 
ily series. Although this thousand-page compendium is very informative, it is, unfortu- 
nately, quite poorly edited, very inconsistent in terminology and inaccurate in glossing, 29 
which severely hampers the use of the volume as a reference handbook for the Indo- 
Aryan languages; besides, it offers rather limited coverage of the Indo-Aryan languages 
(about 20 languages). Much more comprehensive, methodologically consistent and uni- 
form are two encyclopedic volumes published within the series Jazvki mira (Languages 
of the World), Elizarenkova et al. (2004) (on OIA and MIA) and Oranskaja et al. (2011). 
The latter volume offers an unprecedented coverage of the NIA languages, containing 
short but very informative sketches of nearly 40 languages, prepared in accordance with 
rigorous and methodologically consistent principles. 
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NOTES 

1 This is a somewhat simplified picture, since, in some periods and/or communities, 
certain MIA languages could even overrun Old Indo-Aryan (Sanskrit) in prestige. 

2 Except for a few highly infelicitous attempts to introduce the Indian system of accent 
marking into the Latin transcription, such as the one adopted in Cardona & Jain 2003. 

3 dvi-pat ‘biped’, catus-pat ‘quadruped’. 

4 dirgha-srut ‘heard far’ RV 8.25.17. 

5 Almost exclusively from the late RV on and before vowels: sant-au RV 1.184.1, 
10.1 17.9, yant-auKV 1.139.4. 

6 vi-carant-ah (wrong accent) RV 8.55.4. 

7 Only santi RV 2.28.1, 8.8.23 (Padapatha sant-i). 

8 pasumanti RV 9.92.6, 9.97.1 and ghrtavanti ‘having ghee’ RV 9.96.13 (Pp. °ant-i). 

9 The ratio of the endings -d : -au in the RV is 7:1. 

10 -aih :: -ebhih : RV 1:1 ; AV 5:1 ; Br.+ almost exclusively -aih. 

11 -a:: -ani : RV 2:1 ;AV 1:2. 

12 In the case of thematic and thematicized suffixes such as -ya-, -sa-, -nva- etc., the the- 
matic vowel (a) is traditionally regarded as a part of the suffix, the suffixes “properly 
speaking” being -y-, -s-, -nv-. 

13 Athematic conjugation, after vowels. 

14 Athematic conjugation, mostly after consonants. 

15 Pres, class IX. 

16 With reduplicated presents (class III). 

17 With nearly all reduplicated presents (class III), some root presents (class II) and 
many root aorists. 

18 Mainly late early and middle Vedic. 

19 Mainly late early and middle Vedic. 

20 Only 1 x in the RV. 

21 In the optative. 

22 An example of this type might be the Rgvedic present pjvati ( *r-nv-a-ti ?) ‘moves, 
raises’, which does not occur unambiguously accented, however. 

23 The only formation that might be qualified as the athematic counterpart of this latter 
type is the RVic hapax tar-u-te (Vtf ‘pass, overcome’), attested in RV 10.76.2. 

24 Athematic middle participles with the suffix -ana- exhibit unusual syntactic proper- 
ties in the language of the RV. While the corresponding finite forms are used only 
transitively, the -ana-participles are attested in both transitive and intransitive (pas- 
sive) patterns (see Delbriick 1888: 264). For instance, the participle hinvana- (root 
hi ‘impel’), taken by all grammars as the middle participle of the nasal present with 
the suffix -no-/-nu- (class V in the Indian tradition), occurs 18 times in intransitive 
(passive) patterns and 10 times in transitive patterns in the RV). Similarly, the par- 
ticiple yujana- (root yuj ‘yoke’) occurs 8 times in intransitive (passive) syntactic 
patterns (e.g. ratho yuj-anah ‘a chariot that has been yoked’) and 14 times in transi- 
tive syntactic patterns (as in yuj-ano harita rathe ‘yoking two fallow [horses] to the 
chariot’). As demonstrated in Kulikov 2006a, 2006b, the grammatical characteristics 
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of such passive -ana-participles should be reconsidered. These participles are gram- 
matically ambiguous; that is, they belong to the following two paradigms: (1) to the 
paradigm of the (middle) root aorist and medio-passive aorist and (2) to the paradigm 
of the (middle) present and stative. Thus, the participle hinvana in its transitive use, 
meaning ‘impelling’, belongs to the paradigm of the transitive nasal present ( hinvate 
etc.), but it is a member of the paradigm of the stative (3 sg. hinve, 3 pi. hinvire), i.e. 
a stative participle, when used intransitively (in a passive syntactic pattern), mean- 
ing ‘impelled’. Similarly, vujana- is a member of the paradigm of the (transitive) 
root aorist ( ayukta etc.) when used transitively (‘yoking’), but it is a member of the 
paradigm of the passive aorist (3 sg. ayoji, 3 pi. ayujran), that is, a passive aorist 
participle, when used in passive syntactic patterns (‘yoked’). Likewise, 3 sg. and 3 
pi. middle perfect forms (with the endings -e and -re, respectively) attested in passive 
constructions should be taken as statives built on perfect stems, rather than as middle 
perfects proper. For instance, the form dadhe (root dha ‘put’) should be taken to be 
a 3 sg. form of the middle perfect when it means ‘has put’, and a 3 sg. form of the 
stative with the passive interpretation ‘is put/has been put’. 

25 As opposed to the -dva-presents with the short grade of the root, which are mostly 
intransitive. 

26 For a general survey and discussion of the evolution of the Indo-Aryan case sys- 
tem, see, in particular, Bloch 1934; Zograf 1976; Bubenik 1998: 99-101; Bubenik& 
Hewson2006: 102ff. ; Masica 1991: 230ff. 

27 An annotated translation of these hymns is now being prepared by the author of 
this chapter (Kulikov, in preparation) and will be published as a separate volume, 
together with a translation of the independent (i.e. not borrowed from the RV) Kunta- 
pa-hymns of book XX. 

28 This is, for instance, the case with the TITUS online edition of the Satapatha- 
Brahmana, in particular, as far as the word division and accent notation is concerned. 

29 For instance, readers interested in Indo-Aryan ergativity will find a few remarks on 
ergative markers and constructions in Hindi and Urdu, but no mention of ergativity 
in the chapter on Assamese. In fact, however, the authors of this chapter posit two 
endings of the nominative case, -e and -0 (p. 419), used in transitive and intransitive 
constructions, respectively, and thus representing two different cases, ergative and 
absolutive/nominative. Likewise, readers will not find any mention of converbs (one 
of the remarkable typological features of Indo-Aryan languages); instead, individual 
chapters use a few less standard terms such as “conjunctive participle” or “absolu- 
tive”. Furthermore, the chapter on the earliest attested Indo-Aryan language, Sanskrit, 
uses heavily Panini-dependent notation and is barely accessible for non-Sanskritists. 
These are obvious editorial slips and inconsistencies; see detailed criticism in the 
reviews Kulikov 2004 and Peterson 2006. 
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CHAPTER 3 


IRANIAN 


Nicholas Sims-Williams 


INTRODUCTION 

At present, Indo-European languages of the Iranian group are spoken over a wide area 
including virtually the whole of Iran, Afghanistan and Tajikistan, together with the neigh- 
bouring parts of Turkey, Syria, Iraq, Pakistan and Uzbekistan, as well as larger or smaller 
enclaves in Oman, Annenia, Georgia, Azerbaijan, Turkmenistan and western China (see the 
map at the end of Schmitt 1989). In mediaeval times Iranian languages such as Sogdian and 
Khotanese were well established even further east, in the area which later became Chinese 
Turkestan (Xinjiang); at a still earlier period, the original homeland of the Iranian-speaking 
peoples seems to have lain to the northeast of the present state of Iran. 

The present chapter will concentrate on the earliest attested Iranian languages, Avestan 
and Old Persian, which are naturally the most important for Indo-European studies. These 
two Old Iranian languages will be described to a large extent in terms of their similarities 
to and differences from the closely related Old Indian, a procedure justified in the first 
place by pragmatic considerations. Old Indian (Vedic and Sanskrit) is attested by a huge 
and varied corpus of literature, written in a clear, almost phonemic script which allows 
the phonological and morphological structure of the language to be clearly perceived. 
On the other hand, each of the attested Old Iranian languages is known from a limited 
corpus - in the case of Old Persian, a tiny corpus - of rather repetitive and monotonous 
texts, one written in an ambiguous cuneiform writing system, the other by means of an 
over-elaborate, almost phonetic alphabet, whose intricacies obscure rather than illumi- 
nate its grammatical structure. Although some of these deficiencies are made good by 
the more abundant Middle and Modern Iranian material, it cannot be denied that Iranian 
evidence is usually more difficult than Indian for a student of Indo-European to evaluate. 

From a theoretical point of view, too, it is proper to treat the Iranian languages in 
constant comparison with Indian, since the two groups are not merely closely related 
but jointly constitute a single Indo-Iranian branch of the Indo-European family, as is 
indicated by the innumerable phonological, morphological and lexical isoglosses which 
they share to the exclusion of all other branches of Indo-European. One such isogloss is 
the use of OInd. arya-, Av. a‘riia-, OPers. ariya- (from which the name of the country 
“Iran” derives) as a self-designation for the speakers of Indo-Iranian, whence the alterna- 
tive term “Aryan”. The closeness of the relationship between Indian and Iranian is most 
clearly demonstrated by the fact that it is possible to find not just words but whole sen- 
tences in Vedic or Avestan which may be transposed from the one language into the other 
merely by observing the appropriate phonological rules; e.g. Av. tam amauuantam yaza- 
tam suram damohu sauuistam midram yazai ‘this powerful, strong (being) worthy of wor- 
ship, Mithra, the strongest amongst creatures, I shall worship’ (Yasht 10.6) = Ved. *tam 
amavantam yajatam suran dhamasu savistham mitram yajai (Jackson 1892: xxxi-xxxii). 

Despite the overwhelming similarity of Indian and Iranian, each is distinguished from 
the other by a number of characteristic innovations. Phonological innovations on the Indian 
side include the loss of the Indo-Iranian diphthongs *ai, *au (> e, o ) and voiced sibilants (*z, 
*z, etc.) and the development of a series of retroflex consonants ( t , n, s, etc.), whilst Iranian 
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languages typically show the loss of the voiced aspirates *b h , *d h , *g h , etc. (> b, d, g), the 
development of the voiceless fricatives ^ 6 , x (from Ilr. *p, *t, *k before consonants, and 
from Ilr. *p h , *t h , *k h ), the depalatalization of Hr. *c, *j( h ) (> *ts, *dz, OPers. 6 , d, Av. 5, z) 
and the change of *s (in most positions) to h. Some apparent exceptions to these isoglosses 
may be due to the reversal of a sound-change: for instance, Av. pt (as in hapta ‘seven’) may 
derive from the expected *ft (as attested, directly or indirectly, in all other Iranian languages, 
e.g. Pers. haft) rather than preserving Ilr. *pt (cf. OInd. sapta). In other cases, however, it 
is clear that a development characteristic of Iranian cannot in fact have been fully carried 
through at the Common Iranian stage: cf., for instance, p. 271 below on evidence for the 
survival of the palatal *c in certain clusters. Similarly, the development of *s to h, though 
common to Avestan, Old Persian and all later Iranian languages, is demonstrably later than 
the earliest attestations of Iranian in Ancient Near Eastern sources (cf. the next paragraph on 
the divine name Assam mazas). Thus, at least in phonology, the innovations attributable to 
Common Iranian are comparatively few in number (though significant in kind). 

The original homeland of the Aryans, the speakers of Common Indo-Iranian, cannot be 
precisely identified, but is thought to have been in western Central Asia, to the east and 
northeast of the Caspian Sea. At a time when “Proto-Indian” and “Proto-Iranian” (i.e. the 
ancestral dialects from which the Indian and Iranian languages respectively derive) had 
already become differentiated to some extent, perhaps about the beginning of the 2nd mil- 
lennium B.C.E., two groups of “Proto-Indian” or “Indo-Aryan” speakers began to migrate 
from this homeland, one towards the west (cf. above, p. 205, on traces of the Indo-Aryans 
in the Hurrian empire of Mitanni in northern Mesopotamia) and the other towards India. 
At a later date, Iranian tribes too began to migrate westwards, reaching central and western 
Iran by the middle of the ninth century B.C.E., at which period they are referred to for the 
first time in Assyrian sources; whether they had come from the northeast by the most direct 
route, to the south of the Caspian, or more circuitously through the Caucasus is still uncer- 
tain. (For a more detailed summary and references to the literature on the prehistory of the 
Aryans see Schmitt 1987.) From the ninth century B.C.E. onwards, a scattering of Iranian 
linguistic material is to be found in Mesopotamian sources, beginning with the names of the 
Medes ( Matai ) and Persians ( Parsuas ) and most notably including the name of the principal 
deity of the Iranians in the form Assam mazas (= Common Iranian *Asura-mazdas, later 
*Ahura-mazdah, cf. OPers. Aummazda, Av. Ahum Mazda). 

It is in part as a result of the fact that Common Iranian cannot have differed greatly 
from Common Indo-Iranian (or “Aryan”) that it is difficult to determine the precise sta- 
tus of the so-called Nuristani languages (formerly known as “Kafiri”). This group of 
languages, recorded in modern times in the northeast of Afghanistan and neighbouring 
parts of Pakistan, undoubtedly belongs to the Indo-Iranian family, but the nature of the 
relationship is not clear. All three theoretically possible solutions have been advocated: a 
third, independent sub-group beside Indian and Iranian (Morgenstieme 1973: 327-343); 
an archaic form of Iranian, much influenced by several millennia of proximity to lan- 
guages of the Indian group (Mayrhofer 1983); or a branch of “Proto-Indian” which split 
off before the arrival of the Indo-Aryans in India and subsequently developed in intensive 
contact with Iranian languages (Degener 2002). 

Only two Old Iranian languages are attested by texts, namely Avestan and Old Persian. 
Others, such as Median and Scythian, are known to us only through occasional words and 
names transmitted in texts in other languages. 

Avestan is the language of the Zoroastrian scriptures, the Avesta, the earliest parts of which 
are the Gathas (“Songs”) of Zoroaster or Zarathushtra - whom tradition places in the sixth cen- 
tury B.C.E., though many scholars argue, partly on linguistic grounds, for a date five centuries 
or more earlier- and the Yasna Haptarjhffti “Service consisting of seven chapters”. These texts, 
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together with a few short prayers, are preserved in Old or “Gathic” Avestan, a highly archaic 
dialect comparable to Vedic in its stage of development. Later Avestan, also known as Younger 
Avestan, is attested by a much larger corpus of texts, including the Yashts (hymns in honour of 
individual divinities) and the Videvdad (“Law against the demons”). The manuscript tradition 
of the Avesta goes back to the Sasanian period (224—651 C.E.), when these orally transmitted 
texts were written down, probably for the first time, in a specially invented and extremely elab- 
orate alphabetic script (see Hoffmann & Narten 1989 and the script table below). 
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The Avestan orthography was designed to preserve the traditional pronunciation with 
great accuracy and contains much phonetic detail which is irrelevant for the compara- 
tivist. For example, the word for ‘land’ appears in such different forms as da‘tjhu- and 
daxiiu-, both representing what is etymologically and probably phonemically *dahyu-. 
A particularly confusing feature of the Avestan writing system is the frequent notation 
of anaptyctic and epenthetic vowels. In the present chapter such unetymological vowels 
will be written in superscript, as in dadSti ‘he gives’ (= OInd. dadati) - contrast the diph- 
thong ai in aidi ‘come! ’ - or GAv. d ai bitiia- ‘second’ (= OInd. dvitTya-). Note too that the 
semi-vowels y and w are regularly represented by ii and uu (which can equally represent 
the sequences iy and uw ) and that T, it are not consistently distinguished from /, u. 

Old Persian, which is known from inscriptions of the Achaemenian period (sixth to 
fourth centuries B.C.E.), represents a later stage of linguistic development as well as 
a different dialect from the language of the Avesta. Like Avestan, it is written in a spe- 
cially invented script, in this case a form of the cuneiform writing commonly used in the 
Ancient Near East. The Old Persian script combines syllabic and alphabetic principles. 
For example, there are two F-signs, of which t“ is syllabic (representing [tu(:)], since F and 
it are not distinguished from i and it) whilst t can represent either a syllable [ta, to] or the 
simple consonant [t]. Since there is no sign for *t‘ (though comparable signs such as d‘ do 
exist), [ti] or [ti:] has to be written by means of two signs (t-i), a combination which can 
also denote [tai] . The fact that a sign such as t has both syllabic and consonantal values 
is the source of much ambiguity, as is the fact that in most cases a nasal is not written 
before another consonant. As a result of these two deficiencies, for instance, the 3 sg. 
pres. ind. act. and mid. (-ti and -tai) and the equivalent pi. endings (-nti and *-ntai) are all 
indistinguishable in writing. In this chapter, for the sake of clarity, Old Persian forms will 
generally be cited in phonemic transcription rather than in transliteration. (On the Old 
Persian writing system see further Hoffmann 1976: 620-645.) 

Only a brief survey can be given here of the great variety of languages attested at the 
Middle Iranian stage. Western Middle Iranian is represented by Middle Persian, which 
is essentially, though not in every detail, a later form of the same dialect as Old Persian, 
and by Parthian. The Eastern Middle Iranian languages include Khotanese and the closely 
related Tumshuqese, which are the most conservative of the Middle Iranian languages in 
their morphology, Sogdian, Bactrian and Choresmian. Amongst the even more numerous 
Modern Iranian languages we shall occasionally have reason to refer to Persian (or New 
Persian), Pashto, Ossetic and the Shughni group. Further information on these and other 
Iranian languages may be found in the relevant chapters of the Compendium Linguarum 
Iranicarum (Schmitt 1989); on the Middle Iranian languages see also Henning 1958. 

PHONOLOGY 

The vocalic system of Common Iranian is almost identical to that of Old Indian. The main 
difference is the lack of e and 6, the diphthongs from which OInd. e and o derive being 
preserved in Old Iranian as ai (Av. ae or oi) and au (Av. ao or §u). Cf. OPers. daiva-, 
Av. daeuua- ‘(evil) god, devil’ = OInd. deva- ‘god’; OPers. rautah- ‘river’ = OInd. sro- 
tas- ‘stream’. The comparatively rare long diphthongs ai and au, which are shortened 
in Indian, also survive in Old Iranian, cf. Av. instr. pi. vasnais ‘by sacrifices’ = OInd. 
yajnais; Av. nom. sg. gaus ‘ox, cow’ = OInd. gaus. 

The etymological origins of the Iranian vowels a, i, u, r (= [a r ]), a, T, ft are in general 
the same as those of the equivalent Old Indian vowels. In particular, as in Indian, PIE 
*a, *e, *o, *n, *m fall together as a, and the corresponding long vowels (including those 
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which ultimately derive from short vowel + laryngeal) as a. Brugmann’s Law, according 
to which PIE *o gives a in open syllables, seems to apply in the same circumstances 
in both branches of Indo-Iranian, e.g. Av. nom./acc. sg. dd"ru, OInd. dam ‘wood’ = Gr. 
56pu. However, the contexts in which the PIE laryngeals are vocalized (to Iran, i, OInd. 
X, e.g. OPers./Av. pitar-, OInd. pitar- ‘father’ < PIE *ph 2 ter-) are more restricted in Ira- 
nian, resulting in many cases of the correspondence Iran. 0 : OInd. t, e.g. GAv. dug’dar-, 
LAv. duydar- = OInd. duhitar- ‘daughter’ < PIE *d h ugh 2 ter-; GAv. var’rjte = OInd. vrrfite 
‘chooses’ < *wl-nH-toy. Where Old Indian has ir, ur (before vowels) or Tr, ur (before 
consonants) from PIE *rH or *1H, Iranian has uniformly ar: 

Av. sarah- = OInd. siras- ‘head’ < *krh 2 os- 
OPers. paru- = OInd. puru- ‘much’ < *plh,u- 
OPers. darga-, Av. dar’ga- = OInd. dTrgha- ‘long’ < *dlh,g h o- 
Av. vai-’nd- = OInd. lima- ‘wool’ < *h,wlh[neh 2 - 

Since short a and long a probably differed markedly in quality as well as in quantity (as 
they do both in Sanskrit and in many Modem Iranian languages), the system of simple 
vowels in Common Iranian may be represented diagrammatically as in Figure 4.3. 



FIGURE 4.3 SIMPLE VOWELS 

This simple system seems to survive almost unchanged in Old Persian (in so far as the 
inadequate cuneiform orthography allows one to tell), though the unitary sound r had 
probably developed into a sequence of vowel + consonant, most likely [or] (written a-r- in 
initial position, but distinct from the sequence [ar], as is proved by its different fate in Mid- 
dle and New Persian). A similar development is found in Avestan, where r usually gives 
ara (i.e. ar 3 ) as in kar’nao'ti ‘makes’ = OInd. krnoti. But many other outcomes of r are 
found in Avestan, e.g. ar before s ( arsti - ‘spear’ = OPers. a-r-s-t-i- [srfti-], OInd. rsti-), ahr 
before k, p ( vahrka - ‘wolf’ = Pers. gurg < OPers. *v3rka-, OInd. vrka-, cf. below, p. 272— 
273), ir before y (pres, stem kiriia- ‘to be done’ = OPers. kariva- < *krya-, OInd. kriya-), 
ra after t (pres, stem trafiia- ‘to steal’ = Sogd. caf- < *trfya-, cf. OInd. °trp- ‘stealing’). 

The example of r may serve to illustrate the complexity of Avestan vocalism. Consid- 
erations of space make it impossible to follow in equal detail the development of all the 
vowels (for which see de Vaan 2003), but the most important contextual changes must 
be noted. These include the (re)appearance of mid-high vowels e and d. In final posi- 
tion Hr. *-ai and *-au give -e and -o (or -uud) respectively (e.g. 3 sg. pres. mid. ending 
-te = OPers. -tai, OInd. -te < *-tai, PIE *-toy), while *-ya gives LAv. -e (e.g. a-stem gen. 
sg. ending LAv. -ahe = GAv. -ahiia, OPers. -ahaya, OInd. -asva, PIE *-osyo). Internally, 
a often becomes e between two palatal sounds, as in GAv. yehiia, LAv. verjhe ‘of whom’ 
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(= OInd. yasva), and o between p/g/m/v and u, as in po“ru- ‘much’ (= OPers. paru-, cf. 
above on this page). Final -5 (GAv. -5 or -§) and -a most often derive, via *-ah and *-ah, 
from *-as (PIE *-os, *-es) and *-as respectively, e.g .yd (GAv. ya) ‘who’ (nom. sg. m.),ya 
‘who’ (nom. pi. f.). Before nasals, especially in final syllables, a and a normally develop 
into a and q (= [aa]) respectively, so that -am and -qm are the regular ace. sg. endings of 
a-stems and a-stems. The a which arises thus is subject to further changes, for instance 
to i after a palatal, as in °cinah- ‘desire’ < *canah- (beside OPers./Av. °canah-, OInd. 
canas-). The sequences *(i)ya, *(u)w 9 frequently contract to F, u (or i, u, since the length 
of these vowels is not consistently noted in Avestan), *aya, *awa to ae, ao. Cf. Tm ‘this’ 
(= OPers. iyam, OInd. iyam, nom. sg. f.); LAv. turn ‘thou’ (= GAv. tuuSm, OPers. tuvam, 
OInd. tuvam, nom.); aem ‘this’ (beside GAv. aiiam, OInd. aydm, nom. sg. m.); baon ‘they 
became’ (= OInd. (a)bhavan). 

The consonantism of the Iranian languages diverges much more fundamentally from 
that of Old Indian. Two major innovations in Iranian are the loss of all aspirates and the 
appearance of a series of fricatives ( f 6, x ) unknown to Old Indian. In most cases these 
fricatives derive from p, t, k in preconsonantal position, but they also correspond to the 
Old Indian voiceless aspirates ph, th, kh (in all positions). The voiced aspirates (PIE *b h , 
*d h , *g h /g wh , OInd. bh, dh, gh ) merely lose their aspiration, thus merging with the original 
non-aspirate series: 

Iran, p = OInd. p: OPers./Av. pitar-, OInd. pitar- ‘father’ 

Iran./ = OInd. p: A v.friia-, OInd. priva- ‘dear’ 

= OInd. ph: Av. kafa- ‘foam’, OInd. kapha- ‘slime’ 

Iran, t = OInd. t: OPers. tuvam, GAv. tuuSm, LAv. turn, OInd. tuvam (nom.) ‘thou’ 
Iran. 0 = OInd. t: OPers. Quvam, Av. Ofiqm, OInd. tvam (ace.) ‘thee’ 

= OInd. th: Av. paOd, OInd. pathas (gen. sg.) ‘way’ 

Iran, k = OInd. k: Av. kuOra, OInd. kutra ‘whither?’ 

Iran, x = OInd. k: OPers./Av. xsap-, OInd. ksap- ‘night’ 

= OInd. kh: Av. xa-, OInd. kha- ‘spring, well’ 

Iran, b = OInd. b: Ossetic bal ‘group’, OInd. bala- ‘power’ (?) 

= OInd. bh: OPers./Av. bratar-, OInd. bhratar- ‘brother’ 

Iran, d = OInd. d: Av. dantan-, OInd. dant- ‘tooth’ 

= OInd. dh: Av. daenu- ‘female’, OInd. dhenu- ‘cow’ 

Iran, g = OInd. g: Av. ga‘ri-, OInd. giri- ‘mountain’ 

= OInd. gh: Av. gar’ma-, OInd. gharma- ‘heat’ 

Despite their loss of aspiration in Iranian, the PIE voiced aspirates are still occasionally 
distinguishable from the equivalent non-aspirates by the effects of Bartholomae’s Law 
(cf. above, p. 229), according to which a combination such as *g h +t was assimilated to 
*gd h in Indo-Iranian (and perhaps already in Indo-European), whereas *g+t gave *kt. By 
this rule, which applied to all combinations of voiced aspirate + voiceless stop or sibilant, 
one may deduce from a form such as GAv. aog’da ‘he said’ (= aog + morpheme -ta) that 
the root aog originally ended in an aspirate *g h or *g wh ; in this case *g h is confirmed by 
Gr. euxeoBai, etc. Unfortunately, the contrary deduction cannot usually be made from 
the presence of a voiceless cluster, since the effects of Bartholomae’s Law tended to be 
cancelled out by the restoration of the normal form of the morpheme, as in LAv. aoxta 
for GAv. aog’da or OPers./LAv. basta- ‘bound’ for expected *bazda- (= OInd. baddha-). 

Common Iranian was rich in both sibilants ( s , z, s, z) and affricates (c,J, i.e. [tf, <± 3 ] - 
differing from the OInd. c,j, which, at least in the earliest period, were palatal stops - and 
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possibly *c, *j, i.e. [ts, dz]). These stem in part from PIE *s and in part from the “two 
series of palatals”, i.e. (i) PIE *k, *g, *g h (> OInd. s,j, /?) and (ii) PIE *k/k w , *g/g w , *g h / 
g wh when secondarily palatalized before PIE *e, *i, *y, etc. (> OInd. c,j, h). The history 
of these sounds is rather complicated, but is worth examining in some detail in view of 
the fact that Iranian here retains evidence of distinctions which are lost in Old Indian. 

For Indo-European only one sibilant is to be assumed, namely *s (with the allophone 
*z in clusters such as *zd). In addition to its role as an independent phoneme, PIE *s 
has a secondary origin as an automatic feature of the juncture of two dental stops (cf. 
above, p. 54): *t+t/*d+t = *tst, *d h +t = *dzd h , e.g. PIE *sed+to- = *setsto- ‘seated’ > 
OInd. satta -, MPers. [nijsast (< OPers. *[ni]sasta-), Lat. sessus, cf. also OIr. sess ‘seat’, 
etc.; *wrd h +to- = *wrdzd h o- ‘increased’ > OInd. vrddha-, Av. var’zda- (the development 
of a voiced group in the latter case being a further instance of Bartholomae’s Law). 
As these examples indicate, the resulting clusters were simplified in different ways in 
Indian, where the sibilant disappeared, and in Iranian, where the first of the two dental 
stops was lost, giving the regular correspondences OInd. tt : Iran, st and OInd. ddh : 
Iran. zd. 

In Indo-Iranian, much as in Slavic (cf. below, p. 526-527), PIE *s and *z underwent a 
split, becoming Indian s, *z, Iran, s, z after the sounds collectively known as “RUKI” (i.e. 
r, r; it, au\ k and other velars and palatals; i, ai ) but remaining, at least in the first instance, 
unchanged in other contexts. Examples: the loc. pi. ending Av. -su (OInd. -su) after stems 
in u (etc.) but -su after stems in ant; Av. mizda- ‘reward’, OInd. rmdha- < *mizdha-, PIE 
*miz-d h (h,)o- (Gr. piaPoq) but OPers./Av. Mazda- (divine name) < PIE *mnz-d h eh 1 -, cf. 
OInd. medha- ‘wisdom’ < PIE *mnz-d h h,-eh 1 -. (Note that the Iranian forms with z and z 
here clarify their Indian counterparts, which have become opaque as a result of the loss 
of voiced sibilants in Old Indian.) This change does not affect Iran, st, zd < *tst, *dzd h , 
showing that the sibilant was still protected by the preceding stop when the RUKI rule 
operated: Av. vista- ‘known’ < *witsto- = *wid+to- (Gr. piaBoq, Olr.fess). In Iranian (but 
not Indian) the change to s, z also takes place after a labial: Av. difiza-, OInd. dipsa- < 
*di(d)bz h a-, desiderative of Av. dab, OInd. dabh ‘to injure, deceive’. 

Finally, those instances of PIE *s which had so far survived unchanged in Iranian 
underwent a further split, s remaining in groups such as sn, sp, st, *ts (> s) but becoming 
h in all other contexts, e.g. OPers. a(h)mi, Av. ahmi, Khot. Tma ‘I am’, OPers. hand, Av. 
hand, Khot. indd ‘they are’ (= OInd. dsmi, sand < PIE *h,esmi, *h[Senti), but OPers. asti, 
Av. asti, Khot. asta ‘he is’ (= OInd. asd < PIE *h,esti); a-stem loc. pi. Av. -a-hu (= OInd. 
-a-su). Although this development is found in all Iranian languages it must be compar- 
atively late, since Proto-Iranian forms with s (for later /?) are preserved in ancient Near 
Eastern sources (cf. above, p. 264). 

An important implication of this fact is that the development of the PIE “first palatal 
series” (*k, *g, *g h ) to sibilants (s, z), which occurs in Avestan and all branches of Ira- 
nian other than Old Persian (and later dialects of southwestern Iran), must also be later 
than the Common Iranian period, since the 5 arising from PIE *k does not participate 
in the change of PIE *s to /?. As a plausible intermediate stage between the attested Ira- 
nian series (Av. s, z, z, OPers. 0, d, d) and the presumed Indo-Iranian palatal affricates 
*c, *j, *j h (< PIE *k, *g, *g h ) the dental affricates *ts and *dz may be reconstructed for 
Common Iranian. In Figure 4.4 the postulated development of this “first series” of pal- 
atals is set beside that of the “second series” (i.e. the Indo-Iranian palatal stops arising 
from the secondary palatalization of PIE velars or labio-velars, which eventually gave 
palatal affricates in Iranian) in order to show how the resulting Iranian and Indian forms 
disambiguate one another. 
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FIGURE 4.4 THE FIRST AND SECOND SERIES OF PALATALS IN IRANIAN 

As Figure 4.4 shows, only the voiceless sounds (OInd. s, Iran. *ts < PIE *k; OInd. c, 
Iran, c < PIE *k/k' v before a palatal) are etymologically unambiguous. Each of the voiced 
sounds, OInd. j and h, Iran. *dz and f, has a double origin, since Old Indian confuses 
the two palatal series while Iranian (as always) confuses aspirates and non-aspirates. 
However, the ambiguity is resolved in the case of words preserved in both branches of 
Indo-Iranian, each of which preserves the distinction lost in the other: 

Iran. *ts = OInd. s: OPers. 0ard-,Asr. sai-’d-, cf. OInd. sarad- ‘year’ (PIE *k) 

Iran. *dz = OInd./: OPers. yad, Av. yaz, OInd. yaj ‘to worship’ (PIE *g) 

= OInd. h : OPers. daraniya-, Av. zaraniia-, OInd. hiranya- ‘gold’ (PIE *g h ) 

Iran, c = OInd. c: OPers. ci, Av. c/7, OInd. cit (enclitic) ‘also, even’ (PIE *k w ) 

Iran./ = OInd./: Av.ja'ni-, OInd. yaw- ‘woman’ (PIE *g w ) 

= OInd. h : OPers./Av./an, OInd. han ‘to strike, kill’ (PIE *g wh ) 

The depalatalization seen in the unconditioned reflexes of the PIE palatals (Av. s, z, 
OPers. 6, d) failed to take place in most combinations with consonants, where the usual 
outcome in all Iranian languages is palatal s, z, as in A v.fsu- beside pasu- ‘sheep’ (OInd. 
pasu-, PIE *p(e)ku-). In most cases the Old Indian equivalent is retroflex s, *z, cf. Av. 
asta ‘eight’ = OInd. asta (PIE *Hoktoh,); Av. msr'zdika- ‘mercy’ = OInd. mrdika- (< 
*mrzdlka-, PIE *-gd-). An important special case is that of PIE *sk, which gives OInd. 
(c)ch, Iran, s, as in the inchoative present stem OInd. gaccha- ‘to come’, Choresmian 
[n]ys- < *[ni]gasa- ‘to arrive’ (a more archaic fonn than A v.Jasa-), all < PIE *g w m-sko- 
(Gr. pdoKs). Regarding PIE *kw (> OPers. s, Khot. ss [f], elsewhere sp), *g( h )w (> OPers. 
z, Khot. s [3], elsewhere zb) and *kr see below, p. 27 1 . 

Finally, we should note the outcome of PIE clusters of velar, labio-velar or palatal + 
*s. All such groups give OInd. ks, whilst Iranian distinguishes four possibilities: (i) xs 
< *k( w )s, e.g. Av. vaxsiia-, OInd. vaksva-, future of vak ‘to speak’; (ii) gz < *g( w )z h (for 
*g( w ) h +s by Bartholomae’s Law), e.g. GAv. aog’za ‘saidst’; (iii) s < *ks, e.g. Av. mosu, 
OInd. maksu ‘quickly’; and (iv) z < *gz h (for *g h +s), e.g. GAv. dTdsr’za-, desiderative of 
dar’z ‘to make firm’ (OInd. dnnhati). 

The following schema shows the minimum complement of consonantal phonemes to 
be assumed for Common Iranian. An asterisk (*) indicates those which do not survive as 
such in any attested language. 
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Regarding the reappearance of PIE *1 (> OPers./Av. r) as / in later Iranian see the next 
paragraph. On the reconstructions *ts and *dz see above, p. 269-270. The symbol *H 
represents a consonant deriving from the PIE laryngeals, whose survival, at least in 
certain positions, is indicated by metrical and other considerations; e.g. GAv. mazda, a 
form which is disyllabic as the nom. sg. but trisyllabic as the gen. sg., indicating nom. 
*mazdaH-s (> *mazdas) ~ gen. *mazdaH-as. 

Not all of the phonological developments shared by Avestan and Old Persian can be 
ascribed to Common Iranian. The change of *s to h (except in certain groups), which 
occurs in all attested Iranian languages, cannot have been completed until after the arrival 
of Iranian speakers in western Iran, as has already been pointed out. The replacement of 
*1 (and *1) by r (and r), which Avestan and Old Persian have in common with Vedic, was 
nevertheless not universal in Iranian, as is shown by the later reappearance of dialectal 
forms with l < PIE *1, e.g. Pers. list an ‘to lick’ beside Av. raez (PIE *leyg h -, Gr. T-ei/co). 
Similarly, the two Old Iranian languages share a development of PIE *k( w )y to sy, as in 
OPers. siyav, Av. s(ii)auu ‘to go’ (PIE *kyew-, OInd. cvav-, Gr. aebopat and Kiveco), but 
the preservation of an affricate in Khot. tsii- [ts h u:-], Tumshuqese cch- ‘id.’ indicates that 
only the intermediate stage *cy is to be attributed to Common Iranian. 

The most important isogloss separating Old Persian from Avestan is to be seen in the 
treatment of the “first palatal series”, PIE *k, *g, *g h , which are thought to have devel- 
oped via palatal affricates (Hr. *c, *j, *j h ) and dental affricates (Common Iranian *ts, 
*dz) and which give 0 and d in Old Persian (and later dialects of southwestern Iran) but 
5 and z in Avestan and all other Iranian languages (cf. above, p. 269 ff). The treatment 
of the PIE combinations *kw and *g( h )w provides a three-way isogloss, giving sp, zb in 
most Iranian languages (including Avestan), s, z in Old Persian, and s, z in the group of 
northeastern Iranian Saka (Scythian) languages represented by Khotanese. Examples: 
Av. aspa-, OPers. asa-, Khot. assa- [aja-] ‘horse’ (= OInd. asva-, PIE *-kw-); Parthian 
dzban, OPers. hdzan-, Khot. bisaa- [|3i3aa-] ‘tongue’ (cf. OInd.y7/?va-, PIE *-g h w-). Since 
the palatals s, z can hardly be derived from *tsw and *dzw, it is simplest to assume Com- 
mon Iranian *cw and *jw. The palatal nature of Ilr. *c < PIE *k seems also to have 
been preserved up to the Common Iranian stage in the case of the cluster *cr, cf. Khot. 
ssara- [[era-] ‘good’ (= Av. srlra-, OInd. srila- ‘beautiful’, cf. Gr. Kpeicov). In Old Persian 
*cr gives g (a sibilant of unclear phonetic character), a development which may have 
proceeded via *tsr and *0r, since g is also the outcome of Iran. *0r < PIE *tr or *tl, as in 
puga- ‘son’ (= A v.puOra-, OInd . putra-). 

It is not surprising to find that it is the languages spoken at the fringes of the Ira- 
nian world - Old Persian in the extreme southwest and the languages of the nomadic 
Saka peoples of the Eurasian steppes - which stand out as aberrant in respect of the old 
isoglosses mentioned above. In each case, Avestan represents the Iranian mainstream. 
Avestan is often regarded as an Eastern Iranian language, which is no doubt correct from 
a purely geographical point of view, but it shows none of the phonological developments 
which are characteristic of Eastern Iranian in later periods, such as the voicing of the fric- 
ative in the groups *xt and *ft or the depalatalization of *c. Avestan does indeed have its 
peculiarities, such as the reversion of *ft to pt (cf. above, p. 264), the development of *rt 
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to s (cf. below, p. 272-273) or the frequent insertion of a nasal ij before h (e.g. aqhat, 3 
sg. subj. of ah- ‘to be’, OInd. asat), but they do not seem likely to be very ancient, nor do 
they provide evidence of a particularly close relationship with any other Iranian language. 

MORPHOPHONOLOGY 

At the end of the word certain special phonological changes take place. In the attested 
Old Iranian languages the original distinctions between long and short final vowels are 
lost. In general, Old Persian and Old Avestan tend to lengthen short final vowels, while 
Later Avestan shortens many that were originally long. In the a-declension, for instance, 
both the voc. sg. (originally *-a) and the instr. sg. (originally *-a) appear as OPers./GAv. 
-a, LAv. -a, so that the length of the final vowel no longer has any phonemic (or etymo- 
logical) significance. The merging of long and short final vowels was not universal, how- 
ever; cf. Morgenstieme 1973: 108-9 on remnants of a distinction between *-a and *-a in 
Shughni and other Modem Iranian languages of the Pamir mountains. 

A feature common to all the Iranian languages is the loss of final *-h (< PIE *-s). In 
some languages the loss of *-h is accompanied by a change in the quality of the preceding 
vowel, whereby *-ah > Av. -o (GAv. also -a), Khot. -a [-e], Sogd. -i, and *-ah > Av. -a, 
Khot. -e [-£:] (but Sogd. -a; cf. the similar changes accompanying the loss of final *-m 
in Middle Iranian: *-am > Khot./Sogd. -u; *-am > Khot. -o but Sogd. -a). In Old Persian, 
on the other hand, *-h is lost without a trace, as are *-d/-t and perhaps some other final 
consonants, so that *-ah/-ad and *-ah/-ad give -a and -a respectively (thus re-establishing 
the recently lost phonemic distinction between long and short final vowels). Such devel- 
opments had a significant impact on the morphology of the Iranian languages, as may be 
seen from the paradigm of the a-stems in Table 4.21. 

The changes typical of absolute word-final position are sometimes found also inter- 
nally, in compounds and before particular morphemes: cf. Av. v&.co.mar’ta- ‘recited 
aloud’ and instr. pi. vacsbis, both from vacah- ‘speech, word’ with the same treatment of 
*-ah < *-as as occurs in final position in the nom./acc. sg. vacd/vacS (= OInd. vacas, Gr. 
(F)oto<;). In other cases, however, compound-juncture is treated as internal position, as in 
Av. vacas. tasti- ‘strophe’, where the original *s “reappears” in accordance with the reg- 
ular treatment of the PIE cluster *st. Such combinatory variants as vacas° are referred to 
as sandhi-forms, “sandhi” being the Sanskrit term for the “combination” both of elements 
within a word and of words within a sentence (cf. above, p. 229-230); in Old Iranian, 
however, the occurrence of sandhi is almost entirely restricted to the juncture of elements 
within a single accentual unit, i.e. of morphemes in a word, of words in a compound or 
of a clitic with its host as in A v.fradatae-ca ‘and (it) will prosper’ (= *fra5a‘te, 3 sg. subj. 
mid. of firad + encl. -ca ‘and’, cf. Hoffmann 1975: 262ffi), kas-cit ‘someone’ (= nom. 
sg. m. ko ‘who?’ + encl. indefinite ptcl. - cit ), OPers. kas-ci. As these examples show, the 
forms occurring in sandhi before enclitics often preserve older phonological forms of the 
inflections: -tae° < PIE *-toy (cf. above, p. 267), kas° < PIE *k w os. The shortening of 
the vowel in the first syllable of fradatae-ca is probably due to a shift of accent to the 
syllable preceding the enclitic -ca (= Gr. re, Lat. -que, etc.). 

Since the accent is not noted in writing in any Old or Middle Iranian language, its 
position and nature can only be deduced - as in Germanic - from its observable effects. 
In Avestan the most important phonological change connected with the accent is the 
devoicing of r (and dr < *r) before k,p, t, which is restricted to forms in which the accent 
falls on the syllable containing r. The working of this rule, which results in written hrk. 
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hrp and (*hrt >) s, indicates the existence of a free accent, which is often though not 
always on the same syllable as in the equivalent Vedic form, e.g. vahrka- ‘wolf’, amasa- 
‘ immortal’ = Ved. vfka-, amrta-, but mahrka- ‘destruction’ = *marka- (as against 
Ved. market-). The formation of a compound or the addition of a suffix or enclitic (cf. 
the preceding paragraph) can result in a shift of accent, as in amar’ta-tat- ‘immortality’ 
(cf. Ved. sarva-tat(i)- beside sarva-). See Mayrhofer in Schmitt 1989: 12-13, Beekes 
1988: 55-69. 

Whether the Avestan accent was still a musical (pitch) accent like that of Greek and 
Vedic or a dynamic (stress) accent is controversial, but there is no doubt that most Middle 
and Modem Iranian languages have developed a strong stress accent, which often causes 
syncope in unstressed syllables. In many Iranian languages the position of the stress has 
come to be wholly determined by the quantitative shape of the word, but a free stress, 
possibly reflecting the PIE accent, is still found in some modem Eastern Iranian lan- 
guages; cf. Morgenstierne 1973a on the difference in stress in such pairs as Pashto wuca 
(f.) ‘dry’ (= Ved. suska -) and rica ‘nit’ (= Ved. liksa-). 

The assumed close relationship between accent and ablaut (cf. above, p. 68) has become 
effaced in Iranian, as in other branches of PIE, to the extent that the accent can fall on any 
syllable, regardless of its vocalism. As a result of the merger of *e and *d in Ilr. a, the 
PIE qualitative ablaut has disappeared, although the palatalization of the IE (labio-)velars 
before *e occasionally allows its former presence to be discerned, as in the inflection of 
GAv. aogah- (n.) ‘strength’, ace. sg. aogo, instr. sg. aojaijha < *h 2 ewg-os, *h 2 ewg-es-eh,, 
cf. Gr. pevoq, psveoq (Hoffmann 1958: 14 15), or the interrogative pronoun Av. ka-, ca- 
< *k w o-, *k w e- (cf. below, p. 277). Some such contrasts between forms with and without 
palatalization survive into Middle Iranian, as in Parthian parvoz beside paiyoy ‘victory’ 
or Khot. tcamana, instr. sg. of kve ‘who’. On the other hand, the quantitative ablaut (the 
PIE alternation 0 ~ e/o ~ eld) is well preserved and productive in Indo-Iranian, where it 
appears as 0 ~ a ~ a, or, in combination with a following semi-vowel or consonant, i/y ~ 
ai/av ~ ail ay, r/r ~ ar ~ dr, a/n (< *n/n) ~ an ~ an, etc. It is to be noted that the Indo-Ira- 
nian long grade (a, etc.) does not always derive from a PIE long grade but can also repre- 
sent the o-grade by Brugmann’s Law (cf. above, p. 267). 

These alternations, which can occur in any part of a word (root, suffix or ending), are 
of great importance for the historical morphology of Iranian (cf. also p. 282 below on the 
function of “vrddhi” in word-formation). Ablaut occurs both within a single paradigm, 
a particular grade of the root and/or suffix being associated with each individual ending, 
and between contrasting paradigms. 

Ablaut of the root is most often attested in formations without a suffix, particularly in 
root-presents such as ah-/h- ‘to be’ and reduplicated presents such as dada-ldad- ‘to give’ 
(< *de-deh 3 -/*de-dh r ). In formations containing a suffix (or infix) it is usually this ele- 
ment which shows alternation, e.g. nouns in -tdr-l-tar-l-tr- (-Or-), athematic optatives in 
-yd-l-T- or present stems with infixed -na-l-n-. The preservation of an alternation in both 
root and suffix, as in Av. nom. sg. panta, gen. sg. paOo ‘path’ (< PIE *pent-oh 2 -s/*pnt- 
h,-es), is exceptional. 

Each individual form in such an alternating paradigm is characterized by a particular 
ablaut grade of the stem as well as by a specific ending. In the root-present, for instance, 
the 1/2/3 sg. pres. ind. active generally require the full grade of the stem (as in GAv. 
mrao-mi, etc., from mrauu ‘to say’), while the equivalent middle forms require the zero 
grade (°mru-ye, etc.). Similarly, a w-stem such as OPers. Kuru- ‘Cyrus’ has the zero 
grade of the stem in the nom. sg. (Kur-u-s) but the full grade in the gen. sg. ( Kur-au-s ). 
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Occasionally the occurrence of an abnormal ablaut grade (e.g. the long grade of the root 
in GAv. staumi, 1 sg. pres. ind. act. of stauu ‘to praise’, or the long grade of the suffix 
in OPers. dahavaus, nom. sg. of the u-stem dahayu- ‘country’) indicates that a category 
such as “root-present” or “«-stem” is not unitary but is made up of stems which origi- 
nally belonged to various classes characterized by different configurations of accent and 
ablaut. 

The endings do not normally display ablaut variation within a single paradigm, 
but only between contrasting paradigms (but cf. Table 4.22 on the inflection of Av. 
xratu-). Thus, the gen. sg. ending is attested as *-as (PIE *-es/-os) in Av. ratio, uxsno 
and OPers. piga (< *pi0ras, cf. Gr. iraipoq, Lat. patris) from the stems raiii- ‘wealth’, 
uxsan- ‘bull’ and pitar- ‘father’, but as *-s in Av. gardis, GAv. casmang (with -rig 
< *-nh < *-ns) and mar’s from the stems ga'ri- ‘mountain’, casman- ‘eye’ and nar- 
‘man’. Not all of the individual forms attested are ancient: war’s, for instance, with 
its remarkable combination of the zero grade in both stem and ending, is probably 
an innovation for expected *nar5 (cf. OInd. naras, Gr. dvdpoq). Nevertheless, since 
the innovation must have been based on an already existing form - in this case per- 
haps *bra-tr-s (= OInd. bhratur, ONor. brodor), gen. sg. of bratar- ‘brother’ (cf. 
Hoffmann 1976: 598) - such a form can justifiably be used as evidence that Iranian 
inherited r-s terns with “aerodynamic” accent and the associated type of ablaut (cf. 
above, p. 69). 


MORPHOLOGY 

Nouns 

In Avestan, as in Old Indian, the system of three genders, three numbers and eight 
cases is well established (though it is only in the singular of a few declensions that all 
eight cases are formally distinct). During the later history of Iranian this system was 
gradually simplified. Old Persian has already reduced the cases to six by conflating 
the dative with the genitive and the instrumental with the ablative; Khotanese has 
gone further, retaining only remnants of the neuter gender and the dual number, while 
Sogdian has replaced most of the old plural inflections by forms derived from a collec- 
tive noun in *-ta-. Many Modern Iranian languages have dispensed both with the case 
system and with grammatical gender, so that in New Persian, for instance, the only 
morpheme surviving from the Old Iranian system of nominal inflection is the plural in 
-an (< OPers. gen. pi. -anam). 

In Old Iranian the various declensions are principally distinguished by the final sound 
of the stem: stems in a, a, i, r, etc. They are further divided into sub-classes by gender 
(e.g. stems in a into masculines and neuters) and, to a limited extent, by the different 
accent and ablaut patterns referred to on p. 273-274 above. The number of distinct 
declensions is very much reduced in Middle Iranian, where there is a marked tendency 
to transfer all masculine and neuter nouns to the a-declension and feminines to the 
a-declension. 

The most common declension in all Iranian languages is that of the masculine 
a-stems (PIE *o-stems), whose inflection in Avestan (exemplified by yasna- ‘sac- 
rifice, worship’ = Ved. yajna-), Old Persian, Khotanese and Sogdian is shown in 
Table 4.21, together with the corresponding Vedic forms. (Only a selection of the 
numerous variant forms attested, especially in Avestan and Khotanese, is included 
in the table.) 
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TABLE 4.21 DECLENSION OF MASCULINE A-STEMS (PIE STEMS IN *0) 



Vedic 

Avestan 

Old Persian 

Khotanese 

Sogdian 

Sg. nom. 

yajn-as 

yasn-o 

-a 

-a 

-i 

acc. 

yajn-am 

yasn-dm 

-am 

-u 

-u 

instr. 

yajn-a 

yasn-a 

-a 

-na 

(= abl.) 

dat. 

yajn-aya 

yasn-ai, GAv. also -ai.a 

(= gen.) 

(= gen.) 

(= gen.) 

abl. 

yajn-at 

yasn-at, LAv. also -aSa 

-a 

(= instr.) 

-a 

gen. 

yajn-asya 

yasn-ahe, GAv -ahiia 

-ahoya 

-i 

-e 

loc. 

yajn-e 

yesn-e, yasn-aiia 

- ai , -aya 

-‘a 

-ya 

voc. 

yajn-a 

yasn-a 

-a 

-a 

-a 

Du. nom.acc. 

yajn-a , -du 

yasn-a 

-a 


-a 

instr. dat.abl. 

yajn-dbhydm 

yasn-ae'biia 

-aibiya 



gen. 

yajn-ayos 

yasn-aiia 




loc. 

(= gen.) 

yasn-aiid 




voc. 

yajn-a , -au 

yasn-a 




PI. nom. 

yajn-as , -asas 

yasn-a 

-a 

-a 

-a 

acc. 

yajn-an 

yasn-q, GAv. -§ng 

(= nom.) 

(= nom.) 


instr. 

yajit-ais, -ebhis 

yasn-ais 

-aibis 

-yau 


dat. 

yajn-ebhyas 

yasn-ae’biio 

(= gen.) 

(= gen.) 


abl. 

(= dat.) 

(= dat.) 


(= instr.) 


gen. 

yajn-anam 

yasn-anqm 

-anam 

-anu 

-an 

loc. 

yajn-esu 

yasn-aesu, -aesuua 

-aisuva 

-uvo’ 


voc. 

yajn-as , -asas 

yasn-a 


(= instr.) 



This type of stem seems always to have had a fixed accent (with the exception of the 
voc. forms, which in Vedic are either unaccented - cf. Sogd. encl. voc. sg. / lay ‘sir!’ 
beside stressed fiaya - or accented on the first syllable regardless of the position of the 
accent in the rest of the paradigm, a rule for which there is some evidence also in Aves- 
tan; see Hoffmann 1975: 266 and cf. Gr. aSeT-ipe ~ aSeTxpoq, etc.). As for the individual 
endings, the majority of the Iranian forms are directly comparable with their Old Indian 
equivalents (see Table 4.10 above). The Av./OPers. instr. sg. in -a corresponds to the rarer 
Ved. instr. in -a rather than to that in -ena (which is of pronominal origin, as is Khot. 
-na < Old Iranian *-ana). The usual Av. dat. sg. -ai, which may be compared directly 
with Gr. -ro, is more archaic than GAv. -ai. a, OInd. -ava; the final -a of the latter fonn 
seems to be a fossilized postposition, which may be found also in some Iran. abl. sg., 
loc. sg. and loc. pi. forms. In the nom./acc./voc. du. the Iranian forms agree with Ved. -a 
(= Gr. -co, cf. also Lat. ambo) rather than Ved. -au; the two forms are thought to be old 
sandhi-variants. In the remaining cases of the dual the Iranian and Indian forms are not 
precisely comparable, the most important difference being the preservation of a distinc- 
tion between gen. and loc. du. in Avestan. In the nom./voc. pi. the regular equivalents of 
OInd. -as and -asas are the rare endings Av. -a and -aijho, OPers. -aha, which seem to be 
particularly favoured for words pertaining to the sacral sphere (Av. anwsa ‘the immortal 
ones’, yazataijho ‘(beings) worthy of worship’, OPers. bagaha ‘gods’). The usual form in 
both Avestan and Khotanese is -a, which has been explained as a PIE collective in *-a (< 
*-eh 2 ), cf. Lat. loca ~ locus (Hoffmann 1958: 13); OPers. -a and Sogd. -a are ambiguous 
and may equally well derive from *-a or *-as (or both). 

Although Iranian inherited many varieties of stems showing ablaut variation (origi- 
nally associated with a mobile accent), these seldom survive as independent types. As 
a result of a tendency to harmonize the inflection of all stems ending in the same sound 
(e.g. all stems in u), forms deriving from different ablaut types may be combined in the 
inflection of a single word, often making it difficult to discern its original ablaut pattern. 
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This point may be illustrated by the u-stem Av. xratu- (m.) ‘mental power, intention, etc. ’ 
(= OInd. b'dtu- ‘power’), of which all the attested forms are shown in Table 4.22. (The 
only forms which occur in Old Persian are the two ace. sg. forms xratum and xraOum, the 
latter showing generalization of 0 from a form such as instr. sg. *xra0uva = Av. xraOfia.) 


TABLE 4.22 DECLENSION OF AV. XRATU - 



sg- 


pi. 


(GAv.) 

(LAv.) 

(GAv.) 

nom. 

xratus 

xratus 

xratauud 

acc. 

xratum 

xratum, xraQfidm 

xratus 

instr. 

xratu , xraOpa 

xraOpa 

*xratubls 

dat. 


xraOfie 

*xratubiio 

abl. 

(= gen.) 

xrataot 

(= dat.) 

gen. 

xratdus 

xratdus, xraQfio 

*xratun^m 

loc. 

xrata 


*xratusu 

voc. 


[hujxratuud 

(= nom.) 


In this paradigm the suffix appears in the zero grade as *u (Av. u) or *w (> Av. p after 
6 ), in the full grade as *au (Av. Su/ao, in final position d/uud ) or *aw (Av. auu) and in the 
long grade as *au (> Av. a in final position). Note also the occurrence of two ablaut vari- 
ants of the ending itself in the instr. sg. (-« < *-u-h,; -fia < *-w-eh 1 or *-w-oh,) and gen. 
sg. ( -Sus < *-ow-s; -[So < *-w-es or *-w-os). The etymologies of the remaining endings 
are as follows. Singular: nom. -s < *-s; ace. -m < *-m (LAv. variant -am borrowed from 
the a-stems); dat. -e < *-ey. Originally the abl. sg. was formally distinct from the gen. 
only in the a-declension; LAv. xrataot exemplifies a later tendency to create special abl. 
forms by borrowing the final -t of the a-stems. The loc. and voc. sg. are both endingless 
but differ in the grade of the suffix. Plural: nom./voc. -o < *-es; ace. -(u)s < *-(u-)ns; instr. 
-bis < *-b h is; dat./abl. -biio < *-b h yos; loc. -su < *-su. The gen. pi. (like that of the a-stems 
and most other declensions) was remodelled in Indo-Iranian after that of the M-stems, 
but the older ending -qm (< *-om, Gr. -cov) is occasionally attested, as in LAv. vadfiqm 
(besid eyatunqm), gen. pi. oiyatu- ‘sorcerer’. 


Adjectives, pronouns and numerals 

In general, adjectives are inflected exactly like nouns, though a few common adjectives, 
such as Av. vTspa- ‘all’ and its cognates, display some of the peculiarities of pronominal 
declension (see below), e.g. Sogd. abl. (originally instr.) sg. m. wispna, LAv. nom. pi. 
m. vfspe (= Khot. bissa, contrast with GAv. vTspaijhd), gen. pi. m. vispaesqm (beside 
vispanqm). The feminine forms of adjectives are usually derived from a separate stem 
in -d or -F (even where the m./n. stem belongs to a class, such as the a-declension, which 
includes feminine nouns). Examples from Avestan: sura-, f. sura- ‘strong’; po'Tu-, f. 
pao‘ri- ‘much’; bai^zaijt-, f. bar’zaTT- ‘high’. 

As in Old Indian, comparatives and superlatives can be formed in two ways: with the 
suffixes -tar a- and -tama- added to the stem of the positive (e.g. Av. as.aojah-, as.aojas- 
tara-, as.aojas-tama- ‘possessing much, more, most power’) or with the suffixes -vah- 
and -ista- added directly to the underlying root in the full grade (e.g. Av. uy-ra-, aoj-iiah-, 
aoj-ista- ‘strong, -er, -est’). Also formed directly from the root is the compound form 
in as in Av. tizi.asura- ‘sharp-tusked’ (< *tij-i- beside tiy-ra- ‘sharp’), bar’zi.caxra- 
‘high-wheeled’ (beside bar’z-aijt-), cf. OInd. rj-i-, Gr. apy-t- as the compound form of 
rj-ra-, apyoq (< *dpy-po-<;) ‘swift; bright’, etc. (cf. above, p. 79). 
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The principal Avestan demonstrative pronouns are ho (nom. sg. m.), ha (nom. sg. f.), tat 
(nom. sg. n.) ‘this; he, she, it’, and its compound aeso, aesa, aetat; aem, Tm, imat ‘this’; and 
hau, hau, auuat ‘that’. In their inflection these show the same kinds of peculiarities as the 
equivalent Old Indian forms (cf. above, p. 85 ff), including the employment of suppletive 
stems, often opposing the nom. sg. m. and f. (e.g. ho, ha) to the rest of the declension (stem 
ta-), and the prefixation or suffixation of deictic particles (e.g. ae- in ae-sa-, etc., *-am in 
aem, Tm = OInd. ay-am, iy-am). The use of certain endings different from those of nouns 
(e.g. nom./acc. sg. n. in -t, instr. sg. m./n. in -na, nom. pi. m. in -e) and the infixation of 
additional elements between the stem and ending (e.g. -hm- and -hv- respectively in several 
cases of the m. and f. sg., -h-t-s- in the gen. pi.) may be exemplified by the following forms 
of the demonstrative Av. aem ‘this’ (stems ay-H-, a- and ima -): nom. sg. n. ima-t, instr. sg. 
m. a-na, dat. sg. m. a-hm-ai, dat. sg. f. a-hjh-ai (< *a-hy-ai), nom. pi. m. im-e, gen. pi. m. 
ae-s-qm, gen. pi. f. a-ijh-qm. The Old Persian forms follow the same principles. 

Similar irregularities occur in the inflection of the relative pronoun, Av. yd (GAv. 
vs), yd, vat, OPers. hava, haya, tava (where the relative has been compounded with the 
demonstrative *ha-, *ta-), and of the interrogative pronouns. In Old Iranian, unlike Old 
Indian, all of the four interrogative stems, ka-, ka-, ca- and ci-, still function as pronouns 
and tend to combine into a suppletive system like that of the demonstratives: Av. ko, ka, 
cit (nom. sg. m., fi, n.), cf. OPers. kas-ci ‘someone’, cis-ci ‘something’. 

The inflection of the personal pronouns differs even more markedly from that of nouns, 
as may be illustrated by the following selection of first person forms: 


nom. sg. 

Av. azsm, OPers. adam 

ace. sg. 

LAv. mqm, OPers. mam 

dat. sg. 

GAv. ma‘biia, ma‘biid, LAv. mauu d iia 

gen. sg. 

LAv. mana, OPers. mana 

nom. pi. 

Av. vaem, OPers. vayam 

dat. pi. 

GAv. ahma^biia 

gen. pi. 

LAv. ahmaksm, OPers. amaxam 


(cf. OInd. aham, mam, mahya(m), mama; vayam, asmdbhya(m), asmakam). 

It will be noted that, as in Old Indian, these forms show no distinction of gender and 
that the singular and plural forms are derived from apparently unrelated stems. A further 
peculiarity of the personal pronouns is the existence of alternative unaccented (enclitic) 
forms in certain cases, e.g. 1 sg. Av. ma, OPers. -ma (ace.), GAv. moi, LAv. me, OPers. 
-mai (gen./dat.). In the plural, Old Avestan preserves a distinction between the encl. ace. 
na ‘us’, va ‘you’ (cf. Lat. nos, vos) and the encl. gen./dat. ns, vs, while LAv. no and vo, 
like OInd. nas and vas, are used for all three cases. Finally, we may note the GAv. nom. 
sg. f. forms Ofidi and x'ae[ca \ (from the possessive adjectives 6/ia- ‘thy’, x'd- ‘(one’s) 
own’), whose ending may be compared with that of Lat. quae, etc. (Hoffmann 1958: 16). 

The numerals show many peculiarities, both archaisms and innovations, in their inflec- 
tion and word- formation. These are discussed in detail by Emmerick (1992), whose study 
shows that the rich data from Middle and Modern Iranian languages can be useful in 
clarifying the Old Iranian forms. 


Verbs 

In Old Iranian, and especially in Avestan, the inflection of the verb is extremely rich as 
a result of the numerous intersecting categories into which its forms are classified: per- 
son (first, second or third), number (singular, dual or plural), mood (indicative, injunc- 
tive, subjunctive, optative or imperative), tense (present, aorist, perfect, etc.) and voice 
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(active, middle or passive). In general, the category of tense is indicated by the stem of 
the verb; that of mood by the presence or absence of a modal suffix following the tense- 
stem, the presence or absence of the augment a- before the tense-stem and the choice 
of ending; those of person, number and voice by the verbal endings alone (except in 
the case of the passive present stem in -ya-). In addition to the finite forms of the verb, 
participles and infinitives are also attested. The following survey (based on the compre- 
hensive description of Kellens 1984) is primarily concerned with Avestan; Old Persian 
provides examples of most of the corresponding types and categories but no complete 
paradigms. 

Present stems can be formed in many ways, of which only the principal types can be 
mentioned here. The most important division is that between “thematic” and “athematic” 
presents. The thematic presents are formed by adding to the root (in a particular, invari- 
able ablaut grade) a suffix consisting of or ending in -a- < PIE *-e/o-: 

Av. Oflar’s- a- ‘to fashion’ (zero grade of root + suffix -a-) 

bauu-a- ‘to become’ (full grade + -a-) 

bu‘8-iia- ‘to notice’ (zero grade + - ya -) 

zb-aiia- ‘to invoke’ (zero grade + -ava-) 

bmid-aiia- ‘to bind’ (full grade + -ava-) 

xsnauu-aiia- ‘to satisfy’ (long grade + -ava-) 

ja-sa- ‘to come, go’ (zero grade + -sa- < PIE ‘inchoative’ *-sko-) 

The various types of athematic present have in common certain endings different from 
those of thematic stems (see below on the indicative and imperative) and the occurrence 
of ablaut alternation in the stem: 

jan-lyn- ‘to strike’ (root-present) 
da-da-lda-d- ‘to give’ (reduplicated present) 
vi-na-d-lvi-n-d- ‘to find’ (infixed nasal) 
d 3 b s -nao-l* d 3 b 3 -nu- ‘to deceive’ (zero grade + -nao-l-nu-) 
star' -nd-l star’ -n- ‘to spread’ (zero grade + -nd-l-n-) 

In origin, these last two classes are special cases of the preceding type, the nasal infix hav- 
ing been inserted into a root with final *-w- or *-H-, cf. a.d’bao-ynan- ‘deception’ (which 
demonstrates the existence of a root dbav beside dab), star^ta- ‘spread’ (< *strh 3 -to-, Gr. 
oipcoioq), etc. 

Certain types of present stem, notably the passives in -va-, causatives in -ava- and 
future stems in *-sya- (> -hva-, -sva-), express a special or modified sense of the verb: 

jan-iia- ‘to be struck’ (beside jan-lyn- ‘to strike’) 
fam-aiia- ‘to cause to go’ (beside ja-sa- ‘to come, go’) 

fut. ptcp. act. bu-siia-nt- ‘about to be’ (beside pres. ptcp. act. bauu-a-nt- ‘being’) 

The future stem is most often attested by its participles, the sense of a fut. indicative being 
more commonly expressed by the pres, subjunctive. The passive stem in -ya- (which in 
Iranian, unlike Old Indian, takes mid. or act. endings indifferently) is also comparatively 
rare, in part as a result of the fact that a passive sense can alternatively be expressed by the 
use of the normal (non-passive) pres, stem with mid. instead of act. endings, e.g. vaena'te 
(mid.) ‘is seen, seems’ as opposed to vaena'ti (act.) ‘sees’. 
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The principal types of aorist stem are the sigmatic aorist, e.g. xsnau-s-lxsnao-s- ‘to 
satisfy’, in which the suffix *-s- (> -s-, -h-, -s- according to phonological context) is 
combined with alternation between the long grade and full grade of the root, and the 
root-aorist, e.g .jam-lym- ‘to come, go’, which displays alternation between the full grade 
and zero grade as in the most common type of root-present (though the distribution of the 
two alternants is slightly different in the aorist). The perfect stem is usually formed by 
reduplication, e.g. va-uuac-lva-oc- ‘to say’. As in other IE languages, the verb ‘to know’ 
irregularly forms an unreduplicated perf. stem vaed-lvid-, cf. OInd. veda, vidma, Gr. 
(F)oi8a, (p)t8pev (cf. above, p. 98), etc. The role of the aor. and perf. stems is very much 
restricted in Later Avestan - even more so in Old Persian - a development marking the 
first stage in the creation of the Middle Iranian verbal system (based on the pres, stem and 
a new “past stem” derived from the past participle in -to-). 

It is convenient to begin a survey of the formation of the moods with the injunctive, 
which is formed by the addition of the so-called secondary endings - in fact the verbal 
endings in their most basic forms, see above, p. 95 - directly to the pres, or aor. stem. 
The “secondary” endings (omitting those of the dual, since they are poorly attested and 
often etymologically obscure) are given in Table 4.23. With the exception of 2 sg. mid. 
*-sa (cf. Gr. sOot), Horn. sOso < *h,e-d h h 1 -so, etc., as against OInd. -thas) and 3 pi. act. 
-at (< *-nt), an ablaut variant lost in Old Indian, these endings correspond precisely to the 
equivalent Old Indian forms. 


TABLE 4.23 SECONDARY ENDINGS IN AVESTAN 




active 

middle 

sg. 

1 

-m or -dm 

-i 


2 

*-s (> -h, -s, etc.) 

*-sa (> -ha, -sa, etc.) 


3 

-t 

-ta 

pi. 

1 

-ma 

-ma'di 


2 

-ta 

-dum or -Spdm 


3 

-dn (< *-e/ont), -n (< *-nt) or -at (< *-nt) 

-nta or -ata (< *-nto) 


The imperfect is formed, as in Old Indian, by prefixing the augment a- (= OInd. a-, Gr. 
£-, Arm. e-) to the pres, injunctive. The imperfect is well attested in Old Persian and in some 
later Iranian languages such as Sogdian, but comparatively rare in Avestan, where the pres, 
injunctive has largely taken over its function as a past tense. The even rarer aor. indicative, 
of which a few forms are attested in Old Avestan and Old Persian, is similarly formed by the 
prefixation of the augment to the aor. injunctive. The pres, and perf. indicatives, however, are 
characterized in a different way, by the use of endings distinct from those of the injunctive. 

The so-called primary endings of the pres, indicative (again omitting the dual forms), 
as attached to athematic pres, stems, are given in Table 4.24. All of these endings have 
exact cognates in Old Indian. The inflection of thematic stems differs only in the 1 sg. 
active, where Old Avestan attests the ending -a (= Gr. -co, Lat. -o, etc.) as against OPers./ 
LAv./OInd. -a-mi. The thematic vowel, in general a, appears as a (< *o by Brugmann’s 
Law, cf. above, p. 267) in 1 pi. act. -a-mahi and mid. -a-ma‘de; on the other hand, the 
thematic 1 sg. middle has merely -e < *-ai, where *-ai < *-a-ai might have been expected. 
Since the thematic pres, indicative is well attested in most Iranian languages, it is possi- 
ble to give some complete paradigms, at least of the sg. and pi. forms, based on the pres, 
indicative of bar , pres, stem bar-a- (act.) ‘to carry’, (mid.) ‘to ride’; see Tables 4.25 and 
4.26. 
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TABLE 4.24 PRIMARY ENDINGS IN AVESTAN 




active 

middle 

sg- 

1 

-mi 

-e 


2 

*-si (> -hi, -si, etc.) 

*-sai (> -he, -se, etc.) 


3 

-ti 

-te 

pi. 

1 

-mahi 

-ma'de 


2 

*-0a 

-duiie 


3 

-anti, -nti or -a'ti (< *-nti) 

-nte or -a’te (< *-ntoy) 


TABLE 4.25 CONJUGATION OF THE THEMATIC PRESENT INDICATIVE ACTIVE 




Avestan 

Old Persian 

Khotanese 

Sogdian 

sg- 

1 

bar-a-mi 

bar-a-mi 

banma 

param 


2 

bar-a-hi 


blri 

pare 


3 

bar-a-'ti 

bar-a-ti 

bida 

Parti/part 

pi. 

1 

bar-a-mahi 

bar-a-mahi 

barama 

parem 


2 

bar-a-Oa 


bada 

parOa/parta 


3 

bar-d-nti 

bar-a-nti 

barlnda 

parand 


TABLE 4.26 CONJUGATION OF THE THEMATIC PRESENT INDICATIVE MIDDLE 




Avestan 

Old Persian 

Khotanese 

Sg- 

1 

ba'r-e 

bar-ai 

bare 


2 

bar-a-he 


bara 


3 

bar-a-‘te 

bar-a-tai 

bade 

pi. 

1 

bar-a-ma'de 


baramane 


2 

(GAv.) bar-a-duiie 


barlru 


3 

bar-a-nte 


barare 


The 2 pi. act. ending *-ta in Khot. bada, Sogd. fiarta (beside fiarOa) is a secondary 
ending borrowed from the injunctive, etc., as is the 2 sg. mid. *-ha in Khot. bam; the 
Khot. 2 pi. mid. ending -Tru is borrowed from the optative. The ending of Khot. 1 pi. act. 
barama seems to correspond more closely with Classical Skr. -mas than with its Vedic 
variant -masi (= OPers./Av. -mahi). A more significant division amongst the various Ira- 
nian languages is found in the 3 pi. middle, where Khotanese and some other languages 
attest an ending *-arai (= Av. -a‘re) or *-rai (= OInd./Av. -re). In Avestan, as in Old Indian, 
this ending is restricted to a small group of root-presents, some of which also have a 3 sg. 
middle in -e rather than -te. These special endings, which are also found in the perf. ind. 
middle, seem originally to have characterized a particular sub-class of root-presents (with 
a fixed accent on the root and ablaut alternation between long grade and full grade instead 
of between full grade and zero grade; see Narten 1968). 

The endings of the perf. ind. active (sg. and pi.) are as follows: singular: 1 -a, 2 -da, 
3 -a; plural: 1 -ma, 3 - ai ■* or -ai^s. It is not clear which of the two 3 pi. endings is to be 
equated with OInd. -ur (< *-r = -ai~ > or < *-rs = -dr’s). The endings of the perf. ind. middle, 
in so far as they are attested, are the same as those of the pres. ind. middle, with 3 sg. -e 
and probably 3 pi. *-re (cf. Khot. byaure ‘they exist’ < *abi-af-rai, originally 3 pi. perf. 
middle of abi-ap ‘to find, obtain’); see above. 

The subjunctive is characterized by a suffix -a-, which is inserted between the stem 
(whether present, aorist or perfect) and the endings. In the case of thematic stems, the 
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subj. suffix combines with the final vowel of the stem to a long a. The endings are a mix- 
ture of primary and secondary, the choice being fixed in some cases and free in others, 
except in the first person singular, where GAv. act. -a and mid. -ai are later replaced by the 
special endings -dni (= OInd. -ani beside -a; cf. above, Table 4.13) and -ane respectively. 

The optative is similarly characterized by the insertion of a suffix between the pres., 
aor. or perf. stem and the endings, which in this case are always the secondary endings, 
apart from the special endings 3 pi. act. -a;* 5 or -ar^s (beside secondary -n) and 1 sg. 
mid. -a. In the case of most athematic stems, the optative suffix shows ablaut alternation 
between -yd- and -I- (from *-yeh,-/*-ih r ). In all other cases the suffix is a non-alternating 
-F-, which combines with the final vowel of thematic stems to form the diphthong *ai (> 
Av. ae or of). A special feature of Iranian (attested in Avestan, Old Persian and Sogdian) 
is the employment of the augment with certain optative forms which express a repeated 
or habitual action in the past (cf. below, p. 282). 

The endings of the imperative are added directly to the pres, or aor. stem. (No perf. 
imperative is attested.) Active: singular: 2 (thematic) -0, (athematic) -di, 3 -tu; plural: 
2 -ta, 3 -ontu or -ntu. Middle: singular: 2 *-swa (> -suua, -huua, -suua), 3 -tqm or -qm; 
plural: 2 -dum or -SfJam, 3 -ntqm. These endings, all of which have exact cognates in Old 
Indian, are peculiar to the imperative (except for those of the second person plural, which 
are identical with the secondary endings). There is no first person imperative in Iranian. 

Pres, and aor. stems fonn their active participles by means of the suffix -ant-/ -at- (ath- 
ematic) or -nt- (thematic), while perf. stems employ the suffix -uuah-/-us-. All three types 
of stem form their middle participles in the same way, with the suffix -ana- (athematic) or 
-mna- (thematic). The latter fonn may be directly equated with Gr. -pevoq (< *-mh 1 no-), 
while its OInd. equivalent, -mana-, shows the influence of the athematic suffix -ana- (< 
*-mh[no-). Certain other verbal adjectives or participles are not fonned from a tense-stem 
but directly from the root, the most important being the “past participle” in -ta-, which 
has a passive sense in the case of transitive verbs and which comes to provide the basis 
for all the past tense formations in most Middle and Modem Iranian languages. Various 
types of infinitive are attested in Avestan, although none of them are common. As in the 
case of the participles, some are derived from a tense-stem, others directly from the root. 
The Old Persian infinitives, on the other hand, are all of a single type (not found in Aves- 
tan or Old Indian), the suffix -tanai being added to the full grade (PIE e-grade) of the root, 
e.g. cartanai ‘to do’ from the root kar. 

WORD-FORMATION 

The principal means of creating new nominal stems in Iranian are suffixation and com- 
pounding, the individual suffixes and types of compound being largely identical with 
those found in Old Indian. There are also a few prefixes, such as a-, an- ‘un-’’ and hu- 
‘good’, cf. above, p. 79-80. A peculiarity of Avestan, of which traces survive in some 
Middle Iranian languages, is the tendency to replace the bare stem by the nom. sg. fonn, 
both in compounds and before certain suffixes, e.g. bazus.aojah- ‘strong-anned’, daeuuo. 
data- ‘devil-created’, daeuuo. toma- ‘arch-devil’ (beside bazu.stauuah- ‘as thick as an 
arm’, daeuua-iiasna- ‘devil-worshipper’, etc.). A further Avestan development is the 
employment of the compound fonn in -5°, originally the nom. sg. m. of the a-declension 
(cf. daeuuo. data-, etc.), without regard to the gender or declension of the stem, as in 
daeno.sac- ‘well versed in the religion’ (from the feminine noun daena-) or kai M po.tat- 
(a collective noun derived from kar"pan-, the designation of a class of priests). Similarly, 
in Sogdian, a feminine a-stem such as xana < *xanaka- ‘house’ appears before certain 
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suffixes as xane- < *xanaki- (-/ being the nom. sg. m. ending of the Sogd. a-declension; 
cf. Table 4.21), e.g. pi. xane-t ‘houses’, in origin a collective noun with suffix *-ta-. 

The use of “vrddhi” of the first syllable as a derivational device (cf. above, p. 273) 
is well established in Iranian, although it never became common as it did in Classical 
Sanskrit. As parallels to the Old Indian forms with a and dr as vrddhi of a and r/ar respec- 
tively, one may cite such forms as Av. hduuani- ‘(time) appropriate for pressing’ from 
*hauuana- ‘act of pressing’ (OInd. savana-), var 3 0rayni- ‘victorious’ from vaVOrayna- 
‘victory’, OPers. Margava- ‘inhabitant of Margu’. Some Iranian languages seem to have 
agreed with Old Indian also in using the long diphthongs ai, au (= OInd. ai, au ) as vrd- 
dhi of i and u; cf. the OPers. month-name Qaigraci-, probably from *0igra-ka- ‘garlic’ 
(cf. Pers. sir ‘id.’ < *0igra-), MPers. waspuhr (< *waispuhr) ‘principal’ from wispuhr 
‘prince’, etc. In such cases, however, Avestan consistently follows an older derivational 
pattern in employing the short diphthongs *ai, *au (> ae, ao, etc.), as in duuaepa- ‘island’ 
(contrast OInd. dvipa- ‘island’ < *dwi-h,p-o- ‘(between) two waters’, without vrddhi), 
daozaifha- ‘hell’ from duz-ahu- ‘id.’ (literally ‘evil existence’). 


SYNTAX 

Traditionally, much less attention has been paid to the syntax of the Iranian languages 
than to their phonology and morphology. In this respect many of the chapters in Windfuhr 
2009 offer a refreshing break with tradition, dealing systematically with topics such as 
word-order, noun phrase structure, use of the cases, tenses and moods, co-ordination and 
subordination. On the syntax of Old Avestan see also West 2011. Here it must suffice to 
mention some of the more important points in which the Iranian languages differ from 
Old Indian. 

One of the most remarkable features of Old Iranian nominal syntax is the ability of the 
instr. pi. form to substitute for other cases of the plural, as in Av. vispais aoi karsuuqn 
vais hapta ‘to all the seven continents’ (instr. vispais, vais for ace.), OPers. XIV raucabis 
9akata aha ‘ 14 days had passed’ (instr. raucabis for nom.). Cf. also the use of the instr. pi. 
for the voc. pi. in Khotanese (see Table 4.21) and as a generalized oblique case of the plu- 
ral in some of the Modern Iranian languages of the Pamir mountains (Wakhi -av < *-aibis, 
etc.). Equally noteworthy is the use of the relative pronoun (Av. yd-, OPers. hayd- 
Itava-, see p. 277 above) in attributive constructions such as Av. daeumyim apaosam ‘the 
demon Apaosha’ (ace.) or OPers. dahayunam tavaisam parunam ‘of many lands’, a usage 
which results from the reinterpretation of a nominal relative clause such as Av. daeuuo 
yd apaoso, originally ‘the demon who (is) Apaosha’, and the attraction of the relative 
pronoun (and predicate) into the case of the antecedent, giving daeiim vim apaosam for 
*daeumyo apaoso. (See Reichelt 1909: 370-371.) 

Several characteristic features of the syntax of the verb in Old Iranian have already 
been referred to above, including the use of the injunctive rather than the imperfect as 
the normal narrative past tense, which is peculiar to Avestan, and the use of the optative 
(sometimes with augment) to express a repeated or habitual action in the past, e.g. Av. 
turn zamarguzo dkaVnuud vfspe daeuua, zaraQustra, ydi para ahmat vTrd.raoda apataiian 
paid aiia z 3 ma ‘you, Zarathushtra, drove underground all the demons who previously 
used to go about on this earth in human form’; OPers. yaOa-sam haca-ma aOahava, avaOa 
akunavavanta ‘as was said to them by me, so they used to do’; Sogd. caf awva nara awf 
Sasta mya.se, a'ti-sixa nara can dasta wapate ‘however many pomegranates she took in 
(her) hands, the pomegranates fell from her hands’. 
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The loss of the PIE perfect system, which is incipient in Late Avestan and almost 
complete in Old Persian, is made good by the creation of a new type of perfect based on 
the past participle (with an obligatory passive construction, the agent being originally 
in the dative, replaced in Old Persian by the genitive): Av. yezi-ca he aniia aya siiaodna 
frauuarsta ‘and if he has committed other evil deeds’; OPers. ima taya mana kartam 
‘this (is) what I have done’, lit. ‘what (has been) done by me’. In many later Iranian 
languages this construction comes to express a simple past tense, as in MPers. man kard 
‘I did (it)’. Another verbal periphrasis which later becomes widespread, especially in 
Eastern Middle Iranian, is the so-called potential construction (Sims- Williams 2007). 
This is first attested in Old Persian, where the past participle of a transitive verb is used 
with the auxiliary kar ‘to make’ (in the active) or bav ‘to become’ (in the passive) to 
express either a potentiality or the consummation of an action: nai aha martiya . . . hava 
avam Gaumatam tavam magum xsagam dftam caxriya ‘there was no-one . . . who could 
have deprived that Gaumata the magus of the kingship’; vaOa kantam abava, pasava 
Oika avaniya ‘when it had been dug, then it was filled with gravel’ . In Middle Iranian the 
potential construction also occurs with intransitive verbs (auxiliary ‘to become’, Sogd. 
[iw-, Khot. ham-), e.g. Sogd. ne nipasta Pot ‘he cannot lie down’; Khot. ku va drai masta 
parrate hamate, balvsa rrundu ksamotte ‘when three months had passed, the Buddha 
took leave of the king’. 

Note: The author would like to thank Almut Hintze and Philip Huyse for valuable 
comments on earlier drafts of this chapter. 


FURTHER READING 

In addition to bibliographic details of works cited in this chapter the following list 
includes a selection of basic literature on the Iranian languages, especially Avestan and 
Old Persian. 

The most up-to-date survey of the whole field, with a primarily typological rather 
than historical orientation, is to be found in Windfuhr 2009, with a monograph-length 
chapter on Old Iranian (p. 43-195) by P. O. Skjcervo. The Compendium Linguarum Irani- 
carum (Schmitt 1989), which includes chapters on the prehistory of the Iranian languages 
(p. 4—24, by M. Mayrhofer), on Old Persian (p. 56-85, by R. Schmitt) and on Avestan 
(p. 32-55, by J. Kellens), is still very useful. For Avestan see also K. Hoffmann 1987. 
An earlier article by the same author (Hoffmann 1958) gives an incisive characterization 
of the special features of Old Iranian, as compared with Old Indian, a topic covered in 
greater detail in Goto 2013. These surveys supplement but do not replace Geiger & Kuhn 
1895-1903, whose chapters on the Old Iranian languages (p. 1-248, by Christian Bar- 
tholomae), though in many respects dated, remain unsurpassed in comprehensiveness. 

An etymological dictionary of the Iranian languages (Rastorgueva and Edel’man 
2000-2007, Edel’man 20 11 — , in Russian) is in progress; Cheung 2007 lists Iranian verbal 
roots with their derivatives. 

The standard edition of the Avesta is that of K. F. Geldner (1886-1896, in the orig- 
inal script). The dictionary of Bartholomae (1904) has likewise not been superseded. 
Martinez and de Vaan 2014 provide an introduction to Avestan from an IE perspective 
and include text specimens, a glossary and an up-to-date basic bibliography. On Avestan 
syntax see Reichelt 1909: 218-387 and Skjcervo 2009: 94—166. A modem treatment of 
Avestan phonology and morphology is provided by Hoffmann and Forssman 2004; see 
also Kellens 1984 and 1995 on the morphology and syntax of the Avestan verb. 



284 NICHOLAS SIMS-WILLIAMS 


All the works mentioned above cover both Old and Later Avestan. Modern edi- 
tions of the Old Avestan texts, with translation and commentary, include Insler 1975 
(Gathas only), Narten 1986 and Hintze 2007 (both Yasna Haptaqha'ti only), Kellens 
and Pirart 1988-91, Humbach 1991 and Humbach and Faiss 2010. The phonology 
and morphology of Old Avestan are treated in Beekes 1988 (cf. also Kellens & Pirart 
1988: 42-88 on “phonetique et graphie”), aspects of its syntax in Kellens and Pirart 
1990 (which also contains a complete lexicon to the Old Avestan texts) and in West 
2011 . 

The most comprehensive edition of the Old Persian inscriptions is that of R. G. Kent 
(1953; supplemented by Mayrhofer 1978). Kent’s book also contains a historical gram- 
mar (more detailed but less reliable than Schmitt 1989: 56-85 or de Vaan & Lubotsky 
2012) and a lexicon, the latter now superseded by Schmitt 2014. Several of the most 
important inscriptions have been re-edited by R. Schmitt (1991, 2000). 

Finally, the existence of useful on-line resources for a number of Iranian languages 
should not be overlooked. These include introductory texts by P. Oktor Skjaervo on Aves- 
tan, Old Persian and Sogdian (all at http://www.fas.harvard.edu/~iranian/). 
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PART 5 


GREEK 


Rupert Thompson 


INTRODUCTION 

Greek is today the official language of Greece and of the Republic of Cyprus, and is spo- 
ken by sizeable diaspora communities across the globe. Varieties of Greek are spoken by 
minorities in southern Italy (Griko) and on the coast of the Black Sea (Pontic). Until the 
early part of the twentieth century there were large numbers of Greek speakers in Turkey; 
dialects such as Cappadocian are now all but extinct. All of the modem dialects of Greek, 
with the possible exception of Tsakonian (spoken by a declining minority in villages on 
the slopes of Mount Pamon), which may be a survival of the ancient Laconian dialect, 
descend from the variety known as the Koine (Gr. koivt] fiidLsKioq, ‘common dialect’) 
which itself developed from the “internationalised” version of the dialect of Athens of the 
fifth century BC, “Great Attic”, and was adopted by the Macedonian kings as the official 
court language. From an IE perspective Greek is a member of the “core” group of lan- 
guages left behind by the split of Anatolian and Tocharian. It is generally now considered 
that it developed in situ in the Balkan Peninsula in the second millennium BC. A num- 
ber of words, including some place names, terms for flora and fauna, and cultural items 
(KopivOoq ‘Corinth’, cukov ‘fig’, dad|iiv0oq ‘bath-tub’) attest substrate influence of non- 
Greek (even non-IE) languages. Some, with problematic phonological details (e.g. Inno g 
‘horse’ < PIE *ekwos, with unexplained aspiration and vowel quality), suggest contact 
with otherwise unknown IE varieties. 

Before the prestige of the Koine caused their ultimate demise in the Hellenistic period 
there had existed in the Greek-speaking world a large variety of regional dialects. None 
had the status of a standard language; the Greeks were clear enough that they all spoke 
varieties of Greek - famously according to Herodotus 8. 144 to 'ELLqviKov ‘Greekness’ is 
opoyLcoccov ‘of one language’ - but “Greek” in this sense was no more than an abstrac- 
tion (Morpurgo Davies 1987). Certain dialects, often with extreme local features watered 
down or altogether removed, did become the conventional medium for certain genres of 
literature. For example, no matter what the native dialect of the poet, choral lyric - even 
the lyric passages of Attic tragedy - was written in a literary version of Doric (although 
in tragedy this largely amounts to the replacement of Att. r| by Dor. a and the use of the 
Dor. first-declension gen. pi. in -civ rather than Att. -cov). Epic poetry was composed in the 
largely Ionic-based Homeric Kunstsprache. 

The dialects have traditionally been classified into two main families, designated East 
Greek and West Greek according to their distribution in the first millennium. East Greek 
comprises Attic-Ionic and the more conservative Arcado-Cypriot; West Greek consists of 
(Peloponnesian) Doric and North-West Greek. A third group, Aeolic, made up of Thes- 
salian and Boeotian on the Greek mainland, and Lesbian, spoken on the island of Les- 
bos and the facing northern part of the coast of Asia Minor, is difficult to fit into this 
binary classification. Garcia Ramon (1975) argues that these dialects arose from contact 
between East and West Greek varieties; recent scholarship (following Parker 2008) calls 
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into question the very existence of an Aeolic group and suggests that the three dialects in 
question may be unrelated archaic varieties. The dialect of Pamphylia is similarly hard 
to place (see Brixhe 1976). On the question of whether or not ancient Macedonian was a 
Greek dialect - a matter of no less heated debate in the ancient world than it is today - see 
Mendez Dosuna 2012. 



MAP 5.1 THE GREEK DIALECTS 

Source: Adapted from: L. R. Palmer, The Greek Dialects, London: Faber and Faber, 1980 

From the second half of the eighth century BC (the earliest examples are the “Dipylon 
pitcher” from the Kerameikos cemetery in Athens, dated to ca 740 BC, and the “cup of 
Nestor” from Pithekoussai on the Italian island of Ischia, dated to 750-700 BC) inscrip- 
tional material is written in a number of related local (“epichoric”) alphabets, all deriv- 
ing ultimately from an adaptation of the Phoenician abjad. Gradually they came to be 
replaced by a standardised alphabet of East Ionic origin (Table 5.1). Modem conventional 
orthography mixes upper- and lower-case graphs and employs diacritics of Alexandrine 
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and Byzantine origin for the accent, for aspiration (or the lack of it) on vowels and r, and 
for the second member i of long diphthongs. For details of the epichoric alphabets see 
Jeffery 1990. In addition to the epigraphic record the contemporary written material is 
augmented by a large number of papyri from Egypt dating to the Hellenistic and Roman 
periods. These include a variety of personal and official documents which provide invalu- 
able evidence of the linguistic behaviours of a broad spectrum of social strata. They 
also include early copies of some literary texts. The literary evidence has otherwise been 
mediated by a long tradition of manuscript copyists. 


TABLE 5.1 THE GREEK (IONIC) ALPHABET. Phonetic values are those of Attic ea 400 BC. The 
equivalent spellings in the Old Attic alphabet, used in Athens before 403 BC, are also shown. 


Grapheme 

Name 

Phonetic value 

Old Attic equivalent 

Notes 

A, a 

alpha 

[a, a:] 

A 


B,(3 

beta 

[b] 

B 


T y 

gamma 

[g] / It)] 

r 

(1) 

A, 8 

delta 

[d] 

A 


E, E 

epsilon 

[9] 

E 


f,f 

digamma (wau) 

[w] 

— 

(2) 

z,C 

zeta 

[zd] 

/ 


H, q 

eta (heta) 

[E] 

E(H=[h]) 

(3) 

0,9/9 

theta 

[t»] 

© 


I,t 

iota 

[i, i:] 

I 


K, K 

kappa 

M 

K 


A, X 

lambda 

[1] 

A 


M, p 

mu 

[m] 

M 


N, v 

nu 

[n] 

N 


3,5 

xi 

[ks] 

XE 


O,o 

omicron 

[ 0 ] 

0 


n, 71 

Pi 

[Pi 

n 


p,p 

rho 

fc f] 

p 

(4) 

E, a/q 

sigma 

[s] 

E 

(5) 

T, x 

tau 

[t] 

T 


Y, u 

upsilon 

[ii, u:] 

Y 

(6) 

<J>, tp 

phi 

[P h ] 

<D 


X,x 

chi 

[k h ] 

X 


*P, ip 

psi 

[ps] 

<DE 


n, to 

omega 

[o:] 

O 


Et 


[e:] 

El = [ei], E = [e:] 

(7) 

on 


[u:] 

OY= [ou], 0 = [ 0 :] 

(7) 

a 


[a:(i)l 

AI 

( 8 ) 

11 


[e:(i)] 

EI 

( 8 ) 

to 


[a:(i)l 

OI 

( 8 ) 


Notes to Table 5.1: 

(1) Before k, x, p or another y the letter y has the articulation of a velar nasal [tjJ, 

(2) The sound [w] was lost prehistorically in Attic-Ionic, and consequently the graph- 
eme <F> was not used in those dialects. The name “digamma” comes from its shape, 
which resembles two capital gammas superimposed. 
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(3) In most varieties of the Greek alphabet the grapheme <H> has the value [h]. East 
Ionic, lacking the phoneme h, redeployed the redundant sign as a grapheme for the 
long vowel which became e. (See below under “Phonology”.) Current practice, orig- 
inating in Alexandrian notation, is to print the rough breathing <’> above initial 
vowels (and r) which have aspiration and the smooth breathing <’> above those 
which lack it. 

(4) Word-initially, and perhaps in the second member of a geminate, r is voiceless and 
aspirated, [r h ] . Elsewhere it is voiced and unaspirated [r]. Word-initial rho is thus 
traditionally written with the rough breathing, p. 

(5) The lower-case form a is used medially, q word-finally. The ancient Greeks them- 
selves used only upper-case letter forms, and so made no such distinction. Conse- 
quently, some scholars prefer to use “lunate” sigmas C, c, with the same lower-case 
fonn used in all environments. 

(6) In most dialects, and originally in Attic, the value of u is [u, u:]. See below under 
“Phonology”. 

(7) The sequences £i and ou are known as “spurious diphthongs”. Originally they rep- 
resented diphthongs [ei] and [ou], but sometime before 403 BC in Attic they had 
monophthongised to close-mid vowels [e:, o:] (transcribed here e, 5), the latter sub- 
sequently raising to [u:]. 

(8) These represent original “long diphthongs” ai, ei, oi - see Allen 1987a: 84-88 for 
the likely phonetic reality of these sounds - which were in inscriptions written AI, 
EI (later HI), OI (later f2I), but which by ca 200 BC lost the second element to merge 
with a, e, 5 and were then written A, H, £2. The convention of indicating this lost 
i with an iota subscription is of Byzantine origin and is generally followed today, 
except in the case of capitals, e.g. AiSr|q = Haides ‘Hades’: were it Haides the accent 
and breathing would be printed over the iota: A!8r|q. Some scholars prefer to print 
such iotas adscript, oil, r|i, coi. 

Before 1952 the earliest Greek material known to us was provided by the two epic 
poems attributed in antiquity to Homer, the Iliad and the Odyssey. The language of 
these poems is broadly Ionic, but has an admixture of archaic, Aeolic and wholly arti- 
ficial forms. Thanks to the work of Milman Parry (Parry 1928a, 1928b; see also Parry 
et al. 1971) it is known that this Homeric Kunstsprache owes its formation to a long 
tradition of oral poetry, dating back to the Bronze Age and beyond. The poems are 
composed in strict dactylic hexameters, and to assist the process of oral composition 
the poets had evolved a complex system of formulae, “building-blocks” of convenient 
metrical shape, e.g. nokbxkaq 8ioq '05uaasu<; ‘much-suffering godlike Odysseus’ or 
jroA,u|ir|Ti<; '08vaaex>q ‘much-cunning Odysseus’, which fill in the shapes, so conve- 
niently completing the line from the weak third-foot caesura and strong fourth-foot 
caesura respectively. Where such formulae contained material which was no longer 
linguistically current the Ionic poets replaced it with contemporary Ionic forms if they 
had the same metrical shape; otherwise with suitable contemporary forms from a par- 
allel Aeolic tradition; and failing that they retained the archaic material (so Horrocks 
1987, 1997; but the idea that the tradition went through a distinct Aeolic phase before 
passing entirely into Ionian hands is not totally laid to rest. For discussion see the con- 
tributions by Janko and Jones in Andersen and Haug 2012). The result is an amalgam 
from various time -depths, containing, for example, contemporary AI o-stem gen. sg. in 
-ou alongside its earlier form, -oio; and some features, such as “tmesis” - the separation 
of a preverb from its verb - and the scansion of dv5p6xr|Ta ‘manhood’ at Iliad 16.857 



GREEK 291 


as if *anrtata, must pre-date the Bronze Age. The poems took their present form prob- 
ably in the eighth or seventh century BC, and it is doubtful whether there was a single 
poet responsible for each poem, let alone both; but the name “Homer” is a convenient 
shorthand for the poetic tradition. 

Excavating in Knossos on Crete in 1900, Sir Arthur Evans discovered clay tablets writ- 
ten in an unknown script, today called Linear B, dating to ca 1400 BC. Similar documents 
of a slightly later date were unearthed in subsequent excavations at Pylos in Messenia, at 
Mycenae in the Argolid and at Thebes in Boeotia, and continue to be found to the present 
day, the most recent discoveries coming from Ayios Vasilios in Lakonia. A collaboration 
in the 1950s between Michael Ventris and John Chadwick, building on earlier work by 
the American scholar Alice Kober, revealed that these documents were the economic 
records of Bronze Age palatial elites written in a very early form of Greek which today 
we call Mycenaean. Despite its early date and obvious conservatism in some areas (e.g. 
retention of PIE labiovelars and *w, preservation of instrumental morphology) Myce- 
naean has undergone the assibilation of Proto-Greek *ti to si which characterises the East 
Greek dialect family, indicating that the split into East and West Greek had taken place 
already in the second millennium. 

Linear B evidently derives from an earlier script, Linear A, used to write an unknown 
language of Crete (and which itself superseded a more elaborate writing system called 
Cretan Hieroglyphic). The script is not especially well adapted to writing Greek. In addi- 
tion to a large number of ideograms it contains some 88 syllabic signs, each of which 
denotes a simple vowel, a sequence of consonant + vowel or, in a small number of cases, 
more complex sequences such as rai or nwa. It does not normally distinguish between 
voiceless, voiced and aspirated stops (except that t and t h are distinguished from d, and 
there is an optional sign representing p h u), nor between r and /, and vowel length is 
not marked. Consonant clusters cannot be written: the scribes either omit consonants 
(“partial spelling”, e.g. pe-ma = sperma ‘grain’) or insert “dummy vowels” (“plene spell- 
ing”, e.g. wa-na-ka-te = wanaktei ‘for the king’). Diphthongs are not always marked, and 
word-final consonants almost never are. 

Another syllabic script derived ultimately from Linear A was used to write the ancient 
dialect of Cyprus. The earliest example may be the spit inscribed with the gen. of the 
man’s name Op''eltau dating from the tenth century BC, although it has recently been sug- 
gested that this may be the earlier “Cypro-Minoan” script (used to write one or more non- 
Greek languages). The Cypriot script differs from Linear B in lacking signs for voiced 
dentals, and in its strategies for writing consonant clusters. 


PHONOLOGY 

The consonantism of Greek is fairly conservative. Being a centum - type language, the 
PIE velars and palatals have merged as velars. The major development of the conso- 
nant system which characterises Proto-Greek is the loss of voicing on the PIE voiced 
aspirates, PIE *b h , *d h , *g h becoming <p, 0, % = [p h , t h , k h ]. The voiceless aspirated stop 
articulation of these phonemes is preserved into the Roman period: the earliest evi- 
dence for a pronunciation [f] for (p and [0] for 0 is from the first century AD. (This is 
true at least for Attic and its descendant, the Koine. In Laconian 0 may have become 
a fricative by the end of the fifth century. In the Lvsistrata, Aristophanes has the Spar- 
tans use forms such as aioq for Attic 0eoq, perhaps an attempt to write [0].) Similarly, 
(3, 8, y retained their articulation as voiced stops throughout the ancient period; it is 
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not clear when they became the fricatives [(3, 5, y] of Modern Greek, but it may be as 
late as the ninth century AD. 


Labiovelars 

The PIE labiovelars are originally retained, except when adjacent to u or w when they lose 
their labialisation and merge with the plain velars. Mycenaean spells all labiovelars with 
a series of signs transcribed qa, qe, qi, qo, e.g. a-pi-qo-ro = amp h ik"'olos (Att. diupuiokoq) 
‘attendant’ while qa-si-re-u = g"asileus (Att. Paoikerx;, a local official in Mycenaean, 
‘king’ in later Greek). In qo-u-ko-ro = g"'oukolos (Att. PouKokoq) ‘cowherd’ the first labio- 
velar is retained, and the second has become velar because of the preceding u. 

In the dialects of the first millennium the remaining labiovelars merged with various 
other consonantal phonemes. In Attic-Ionic and West Greek the development is twofold: 
(i) before e (and in the case of *k w also before i) the outcome is dental, *k w > t, *g w > d, 
PIE *g wh > Proto-Greek *k wh > /*; (ii) elsewhere the outcome is labial, *k w > p, *g w > b, 
PIE *g wh > Proto-Greek *k wh > p h . This is the major source of the b phoneme in Greek. 
Examples are given in Table 5.2. Other dialects behave differently. In Aeolic, for exam- 
ple, the outcome is labial across the board (e.g. thus Lesb. jrepjie = Att. 7ievi£ ‘five’), but 
even in Aeolic the pronominal stem *k w i- gives Tiq etc. 


TABLE 5.2 TREATMENT OF LABIOVELARS IN ATTIC-IONIC AND WEST GREEK 



* k w 

*g w 

*gwh > p ro to-Greek *k wh 

before e 

*penk w e > tievts ‘five’ 

*g w elb h u- > or./.(pi) 3 ‘womb’ 

*g wh en- > 0svvco ‘kill’ 

before i 

*k w is > lie. 'who?’ 

*g w ih 3 - > pio<g ‘life’ 

*h 3 eg wh i- > otpiq ‘snake’ 

elsewhere 

*penk w tos > Jispntoi; 'fifth' 

*g w m- > Patvco ‘go’ 

*g wh on- > (povoq ‘murder’ 


Laryngeals 

The behaviour of the PIE laryngeals is similar in Greek to in the other daughter languages, 
with two important exceptions. First, and unique to Greek, is the so-called triple -reflex of 
laryngeals in positions where they vocalise. In all other IE languages all three laryngeals 
give the same reflex, i in Indo-Iranian, a elsewhere; in Greek, however, *h, vocalises to 
e, *h, to a, *h, to o. Thus, *d h h 1 tos > fiexoq ‘set’, *sth 2 tos > omToq ‘placed’, *dh,tos > 
doToq ‘granted’. Second, word-initial *H before a resonant other than *y also vocalises 
rather than being lost as in other languages (other than Armenian): *h 1 rud h - > epuBpo:; 
‘red’, *h,ner- > dvijp ‘man’, *h 3 nomn > ovopa ‘name’. This is the origin of at least some 
prothetic vowels in Greek (compare Lat. ruber, Nero and nomen for the equivalent forms 
without). The triple reflex is also seen in the outcome of *RH, as pq, poi, pco etc. 

Semivowels 

PIE *w was inherited into Proto-Greek, is preserved in Mycenaean, but was lost at var- 
ious stages in the later dialects. Where retained, it was written with the letter p. In Attic- 
Ionic its disappearance was prehistoric. It was originally present in the Homeric tradition, 
where at the convenience of the poet it is either observed - to block hiatus (e.g. pekiqSea 
(p)oivov ‘honey-sweet wine’) or make a preceding vowel long by position (e.g. euiaq 
(p)sno<;) - or ignored (e.g. pskiqSsoq oivoo). 
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Medially, PIE *y was lost via [h], Word-initially it shows a double treatment. In 
some roots it gives Greek C,- (originally with the value [dz]), in others [h]. The out- 
come is consistent across the dialects. Those roots which give later Greek are in 
Mycenaean spelt with signs of the z-series (e.g. PIE *yewg- > Myc. ze-u-ke-si = dze- 
uges(s)i ‘pair (dat. pi.)’, Att. ^euyeai); those which give [h] are spelt with signs of the 
y-series or simple vowels (e.g. from the *yo- pronominal stem adverbs jo- = vo and 
o- = ho ‘how’). This probably shows that the change *y > h was in progress at the 
time the Linear B documents were written. The alternation between h- and ^-roots has 
not been satisfactorily explained, although as *Hy- does not give a prothetic vowel, 
word-initial laryngeals have sometimes been supposed to underlie one development 
or the other. 

Later Greek C, has two other sources: (i) the palatalisation of *dy and *gy, e.g. *ped- 
yos > neC,ot; ‘on foot’, *meg-yos- > Ion. pe^cov, comp, of psyaq ‘big’; and (ii) the com- 
bination of s+d, e.g. ABqva^e < AOqvaa+Se ‘to Athens’. The former type is spelt in 
Mycenaean with z-series signs (e.g. me-zo-e = medzohes ‘bigger’), the latter with signs 
of the rf-series (e.g. te-qa-de = T'eg"'ans-de ‘to Thebes’). This indicates that the original 
outcome of *dy and *gy metathesized to zd and thus merged with s+d in the post- 
Mycenaean period. 

Voiceless *t (h) y and *k (h) y also underwent palatalisation. There are two distinct phases. 
First, those sequences of *ty and *t h y where no morpheme boundary intervened pala- 
talised to [Is], which gives -tt- in Boeotian, -ss- elsewhere, simplifying to -s- in Attic- 
Ionic, e.g. *yotyos > AI oaoq ‘how big’ vs. Lesb., Thess., Dor. (h)oooo<;; PIE *med h yos 
> Proto-Greek *met h yos > AI peaoq ‘middle’, Boe. peiToq, elsewhere peaao<;. Where 
a morpheme boundary intervenes the change is delayed. For most dialects the outcome 
is the same as homomorphemic *t (h) y (viz. Boe. -tt-, general -ss-), but Attic this time 
follows the Boeotian treatment (e.g. *melit-ya > Att. peA-iira ‘honey bee’) and in Ionic 
the -55- does not simplify (e.g. pfAiaoa). The outcomes of *k lh) y and *tw are the same: 
Proto-Greek *p h ulak-yo ‘guard’ > Att. and Boe. (piAdiico vs. (prAdoo© elsewhere; and 
Proto-Greek *k w etwr- ‘four’> Att. Tenape:;, Boe. ;r£TTape<; vs *k w etwer- > Ion. reoaepeq. 
Interestingly, words of non-Greek origin show the same distribution of -it- vs -go-, e.g. 
Att. and Boe. Banana vs. general Qalaaaa ‘sea’. 

The PIE fricative 

Between vowels, and before a vowel at word-beginning, PIE *s > h, and medially is then 
lost. The lenition of *s > h is pre-Mycenaean, the subsequent loss post-Mycenaean: from 
Proto-Greek *p h arwesa ‘cloths’, Mycenaean has pa-we-a 2 = p^arweha. This has interest- 
ing implications for Grassmann’s Law, by which the first in a sequence of two aspirates 
loses aspiration (e.g. *sek h - > Proto-Greek *hek h o > Att. pres. £%©, vs. fut. *hek h so > 
*hekso > e^©). Since *d h esos > Proto-Greek *t h ehos > Beoq ‘god’ we can perhaps infer 
that, although it is common to all dialects of the first millennium, Grassmann’s Law 
operates after the loss of medial -h-, i.e. in the post-Mycenaean period. Otherwise, the 
classical form would be *T£ 0 <;. 

The vowel system 

Proto-Greek inherited from PIE a simple vowel system with five short vocalic phonemes, 
a, e, i, o, u, each with a corresponding long equivalent. There were also diphthongs ai, 
ei, oi, au, eu, ou, and long diphthongs ai, ei, di, au, eu, ou. Early in the history of the 
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language, perhaps at the Proto-Greek stage, certain medial clusters involving a liquid or 
nasal and s underwent loss of the 5 and gemination of the resonant. Mycenaean attests 
this stage; thus, e.g., from Proto-Greek *agersantes, the aor. ptcp. m. nom. pi. of the verb 
ayeipco ‘collect’, Mycenaean has a-ke-ra 2 -te = agerrantes. The geminate is preserved 
in Lesbian and Thessalian, but the other dialects further simplify the cluster by degem- 
ination with compensatory lengthening of a preceding short vowel. In the case of a, i 
and u the outcome is identical to the existing long vowels; this is also true of e and o in 
Boeotian, Arcadian (and one supposes Cypriot, although the writing system obscures 
the details) and some Doric dialects labelled by Ahrens (1843: 403-414) as Doris seue- 
rior (‘severer Doric’); in other dialects, however, including Attic, Ionic and those Doric 
varieties labelled as Doris mitior (‘milder Doric’), e and o lengthened not to inherited e 
and o (which were [e:, o:]) but to new close-mid vowels e and 5 (phonetically [e:, o:]). 
Allen (1959, 1987b) suggests that in these varieties the short-vowel system was skewed 
towards the top of the vocalic space (Figure 5.1), so that short e, o were of noticeably 
closer quality than inherited long e, o. He cites in support the fact that in these dialects 
PIE *r and *1 give ap~pa and aX~ka\ the original outcome is likely to have been [or] etc., 
and the top-skewing would mean that [a] was sufficiently close to a as to merge with it. 
The result of this is shown in Figure 5.2. The new e and 5 phonemes were spelt in Old 
Attic orthography as <E, 0>. After 403 BC and in standard modem orthography they 
are spelt ei and ou for reasons which are explained below. Examples of the treatment of 
various clusters are: 

Proto-Greek *k h eslioi > Ion. %£$aoi, “severe” Dor. xh^toi, Lesb. and Thess. /tALioi 
‘thousand’ 

Proto-Greek *asmes > AI r|pd<; (< *dp£t<;), Dor. dpa;, Lesb. appa; ‘we’ 

Proto-Greek *esmi > AI dpi, “severe” Dor. f|pi, Lesb. eppi ‘I am’ 



Cl 


FIGURE 5.1 INHERITED VOWEL SYSTEM OF PROTO-ATTIC-IONIC 
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FIGURE 5.2 VOWEL SYSTEM OF PROTO-ATTIC-IONIC AFTER COMPENSATORY 
LENGTHENING 

The resulting system seems to have suffered from overcrowding on the back axis, 
which is substantially shorter than the front. In Proto-Attic-Ionic this had implications 
for both the top and bottom of the back axis. First, the a phoneme underwent front- 
ing to become [re:]. A second wave of compensatory lengthenings, this time involving 
word-final ns clusters and those with secondary s, added to the inventory of e and o 
and reintroduced a new a. It is at this stage that Eastern Ionic redeployed the redundant 
grapheme <H> to represent the shifted phoneme: a seventh-century BC inscription 
from Naxos spells the word ‘sister’, phonetically [kasigne:tae:], as KAEITNETH, using 
<E> for inherited e (and also for e and e) and <H> for ce < a. <A> is used for inherited a 
and the new a from the second wave of compensatory lengthening. Eventually, in Ionic, 
ce merged completely with <?, taking its grapheme with it, so the standard spelling of the 
‘sister’ word became Kaoiyvf)xr|, representing [kasigne:te:]. In Attic it split, merging with 
the new a after e, i or r, and with e elsewhere. 

The effects of overcrowding were now felt again, this time causing a shift in the top of 
the back axis. Inherited u (spelt <Y>) was pushed over onto the front axis, becoming ii 
(a high front vowel with lip-rounding), and for reasons of symmetry the short u followed 
it to become ii. The original value of <Y> is seen in the onomatopoeic verb puKdopai 
‘moo’, and in the spellings <AY, EY> of the diphthongs au, eu. The close-mid o then 
moved up to occupy the space so vacated at the top of the back axis and became [u:]. 
This is its value in Classical Attic. As a consequence there is no longer a corresponding 
short u phoneme. These developments explain why the Boeotians, when they adopted 
the Attic alphabet in the fourth century, considered <Y> unsuitable for writing their own 
phonemes which had retained the values [u, u:] and instead used the digraph <OY>. 
Occasionally, too, spellings <AO, EO> are found for au, eu. 

In the Old Attic orthography <E> was used not only for e but also e and e, and <0> 
likewise for d, 6 and 5 (> u). In 403 BC, under the archonship of Eucleides, Athens 
adopted the Ionic alphabet, which used <H> and <D> for e and o. At the same time 



296 RUPERT THOMPSON 


<EI> and <OY> were employed for e and u < 5. These spellings originally represented 
the diphthongs [ei] and [ou], which had by the fifth century BC monophthongised and 
merged with e and o (> u), and it is unsurprising that they should then be used for the sim- 
ple vowels from whatever source. They are traditionally called “spurious diphthongs” but 
are properly speaking digraph spellings of monophthongs. The vowel system of Classical 
Attic, with the post-Eucleidean spellings, is shown in Figure 5.3. 


<I> = i, i <Y > = u,u <OY> = u 



FIGURE 5.3 VOWEL SYSTEM OF CLASSICAL ATTIC, SHOWING OFFICIAL SPELLINGS 
AFTER 403 BC 

It is worth saying a few words about the later developments of the vowel system into 
the Koine and beyond. Around the end of the fourth century BC e raised and merged with 
f; one imagines that e itself then raised slightly from [e:] to [e:]. By the end of the first 
century AD a new wave of monophthongisations caused oi to merge with ii, and turned ai 
into [e:]; in turn, e raised to [e:] and eventually [i:], merging with F. Between the second 
and third centuries AD phonemic vowel length was lost, probably in connection with the 
change from a pitch to a stress accent, and, finally, sometime after the fourth century ii 
(including the outcome of earlier oi) lost its lip-rounding and merged with giving the 
simple five- vowel system of Modem Greek: a <a>, e <e, at>, i <i, et, r|, u, ot>, o <o, co>, 
u <ou>. 


Accent 

The details of the Greek accentual system are beyond the scope of the present chapter. 
The reader is referred to Probert 2003. Only a basic overview is presented here. 

From the time of Plato ancient commentators on the accentual system of Greek 
describe two categories of accent, o<qvq ‘sharp’ and Paptx; ‘heavy’. Two of the general 
terms for accent, Tovoq and took;, both meaning ‘stretching’, are metaphors of the tension 
in the strings of a musical instrument, and the third, JtpoaraSla ‘singing-along-with’, also 
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implies an accent based on pitch rather than stress. The categories b^vc, and Papxx; are 
themselves associated with the terms £7iixaoi<; ‘stretching’ and aveavq ‘slackening’, and 
are thus clearly high and low pitch respectively. 

The tradition of marking the accent with diacritics originates in Alexandria ca 200 BC, 
but the present system of notation is Byzantine. The acute marks a rise in pitch followed 
by a fall in pitch on the next syllable, e.g. av0pcojro<; with rise on the a and fall on the 
co. Each change of pitch requires one mora (the length of a short vowel). If the accent is 
borne by a long vowel or diphthong (both of which equal two morae) the pitch may rise 
on the second mora and then fall on the following syllable, in which case it is still marked 
acute (e.g. dvBpccmo'o, which may be regarded schematically as ant h rodpuu), or it may 
rise on the first mora and fall on the second mora of the same syllable, in which case it is 
marked with the circumflex (e.g. Jtpaypa, schematically praagma). When a word which 
bears an acute on its final syllable (the traditional term is “oxytone”) is followed by 
another accented word, its accent is written grave (barytonesis, e.g. ayaQoq av0pccmo<;). 
It is not clear what this represents, but it is a reasonable assumption that the pitch was 
required to return to the base level by the word end, and that the grave accent indicates 
that the expected rise in pitch is suppressed, or at least moderated. 

The position of the accent is free within certain constraints, known collectively as 
the law of limitation: if the final syllable is light (i.e. contains a short vowel followed 
by at most one consonant) an acute can stand on the final, penultimate or antepenulti- 
mate syllable and a circumflex on the penultimate; if it is heavy, an acute can stand on 
the final or penultimate syllable and a circumflex on the final. A word which bears its 
accent as far from the word-end as permitted by the law of limitation is said to have 
recessive accent. Most verbal forms are accented this way. A handful of verbal catego- 
ries have specific accents, for example the strong aorist participle, accented finally, e.g. 
eurcov ‘having said’. Nouns, adjectives, adverbs and pronouns have a particular accent 
as part of their lexical specification. A further rule, known as the sotera rule, stipulates 
that if the accent falls on a long vowel in the penultimate syllable and the final vowel 
is short, the accent is circumflex (thus ocoxfjpa, not N acoxf|pa). The accent is also cir- 
cumflex if it falls on a contracted final syllable. For all of the foregoing purposes the 
diphthongs ai and oi, when word-final, count as short rather than long vowels except in 
optatives and locative adverbs. 

Certain types of word, including indefinite pronouns, adjectives and adverbs, and some 
sentence particles and forms of the verbs eipi ‘be’ and (pqpi ‘say’, are atonic enclitics 
which cohere so closely to a preceding word as to form a single unit for accentual pur- 
poses. An oxytone word followed by an enclitic retains its acute accent (thus ayaQoq xvq 
av0pccmo<; vs ayaQoq av0pco;to<;). In some other cases either the preceding word or the 
enclitic may gain a secondary accent (e.g. av0pco;t6<; xtq). 

The remarks made thus far concern the accentuation of Attic and the Koine, until 
around the second or third century AD. Very little is known about the accentual systems 
of other dialects. Ancient grammarians and papyri concur that in Lesbian the accent was 
recessive; thus Lesb. 0£Otai = Att. 0eoi<; ‘gods (dat. pi.)’. Grammarians and papyri also 
suggest that in Doric the sotera rule did not apply, and that final -ai and -oi counted as 
long in nominative plurals. Chadwick (1992) argues that Thessalian had a word-initial 
stress accent. Previously, editors of epigraphic texts have accented dialect material 
(except Lesbian) in the manner of Attic. It is common today to omit accents in dialect 
texts. 

Between the second and third centuries AD the accent became one of stress, as it is in 
Modern Greek. 
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MORPHOLOGY 

In both nominal and verbal morphology Greek inherited three values of the category 
number: singular, dual and plural. In Mycenaean whenever two items are recorded the 
dual is used, for example in the list ofbanqueting equipment from Pylos (PYTa 641.2) 
dipahe medzohe tridwehe ‘two larger three-handled goblets’, where the noun and both 
adjectives are dual. In Homer and in classical Attic the dual is used (i) for “natural 
pairs” (such as eyes, hands, etc.); and (ii) for “accidental pairs” when the numeral 8uo 
‘two’ (itself a dual form) or the word ^£uyo<; ‘pair’ is used in conjunction; but even 
in these cases the plural is frequently used. Ionic had lost the category early, and its 
retention in Attic was considered parochial. It was therefore eliminated in Great Attic 
and the Koine. 

Nominal morphology 

There are three main nominal declensional types: the a-stems (the Greek “first declen- 
sion”) continue the PIE *-eh 2 and *-ih, types, the “second declension” the IE o-stems, and 
the “third declension” the various IE athematics. 


TABLE 5.3 FORMS OF THE FIRST DECLENSION (A-STEMS) IN CLASSICAL ATTIC 




-a 

type (f.) 

-a 

type (f.) 

m. 




Xcapa- 

yvcopa- 

uyisia- 

So ija- 

vsavia- 

Kpua- 



‘place’ 

‘ thought’ 

‘ health’ 

‘opinion’ 

‘ youth’ 

‘judge’ 

sg- 

nom. 

Xcopa 

yv6pr| 

hyisid 

86^6 

vsavia q 

Kpixriq 


voc. 

Xcbpa 

yv6pp 

hyisid 

86^6 

vsavia 

Kpixa 


acc. 

Xtbpav 

yvcbppv 

hyisiav 

8o^av 

vsaviav 

Kpixpv 


gen. 

Xwpdc; 

yvcbppc; 

hyisiaq 

8oihl<; 

vsaviou 

KpiXOU 


dat. 

X(bpa 

yvcbpp 

hyisia 

Sof;q 

vsavia 

Kpixfj 

du. 

nom./acc./voc. 

Xcbpa 

yv6pa 

hyisid 

So^a 

vsavia 

Kpixa 


gen./dat. 

Xcbpaiv 

yvwpaiv 

hyisiaiv 

So^atv 

vsaviaiv 

Kpixaiv 

pi. 

nom., voc. 

X«>pai 

yvcbpai 

hyisiai 

86sat 

vsaviai 

Kpixai 


acc. 

Xcbpdq 

yvcbpaq 

hyisiaq 

56^d<; 

vsaviag 

Kpixai; 


gen. 

Xcopcov 

yvcopcov 

hyisicav 

So^tov 

vsavicov 

KpiXCOV 


dat. 

Xcbpait; 

yvcbpaii; 

hyisiaiq 

So^ati; 

vsaviaic; 

Kpixaiq 


The first declension (Table 5.3) has two sub-types. The first, deriving from the *-eh, 
declension, has -a- throughout the sg., becoming -r|- in Ionic and in Attic except after £, i 
or p. The second, deriving from the *-ih 2 type, has -a- in the nom., voc. and ace. sg., and 
-a- (becoming -r|-) in the gen. and dat. sg. Masculines of the first declension have nom. 
sg. in -dq (becoming -qq) by analogy with the second declension and gen. sg. in -ou (in 
Attic-Ionic), imported from the second declension. (Other dialects have forms deriving 
from -do, where the -a- is the stem and the -o has been imported from the second declen- 
sion original form in -oio.) Ace. pi. -dq is from -ans, which is retained in some dialects, 
including Mycenaean; Lesbian has -aiq by the normal treatment of word-final -ns in that 
dialect. The gen. pi. of all a-stems is originally in -ahon (preserved in Mycenaean), which 
contracts in Attic-Ionic to -cov and in West Greek to -fiv. There are no neuters of the first 
declension. 



GREEK 299 


TABLE 5.4 FORMS OF THE SECOND DECLENSION (O-STEMS) IN CLASSICAL ATTIC 




m./f. 

n. 



A>yo- 

yjyo- 



'speech' 

‘yoke’ 

sg- 

nom. 

A-oyoi ; 

ijuyov 


voc. 

A,oye 

ijuyov 


acc. 

Aoyov 

i^ryov 


gen. 

A,oyou 

ijuyou 


dat. 

Aoyw 

?vytp 

du. 

nom./acc./voc. 

Aoyco 

ijuycb 


gen./dat. 

Aoyoiv 

£vyoiv 

pi. 

nom., voc. 

Adyoi 

i^uya 


acc. 

Aoyoui; 

ijuya 


gen. 

Voytov 

ijuycov 


dat. 

Aoyou; 

^uyoii; 


The second declension (Table 5.4) consists largely of masculines and neuters. 
There are a handful of feminines (e.g. f) o8oq ‘way’) which decline in the same way 
as the masculines. The gen. sg. is originally in -oio, which is preserved in Mycenaean 
and Homer. Loss of the intervocalic glide gave -oo (perhaps to be reconstructed in 
some Homeric forms), which contracted in Attic-Ionic to -ou. Ace. pi. -ovq is from 
-ons, retained in some dialects, including Mycenaean; Lesbian has -oiq. In Attic, 
stems ending in -o and -e undergo contraction (e.g. vovq ‘mind’ = voo q, ooiouv 
‘bone’ = ooisov). Stems in -e are affected by twin processes of quantitative metathe- 
sis and prevocalic shortening, which cause both *-eo- and *-eo- to become -era-. Thus 
Proto-Greek *lawos > *leos > Aedxg. This type is known as the “Attic declension” 
because in the Koine it is replaced by the (non-Attic-Ionic) common Greek forms 
such as Xaoq. 


TABLE 5.5 UNDERLYING ENDINGS OF THIRD DECLENSION, AND CONSONANT STEM 
FORMS IN CLASSICAL ATTIC 






m./f. 


n. 



underlying 

rcaxep- 

spcox- 

rcoipev- 

acopax- 



ending 

‘ father’ 

iove’ 

‘ shepherd’ 

‘body’ 

sg- 

nom. 

-s 

7iaxf|p 

spcoq 

7lOipf|V 

ad> pa 


voc. 

(zero) 

7idx£p 

epooq 

7ioipf|v 

ad) pa 


acc. 

-n > -a 

7iaxspa 

epcoxa 

7lOl(J,8Va 

ad) pa 


gen. 

-os 

7raxpo<; 

epcoxoq 

7TOip8VO(; 

acbpaxoq 


dat. 

-i 

7raxpi 

spcoxi 

rcoipevi 

acbpaxt 

du. 

nom./acc./voc. 

-e 

7iaxsp8 

epcoxe 

7TOl(J,8V8 

acapaxe 


gen./dat. 

-oin 

7tax8poiv 

epwxoiv 

7lOip8VOlV 

acopaxoiv 

pi. 

nom., voc. 

-es 

Tiaxspei; 

epcoxeq 

rcoipevei; 

acopaxa 


acc. 

-ns > -as 

naxepac, 

epcoxag 

7ioip8va<; 

acopaxa 


gen. 

-on 

rcaxepcov 

epwxcov 

7TOl|X8V(OV 

acapaxcov 


dat. 

-si 

Tiaxpaai 

spcoai 

7101^801 

acbpaai 
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The third declension consists of many sub-types. Only an overview can be given here; 
consult a reference grammar such as Smyth and Messing 1956 for a complete catalogue 
of forms. The basic underlying endings are given in Table 5.5, but because they are joined 
directly to the stem with no intervening vowel, if the stem ends in a consonant it under- 
goes various changes. Variations in stem ablaut must also be taken into account. Where 
the stem ends in a consonant other than -5 or *-w (also Table 5.5) the oblique cases (other 
than the dat. pi.) generally show the underlying form of the stem. An important archaic 
type is reflected in the noun jrairjp ‘father’, which shows the e-grade of the stem in the 
nom., voc., ace. sg., in the dual, and in the nom., voc., ace. and gen. pi, but the zero- 
grade in the other cases. The nom. sg. of consonant stems regularly ends in -s, but when 
the stem ends in a liquid or nasal it has no termination, and the preceding vowel, if short, 
lengthens (as in both ;iaTfjp, stem Jiaisp-, and wot|.if)v ‘shepherd’, stem ^oipev-). In the 
ace. sg. the ending *-n (< PIE *-m) surfaces as -a, and in the ace. pi. *-ns as -aq. 

Neuter C-stems are endingless in the nom., voc. and ace. sg. The stem-final consonant 
is dropped if it is one which is not permitted at word-end (as in acopa ‘body’ from the 
stem acopon;-). 

When the stem-final consonant was -s, between vowels it had passed to [h] in the 
second millennium (hence Myc. p h arweha < Proto-Greek *p h arwesa ‘cloths (nom./acc. 
pi.)’). In Attic the result undergoes contraction (Table 5.6), and it is therefore customary 
to treat the 5-stems as a special type apart from the C-stems proper. There are two main 
sub-types. For those with a stem in *-es- throughout (e.g. ScoKpaiqq), most, if not all, are 
masculine personal names (the one exception, ipirjpr|<; ‘trireme’, is probably in origin an 
adjective). The second major group consists of neuters with nom./voc./acc. sg. in -os and 
*-es- in the remaining cases, e.g. yevoq ‘family’. 


TABLE 5.6 FORMS OF THE THIRD-DECLENSION S-STEMS, /-STEMS AND //-STEMS IN 
CLASSICAL ATTIC 




5-stems 


/-stems 

w-stems 



m. 

n. 






ScoKpaiso- 

ysvso- 

JtoAl- 

on- 

7TT1XU- 



(man’s name) 

‘ family’ 

‘city’ 

‘Pig’ 

‘ forearm’ 

sg. 

nom. 

ScoKpdxrn; 

yevo^ 

jr6A.ii; 

auq 

Tifixix; 


voc. 

ScbKpaxsq 

ysvoi ; 

noAi 

auq 

7ifjxu 


acc. 

ScoKpaxri 

ysvoi ; 

jioAiv 

cruv 

7ifjxuv 


gen. 

EcGKpaxooi; 

ysvooq 

JtoAscoi; 

oooc; 

Ttrixecog 


dat. 

ScoKpaxei 

ysvsi 

7t6Asr 

(TUI 

7CT1X81 

du. 

nom./acc./voc. 

— 

ysvr| 

7t6A£l 

ous 

7TTixet 


gen./dat. 

— 

ysvoTv 

jioAeoiv 

ODOIV 

7ir|X£oiv 

pi. 

nom., voc. 

— 

ysvr| 

jioAeii; 

<JU£q 

nr\x£i<; 


acc. 

— 

ysvr| 

jtoAek; 

cruac;, auq 

nr\xzy; 


gen. 

— 

ysvcov 

ttoAecov 

(TUCOV 

7lf|X £C0V 


dat. 

— 

ysvsot 

rcoAEar 

(TUCl 

^rixeai 


The /-stems in Greek (Table 5.6) consist largely of feminines, though there are a few 
neuters (e.g. civoun ‘mustard’) which differ only in the nom. and ace. sg., which are 
endingless, and pi. (otvoutri < otvoursa). The type has been extensively remodelled from 
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PIE. First, the forms which had the full-grade stem *-ey- have been changed to *-ey- 
(e.g. gen. sg. *poleyos). Loss of the intervocalic *-y- leads to forms such as gen. sg. 
;r67.r|o<; (attested in Homer). In Attic we see the effects of quantitative metathesis (;r67.r|oq 
> no’X.eaq) and prevocalic shortening (*poleon > ;i67.£cov). Other dialects have gener- 
alised a stem in -i (gen. sg. nokioq etc.). 

The M-stems (Table 5. 6) are of two types. The more common (e.g. cru<; ‘pig’, gen. sg. 
cru6<;) follows the PIE -u/7-stems, having an invariant stem in -u- to which the C-stem 
endings are added. The other (e.g. jrrjxw; ‘forearm’) follows the PIE u-stems proper with 
an ablauting stem in -(e)u-. Nouns of this type are considerably rarer, though adjectives 
are well represented. In nouns Attic has a gen. sg. in -£(»<;, which has been imported from 
the /-stems. Other dialects (and, in adjectives, Attic) have -eoq, which goes back to a 
remodelled *-ewos. 


TABLE 5.7 UNDERLYING FORMS OF THE -EU STEMS (forms directly attested in Mycenaean are 
not asterisked), AND THE RESULTING FORMS IN CLASSICAL ATTIC 




underlying ending 

early AI 

Att. 






Sg- 

nom. 

*-ews > -eus 

-8U<; 

PaaiAAig 


voc. 



PaaiXei) 


acc. 

*-ewn > *-ewa 

-r|a 

PaaiAia 


gen. 

-ewos 

-r|o<; 

PaaiAicoi; 


dat. 

-ewi 

-rji 

PaaiTeT 

du. 

nom./acc./voc. 

*-ewe 

-r(8 

PaoAsT 


gen./dat. 

*-ewoin 

-r|oiv 

Puai/xoiv 

pi. 

nom., voc. 

*-ewes 

-rise; 

PaaArji;, PaoilEig 


acc. 

*-ewns > *-ewas 

-\]aq 

PaaiAiai; 


gen. 

*-ewon 

-rjcov 

PaaPiEtov 


dat. 

*-ewsi > -eusi 

-suai 

PaaPiEuai 


Special mention must be made of the nouns in -£u<; (Table 5.7), mostly denominatives 
denoting occupations (e.g. ui^erx; ‘horseman’) or instruments (e.g. roperx; ‘knife’), but 
also ethnics (e.g. Meyapeuq ‘Megarian’) and personal names (e.g. OSuocreuq). The type is 
unique to Greek. The underlying forms have stems in -ew- to which the standard C-stem 
endings are added. When the stem comes into contact with a consonant - i.e. in the nom. 
sg. and dat. pi. - Osthoff’s Law shortens the stem vowel to give nom. -eus and dat. -eusi. 
Other forms have intervocalic -w-, which is lost early in Attic-Ionic. In Attic the paradigm 
is further affected by quantitative metathesis (e.g. -eos > -eos) and prevocalic shortening 
(gen. pi. eon> -eon). 

Turning now to the inventory of case forms, the eight cases of PIE (nom., voc., ace., 
gen., dat., abl., instr. and loc.) are reduced in the dialects of the first millennium to five. 
Of these the nom., voc. and ace. directly continue their PIE counterparts. The PIE abl. 
was distinct only in the singular of the o-stems and was elsewhere identical with the 
gen. in the singular and the dat. in the plural. In Proto-Greek the abl. has been subsumed 
completely in the gen. (By a simplification of prepositional case government ablatival 
prepositions have come to govern the dative rather than genitive in Arcado-Cypriot and 
probably Mycenaean (Morpurgo Davies 1966), but this does not imply an ablative-dative 
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syncretism in these dialects, since the genitive continues ablatival functions in non- 
prepositional constructions. See also Thompson (1998, 2000, 2014) for further discussion 
of the Mycenaean evidence.) 

The PIE dat., loc. and instr. collapse in a complex series of mergers which are under- 
way at the time of the Mycenaean tablets. The first-millennium dialects have a case con- 
ventionally called “dative” which carries the functions of the PIE dat., loc. and instr. In 
the a- and o-stem declensions this continues, in the singular, the forms of PIE dat. sg. 
in *-ai and *-oi. (In some dialects this undergoes later shortening, e.g. in Arc., Boe. and 
NWGr. -oi. The Mycenaean spellings -a and -o are ambiguous between the PIE dat. and 
loc. in -ai, -oi, but in light of the developments in the other dialects it is generally assumed 
that they represent -ai, -oi.) The PIE loc. sg. -ai, -oi and -ei of these declensions survive 
only in adverbs (e.g. eicei ‘over there’) and isolated forms such as oikoi ‘at home’. In the 
third declension, however, the dialects of the first millennium continue not the PIE dat. 
in *-ey but the loc. in *-i. In Mycenaean we see a transitional stage, with forms spelt 
<-e> = dat. -ei and <-i> = loc. -i used interchangeably with both datival and locatival 
function. The instr. sg. of all three declensional classes would be spelt in an identical way 
with the corresponding dat. and loc. It is thus impossible to tell whether a separate instr. 
sg. still existed. 

In the plural no Greek dialect preserves forms of the PIE dat. in *-b h os (cf. Lat. -bus). 
Already in Mycenaean the endings of the loc. pi. -asi > -ahi, -oisi > -oihi, -si (remodel- 
lings of the PIE forms in *-asu, *-oysu, *-su) have replaced those of the dat. Mycenaean 
does, however, preserve a distinct instr. pi. with endings -a-pi = -ap h i, -o = -ois and 
-pi = -p h i. The -p h i endings of the a- and C-stems have disappeared from first-millennium 
Greek, except in Homer, where they have been reanalysed as generic oblique case mark- 
ers and extended into the singular and the o-stems. Although these are the only plural 
exponents of instrumental force in Mycenaean they can also have locatival and even pure 
datival force: Sp h agiamp''i (instr. pi.) occurs in parallel with Helehi (loc. sg.; both are 
place names), and k h itomp h i describes cloth ‘for khitons’. In Mycenaean, then, we seem 
to have a transitional stage where remnants of the morphology of dat. and loc. remain in 
free variation to express a syncretic dat.-loc., with which the instr. is also beginning to 
fall together. 

In Attic-Ionic and in Lesbian the o-stem dat. pi. -own simply continues the loc. pi. 
Attic had generalised a shortened form in the definite article, xotq (presumably a pre- 
vocalic sandhi variant) by the beginning of the fifth century, and the same ending was 
standardised in nouns and adjectives by 420 BC. Other dialects have -oi q from the ear- 
liest times, which is presumed to continue the instr. pi. In the a-stems early Attic and 
Ionic inscriptions have -qoi (also -dm in Attic), remodelled by analogy with -oigi to give 
-r|oi/-qoi (written -rpm, -dim), but in Attic these were replaced by -atm, and, by 420 BC, 
by the short form -aiq. (Lesbian has -aioi from the earliest times by analogy with -oim, 
and the other dialects have -aiq by analogy with -oiq.) 

Adjectives and adverbs 

Adjectives follow the nominal declension. The most common type has forms of the sec- 
ond declension for the masculine and neuter and of the first for the feminine, e.g. ootpoq, 
aocpf), ooipov ‘wise’, and (plAroq, (piMd, cplXiov ‘friendly’. Some, have no separate femi- 
nine forms (e.g. dOdvaxoq, -ov ‘immortal’). 

C-stem formations are most common in participles, e.g. pres. ptcp. ^eycov, gen. sg. 
XeyovToq ‘speaking’; the n. nom./acc. sg. is endingless Xeyov (< *legont), the pi. in -a 
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(Xeyovm), and the f. follows the first declension (Xeyovaa < *legontya). Similarly weak 
aor. ptcp. Xvaaq, Xvaaaa, Xvaav (m./n. stem Xva avx-), and perf. ^eAukcck;, XeXukuioi, 
A-stoKoq, the m./n. originally an 5-stem (Mycenaean preserves n. pi. forms in - oha ) but 
recharacterised as a C-stem, A.eA.ukox-. 

Although not a productive class, u-stem adjectives are well represented, following the 
declension of Jtr|xo<; in the m. and n., but with gen. sg. -eoq. The f. follows the first declen- 
sion and is built synchronically to a stem in -ei- (e.g. f]8u<;, f)8£ia, f)8i) ‘sweet’, m./n. gen. 
sg. ijSsoq). 5-stem adjectives (e.g. Euysvijq ‘well-born’) have stems in -as- with loss of the 
-s- between vowels, as in nouns like ScoKpairy;. They have no separate f. The n. nom./acc. 
sg. is endingless (evyeveq), and pi. in *-esa > -ea > -r| (suyEvfj). 

The adjective also has comparative and superlative degrees. There are two different 
methods of formation. In the first the suffixes -yon- or -ion- (comparative) and -loxoq 
(superlative) are added directly to the root (not the stem) of the adjective, originally in the 
e-grade: piyaq ‘big’ gives comparative pslijcov (-£i- by an internal development in Attic 
for psijcov of other dialects; -ij- < *-gy-) and superlative psyioxoq. ijSrx; ‘sweet’ gives com- 
parative f)81cov and superlative fjSioioq. aicxpo:; (root aia%-) gives comparative aioxicov 
and superlative aio%ioxoq. The original comparative suffix was *-yos- (cf. Lat. -ior), an 
5-stem attested in Myc. medzohes ‘bigger (m./f. nom. pi.)’, and preserved in the Attic 
alternative m./f. ace. sg. pEiijco < *-oha, nom. and ace. pi. pEkjoxx; < *-ohes, *-ohas, and 
n. nom./acc. pi. pslljco < *-oha. In the zero-grade it was compounded with an indefinite 
nominalising suffix -on- to give n-stem -iov-. Forms such as psiijcov are a conflation of the 
two. The superlative -icrcoq, a regular first/second declension adjective, is a compound of 
the same zero-grade *-is- and a definite nominalising suffix *-to-. 

The second formation adds -xspo<; (comparative) and -mzoq (superlative) to the (m./n.) 
stem. Both are first/second declension adjectives. This -xspo:; was originally contrastive 
in force rather than comparative, a meaning preserved in formations such as SsipxEpo:; 
‘right-hand’ (contrasting with cncaioq ‘left-hand’) and apiaiEpoq ‘left-hand’ (contrasting 
with Ss^ioq ‘right-hand’). In Mycenaean, wa-na-ka-te-iv = wanakteros means ‘of the royal 
(rather than of another) type’. The same suffix is also seen in the possessives f|p£X£po<; 
‘our’ and upsxEpo:; ‘your’. The superlative -mmq appears to be a contamination of the 
PIE absolutive suffix *-(t)mHo- (Lat. ultimus ) by -urco-. Examples of paradigmatic com- 
paratives and superlatives are SsivoxEpo:;, Ssivoxaxoq from 5eTvo<; ‘strange’, yA,uic6x£po<;, 
yA.uKuxaxoq from yA,uicu<; ‘sweet’, p£>,dvx£po<;, pE^avxaxoq from p£>,a<; ‘black’ (stem 
psA-av-). Stems in -o- lengthen the theme vowel if the preceding syllable is light, e.g. 
GoqxoxEpo:;, ooqxoxaxoq from coyoq ‘wise’ (instead of 'oo(pox£po<;, X oo(poxaxo<;). Met- 
analysis of formations such as d$.r|0£o-X£po<; ‘more true’ gives rise to -Eoxspoq and 
-Eoxaxoq (so EuSaipovEGXEpoq built to EuSalpcov ‘fortunate’), and of dxapio-x£po<; ‘more 
ungracious’ (< *ak h arid-tero-) to -ioxapoq, -ioxaxoq (so >,a>aox£po<; built to XaXoq ‘talk- 
ative’). Similarly, forms built regularly to adverbs such as wa^aixEpoq ‘older’ (to naXm 
Tong ago’) instead of the positive grade of the adjective (nakaioq ‘ancient’) cause some 
adjectives in -aioq to form comparatives and superlatives in -odx£po<; and -aixaxoq rather 
than expected -aioxEpoq, -aioxaxoq (e.g. ficruxaioq, f|cruxaix£po<;, f]cn>xodxaxo<; ‘quiet(er, 
-est)’), and conversely some forms in -oq to give -alxspo:;, -aimmq rather than expected 
-oxspo:;, -ommq (e.g. psoo:; ‘middle’ gives psoaixEpo:;, pEoaixaxoq). 

Besides these regular formations there are many irregularities caused by stem ablaut 
and other phonetic changes (;xoA,x><; ‘much’ gives jikeiaiv/jiXecov, jiXsiawq) and by supple- 
tion (ayadoq ‘good’ has both (3e>,x1cov, p/Axioxo:; and dpsivcov, dpiGxoq). 

Adverbs are regularly derived from adjectives by the addition of the suffix -coq to 
the (m./n.) stem, thus, e.g., (pRcoq ‘friendlily’ from (pRoq, mxscoq ‘swiftly’ from mypq; 
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but many are fossilised case forms (e.g. eksi ‘over there’ with o-stem loc. sg. ending). 
The comparative and superlative of the adverb are identical with the n. nom./acc. sg. 
comparative and pi. superlative of the corresponding adjective (e.g. ao<p&q ‘wisely’ has 
oocpcbxepov, aocpcbxaxa). 


Pronouns 

The definite article is in origin a pronoun built to the PIE demonstrative stems *so (m. 
nom. sg. is endingless, 6; f. f) < a, preserved outside Attic-Ionic) and *to (other forms; 
West Greek preserves m. and f. nom. pi. in xoi, xai, while the other dialects have oi, ai by 
analogy with the sg.). In Attic it is obligatory with nouns of definite reference, including 
personal names and abstracts, but its use as an article is evidently a late development. It is 
entirely absent from Mycenaean, and although in Homer it has uses which approach those 
of later Greek, it still functions as an independent pronoun with demonstrative, anaphoric, 
contrastive or relative force. The use as a relative (compare English that) survives in the 
dialects and in Attic verse. 

As a demonstrative the same pronoun survives in the composite ode, f]8e, xo8e, which 
has a strongly deictic force ‘this one (over here)’. Other dialects have similar forms with 
different deictic particles, e.g. Arc. (h)ovu. It contrasts with eKeivoq, £Ksivr|, eKeivo ‘that 
one (over there)’. The pronoun omoq, aoxr|, xouxo ‘this’ has more anaphoric than deictic 
force. 


TABLE 5.8 FORMS OF THE PERSONAL PRONOUNS (enclitic forms are bracketed) 




First person 

Second person 

Third person/reflexive 

Sg- 

nom. 

syd) 

on (also voc.) 

— 


acc. 

8(18 (pe) 

os (os) 

8(8) 


gen. 

spon (poo) 

oon (oon) 

oil (oi>) 


dat. 

spot (poi) 

ooT (ooi) 

oi (oi) 

du. 

nom./acc. 

vcb 

ocpcb 

(acpcos) 


gen./dat. 

vcpv 

otpcpv 

(ocpco'iv) 

pi. 

nom. 

ripeic; 

npeiq (also voc.) 

acpsii; 


acc. 

fipaq 

npaq 

acpscu;, acpon; (acpsai;) 


gen. 

ripcdv 

npd)v 

ocp&ov, acpcbv (acpscov) 


dat. 

f|p!v 

nptv 

acpiai (ocpioi) 


The forms of the first and second person pronouns are shown in Table 5.8. The singular 
has both emphatic accented and unemphatic enclitic forms. The second person nom. sg. 
should be *tu (cf. Lat. tu), and this is the form preserved in WGr. xu. The oblique forms 
are *twe etc., which give Gr. ai etc.; and the other dialects have levelled the a- to the 
nom. There is in Attic no third person pronoun per se. For the oblique cases the pro- 
noun amoc,, otuxfj, auxo is pressed into service, but this is properly an intensive pronoun 
(corresponding to Lat. ipse), and its nom. cannot be used as a third person pronoun. In 
the combination 6 axnoq it means ‘the same’. In Homer the pronoun £ is sometimes 
reflexive, more often anaphoric (when it is always enclitic). In the reflexive use it is often 
strengthened with the appropriate case of axnoq. In Attic s- has become an indeclinable 
prefix conjoined to amoq to fonn the standard reflexive pronoun sauxov, eauxijv, sauxo 



GREEK 305 


‘himself’; so also eg,- and os- in epauxov and oeauxov ‘myself’ and ‘yourself’. The only 
forms of £ which are regularly used in Attic prose are the dat. sg. ot and pi. oipict, which 
are used as indirect reflexives (to refer in a subordinate clause to the subject of the matrix 
clause). Mycenaean uses the dat. sp h ehi (= oipici) as an anaphoric pronoun in the phrase 
meta-k"'e sp h ehi ‘and with them’. The enclitic ace. sg. ptv ‘him/her’, seen in Homer, is 
also attested in Mycenaean in the phrase damos de min p''asi . . . hekt’ehen ‘but the damos 
says that she has . . .’. 

Of the PIE pronominal stems *k w i- and *yo- both survive into Greek. The latter is 
the basis of the relative pronoun oq, fj, 6. The fonner is remodelled as an n-stem %{q, n. 
nom. sg. xi, gen. sg. xivoq, and becomes the standard interrogative pronoun ‘who?’ In an 
enclitic form it serves as the indefinite pronoun. Compounded with the relative pronoun 
oq, it gives the indefinite relative and indirect interrogative ogtic;, r\nq, oil. 

Verbal morphology 

The verb expresses the categories of person (first, second, third), number, time reference 
(past, present, future), aspect (durative, punctual, stative), mood (indicative, subjunc- 
tive, optative, imperative) and voice (active, middle, passive). Non-finite forms include 
infinitives and participles. In the indicative, time reference and aspect combine to give 
the seven tenses of traditional grammars, present, future, imperfect (past durative), aorist 
(past punctual), perfect (originally present stative), pluperfect (originally past stative) and 
future perfect. 

The difference between durative and punctual aspect is encoded in different stems, 
traditionally labelled “present” and “aorist”. In some verbs the present stem seems to be 
basic, and the aorist stem is derived, usually by the addition of -s- (e.g. A,u-co ‘loose’, aor. 
e-Xv-a-a). In others the aorist stem seems basic, and the present stem is derived by one 
of several means including ablaut (aor. 6-Xm-ov, pres. /.eut-co ‘leave’); suffixation with 
-gkx/o-, originally an iterative/durative suffix (e.g. aor. £-Jia0-ov, pres, Jtacxco < *Jta0- 
gkq) ‘suffer’); reduplication (e.g. aor. e-cxp-v, pres. t-Gir|-pi < *si-sta-mi); or addition 
of a nasal suffix (e.g. aor. e-xep-ov, pres. lep-v-co ‘cut’; aor. ppapi-ov, pres, apapx-dv-co 
‘err’) and/or infix (e.g. aor. 8-pa0-ov, pres, pa-v-0-av-co Team’). The difference between 
the two types is sometimes explained as motivated by the inherent atelicity (underived 
present stem) or telicity (underived aorist stem) of the verbal action, but the correlation is 
not perfect. In yet other verbs both stems are derived (e.g. pres. ai)^-dv-co ‘increase’, aor. 
lyu^-riG-a). Especially common is the large class where the present stem is derived (often 
from a nominal stem) by the suffix *-ye/o-, the aorist stem with the *-s- suffix (e.g. pres. 
(pu^dxxco/cpu^doGco ‘guard’ with -it -/-go- < *-ky-, aor. eyvXaiqa = E-(pvXaK-a-a; pres. 
kojttco ‘strike’ with -jit- < *-py-, aor. 8Kov|/a = e-KOJi-c-a; pres. ayyeXXco ‘announce’ with 
-XX- < *-ly-, aor. T]yy£i/.a < *angel-s-a). There are many verbs where the different stems 
are provided by suppletion (e.g. pres, (pepco ‘bear’, aor. ryveyKa; pres, opaco ‘see’, aor. 
eiSov), and others where the operation of sound change on a regularly derived paradigm 
creates the appearance of suppletion (e.g. aor. 8-po>,-ov, pres. P/ubokco < *mlh 3 -ske/o-). 

The distinction between present and past time reference is encoded partially in the 
endings (which also encode for person, number and voice) and partially in the presence 
or absence of the augment. In the dialects of the first millennium the augment 8- is a com- 
pulsory marker of past time reference prefixed to the stem of indicatives. (In roots starting 
with a vowel it surfaces as lengthening of the initial vowel, and is called the temporal, 
as opposed to syllabic, augment.) In Homer, as in the Rg Veda, the augment is optional, 
being present or absent at the convenience of the poet. Absence of the augment was 
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formerly held to be a poetic feature. Its almost total absence from Mycenaean, however - 
there are one or two possible examples - can scarcely be so explained. 


TABLE 5.9 THEMATIC AND ATHEMATIC PRIMARY AND SECONDARY ENDINGS OF THE 
ACTIVE VOICE 


Thematic Athematic 




Aiko ‘loose’ 

xi0r|pi ‘place’ 

dpi ‘be’ (irregular) 



Primary 

(present) 

Historic 

(imperfect) 

Primary 

(present) 

Historic 

(imperfect) 

Primary 

(present) 

Historic 

(imperfect) 

sg. 

1 

Aii-co 

e-Ad-ov 

xl-0r|-pi 

E-xl-0r|-v 

dpi 

T T 

T], T|V 


2 

Ao-su; 

e-Ad-ei; 

xi-Qri-q 

e-xx- 0 r )-<5 

T 

81 

f]o0a 


3 

Ali-Sl 

e-Au-e 

xi-0r|-(n 

E-xl-0r| 

8G-TI 

T 

T]V 

du. 

2 

A0-SXOV 

b-Ao-exov 

T1-0E-XOV 

E-XI-0E-XOV 

8O-T0V 

f]oxov 


3 

A0-EXOV 

E-Au-Expv 

XI-0E-XOV 

E-xi-0E-xr]v 

8G-TOV 

f]OTT|V 

pi. 

1 

Au-opEV 

E-Ali-OpEV 

TI-0E-PEV 

£-xi-0s-pev 

8<7-pSV 

fjpsv 


2 

Au-exe 

E-A6-STE 

XI-0S-XE 

E-XI-0E-XE 

8G-TS 

rjxs 


3 

Ail-own 

e-Au-ov 

TI-0E-CIOI 

E-xl-0£-oav 

eioi 

rjoav 


In the indicative the present stem combines with the so-called primary endings to fonn 
the present tense and with the secondary or historic endings (and augment) to form the 
imperfect tense (past durative) - see Table 5.9. Two different formations must be distin- 
guished, one thematic (presents in -co), one athematic (presents in -pi). The latter are no 
longer productive in Greek, and tended to be replaced by thematic formations. 

From the derivational point of view the historic endings are in fact basic, and several 
of the primary forms are transparently derived from them by the addition of the *-i “hie 
et nunc” deictic marker. The 1 sg. historic -v goes back to *-m, from which primary -pi 
is derived. In thematic -ov the -o- is the thematic vowel. The thematic ending -co is from 
unrelated *-oh 2 , in which the -o- is again the thematic vowel. 

The 2 sg. historic -(e)^ is original (-£- is the thematic vowel). The expected primary 
endings would be in *-si, where the -s- would be lost (via [h]) between vowels. The 
resulting forms *luei and *tit h ei had the -s- restored by analogy with the secondary end- 
ings: in the thematic class it is added to the end to give Xveic,, while the athematic class 
borrows the entire secondary ending. 

In the 3 sg. the athematic secondary ending was *-t, which was lost at the word-end. 
Athematic -oi derives from *-ti by the normal assibilation *-ti > -si in East Greek dialects. 
The original -xi is preserved in scrci and in West Greek (xi0r|xi etc.). Thematic -El cannot 
go back to *-eti, which would remain in West Greek and give ’'-ecu in East Greek. The 
standard explanation of -£i is that it results from the four-part analogy eA-de^ : eWmiq : : eA.de 
: X, X = eA-dei. Sihler (1995: 462) instead endorses the view that the thematic historic 3 
sg. was endingless *-e, and that -ei derives straightforwardly by the addition of /-deictic. 

Athematic root presents like xi0r|pi show stem ablaut with the full grade (0r|- < *d h eh[-) 
in the sg. and the zero-grade (0 e- < *d h h r ) in the pi. In the verb ‘be’ the sg. es- goes back 
to *h[es-, the pi. es- presumably to *h,s- (cf. Ved. santi). The athematic secondary 3 pi. 
ending was originally *-nt. The imperfect of ‘be’ *e-h[S-ent > *ehen, whence WGr. f]v, 
a fonn which was replaced in Attic-Ionic by fjoav with an ending from the weak aorist. 
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Original rjv was redeployed in Attic as a 3 sg. Outside Attic-Ionic the other athematic 
verbs have historic 3 pi. -v < *nt (imperfect exi0£V etc.). Attic-Ionic has made the same 
replacement as in f|oav. Thematic primary -ouat shows the expected East Greek devel- 
opment from *-o-nti (with -o- as the thematic vowel); West Greek preserves -ovxt. Athe- 
matic -dot is peculiar to Attic. West Greek has xi0evxt, StSovxi, taxavxt etc. 

In verbs with stems ending in -£-, -a- and -o- in Attic and in some other dialects the 
stem vowel and ending coalesce in a process known as contraction, e.g. woteco > 7toico, 
opaopev > opropev, 8r|X6et > Sr|AoT. In other dialects the vowels remain uncontracted. In 
yet others - particularly Aeolic - these conjugate as athematics. 


TABLE 5.10 ENDINGS OF THE STRONG AND WEAK AORIST 




Strong aorist 

718(0(0 ‘persuade’ 

Weak aorist 
Mod ‘loose’ 

Sg- 

1 

E-JI10-OV 

s-Au-a-a 


2 

E-7ti0-s<; 

e-h)-a-aq 


3 

E-JI10-E 

e-Au-ct-s 

du. 

2 

E-7110-ETOV 

E-Aii-a-axov 


3 

E-7110-ETTJV 

E-Au-o-dxr|v 

pi. 

1 

E-7li0-Op£V 

s-Au-a-apsv 


2 

E-JI10-STE 

E-Au-a-axs 


3 

E-7110-OV 

s-Au-a-av 


The aorist stem forms only one indicative, the aorist tense (Table 5.10), a punctual 
preterite which corresponds in meaning not only to English ‘I did’ but also ‘I have done’ 
and ‘I had done’. There are two main formations, known as the strong (or second) and 
weak (or first) aorists. Most verbs follow one or the other formation, but some have both 
in different senses (e.g. i'oxr|gi ‘I place’ has a transitive weak aorist eaxqaa ‘I placed’ but 
intransitive strong aorist eaxr|v ‘I stood’), and others follow the weak formation in the sg. 
and the strong in the du. and pi. (e.g. x(0r|pi has £0r|Ka but efiepev; the 3 pi. is either weak 
80r|Kav or strong efieoav). The strong aorist uses the thematic secondary endings; it there- 
fore differs from the imperfect only in the form of the stem. The weak aorist continues the 
PIE *-s- aorist, which was originally athematic but has been remodelled in Greek so that 
it effectively has a theme vowel -a. Additionally, there are root aorists of the type epA,r|v 
‘threw’ < *e-g w leh,-; these take the athematic secondary endings. 

PIE had no future tense. The Greek future derives from the PIE desiderative suffix 
*-h,s-, which was added directly to the root and originally formed athematics, but has 
come to take the thematic primary endings. After stems ending in a stop the laryngeal 
seems to have been lost, giving rise to forms such as A.8lv|/oo ‘I will leave’ = A.eui-a-ro. 
This is the regular formation in Attic. After stems ending in a resonant, however, the 
laryngeal was not lost, giving a suffix *-es- whose -s- was invariably intervocalic and 
therefore lost (via [h] ). This gives rise to the so-called liquid or contracted futures such 
as (3aA.ro = (3aA.ero from (3dA-A.ro (root (3aA.-). The so-called Attic futures are an analogical 
extension of the liquid type to stems in which they do not properly belong and which in 
most dialects form sigmatic futures. Despite the name they are not, in fact, restricted to 
Attic. The so-called Doric future (e.g. kA.sv|/£co from KAejrxro ‘steal’) is a blend of both 
types which has become standard in West Greek. 
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In addition to the active, Greek inherited from PIE a middle voice, originally denoting 
an action in which the agent was particularly involved, either reflexively (kova ‘wash’ 
is transitive; Xouopai is ‘I wash myself’) or with specialised meaning: so, for example, 
while Auco means ‘I loose’, A-uopon means ‘I ransom’; and while (pepco means ‘I carry’, 
(pepopai means ‘I win (i.e. carry off) a prize’. Many verbs have middle rather than active 
forms of the future, perhaps reflecting the original desiderative function of the future 
suffix; so, for example, eaopai from the verb ‘be’. Some verbs, said to be deponent, have 
only middle forms, often for reasons which are no longer semantically transparent (so, 
for example, OTopai ‘follow’). The middle came, however, to be the exponent of passive 
voice. The forms of the middle are given in Table 5.11; the future middle uses the the- 
matic primary endings on the future stem. 


TABLE 5.11 ENDINGS OF THE MIDDLE VOICE 


Present stem 


Aorist 


Thematic 

Athematic 

Strong 

Weak 

Aura ‘loose’ 

xl0r|pi ‘place’ 



Primary Historic 

(present) (imperfect) 

Primary Historic 

(present) (imperfect) 




sg. 

1 

Au-opcu 

s-Ao-op ryv 

xl-0E-pai 

E-xi-0E-pr|v 

£-jn0-6priv 

E-A,u-a-apr|v 


2 

Ao-p 

e-Ao-oo 

ti'-Oe-ctui 

E-XI-0E-C7O 

£-7110-00 

E-Xo-a-m 


3 

Ao-EXOU 

E-Au-EXO 

xi-0E-xai 

E-XI-0E-XO 

E-7I10-EXO 

s-Ao-a-axo 

du. 

2 

Au-£a0ov 

£-Au-£O0OV 

xi-0£-a0ov 

B-xl-0E-a0ov 

E-7X10-EO0OV 

e-Au-o- 

ao0ov 


3 

Au-e<70ov 

E-3.D-£O0r|V 

XI-0S-O0OV 

E-xi-0E-a0r|v 

E-m0-£a0riv 

e-Ao-o- 

ao0r|v 

pi. 

1 

Au-6|is0a 

£-A.D-6p£0a 

xi-0E-|xs0a 

E-Xl-0£-p£0a 

£-7n0-o|iE0a 

e-Ao-o- 

apsOaa 


2 

Au-SCT0£ 

s-Au-sa0s 

XI-0E-CT0E 

E-xl-0E-a0£ 

E-J110-EO0E 

E-Au-a-aa0s 


3 

Au-OVXCU 

E-Au-OVXO 

xl-0E-vxai 

E-X1-0E-VXO 

E-7I10-OVXO 

s-Ao-o-avxo 


The 3 sg. historic form -to is original. The derived primary ending ought to be -toi, 
which is in fact preserved in Arcado-Cypriot and in Mycenaean, -ion has been influenced 
by -pon and -oai. The same holds for the 3 pi. -vio, -viai. 

In early Greek the aorist middle is used with passive sense, but two distinctively 
passive aorist formations arose within the history of the language, dubbed strong and 
weak. The strong aorist passive conjugates like the (active) root aorists such as £(3Xr|v, 
with -r|- throughout and athematic secondary endings. Indeed, it probably originates 
in aorists of this type, many of which are intransitive. In Homer, the vast majority of 
strong aorist passives are in fact intransitives, e.g. e/app ‘I rejoiced’. The weak aorist 
passive is formed with a suffix -0r|- (of unknown origin) to which the athematic historic 
endings are added. It thus conjugates in an identical fashion to the strong aorist pas- 
sive - the difference is simply in the stem. Originally 0r|- was added to the zero-grade 
of the stem (e.g. xelvco forms £idOr|), later to the weak aorist stem without - or even 
with - the -a- marker. In Homer the -0r|- aorist passive is frequently used interchange- 
ably with the corresponding middle. 
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From the new aorist passives arose also two corresponding future passives. The stem 
is in -r|- (“strong”) or -0r|- (“weak”) followed by the future marker -a- with primary 
middle endings. The -Opaopou type is completely absent from Homer, the -paopat type 
attested in two forms only, piyf|aea0at ( Iliad 10.365) and dapaeai (Odyssey 3.187 and 
19.325), suggesting that both are rather late formations. Supporting this view is the use 
in Mycenaean of a middle future participle with passive sense in the phrase aleip h atei 
dzes(s)omendi ‘for an unguent to be boiled' . 

The subjunctive mood is formed in thematic stems by lengthening the vowel of the 
ending. Thus subjunctive Mrr|, T-ucopev vs indicative W>ei, T-uopev. In athematic forma- 
tions the vowel e/o was originally inserted between the stem and the ending; many of 
these “short-vowel” subjunctives survive in Homer and some dialects (e.g. subjunctive 
iiopev vs indicative ipsv), but the lengthened thematic forms quickly replaced them. On 
the use of the subjunctive, see under “Syntax” below. 

The optative mood was formed in PIE in athematic stems by the addition of the suffix 
*-yeh,-/*-ih r to the zero-grade of the root, the suffix appearing in the e-grade in the sg. 
and the zero-grade in the du. and pi. This is exactly what happens in Greek: from the 
verb ‘be’ 1 sg. £ir|v < ’"tqs-yetq-m, 1 pi. eipev < *h,s-ih 1 -men. Similarly, in verbs such as 
xl0r|pi, 1 sg. Ti0dr|v < *d h i-d h h 1 -yeh 1 -m, 1 pi. xtOetpev < *d h i-d h h 1 -ih 1 -men, although one 
would expect the intervocalic *-y- to have been lost in the fonner, and, perhaps, the out- 
come of the latter to have been ’'xtGipev; evidently some analogical rebuilding has taken 
place. Note that the endings are the athematic secondary ones (with original 3 pi. -ev < 
*-ent preserved even in Attic, rather than the replacement -a av). 

In thematic formations Greek uses a suffix -ot-, which is perhaps the thematic vowel 
followed by the zero-grade of *-yeh r (the laryngeal having disappeared, except in 
Slavic). The endings are once again the athematic secondary ones, except in the 1 sg., 
which has been replaced by -otpt, containing the athematic primary ending. 

It is important to note that outside the indicative and participles (and some uses of the 
infinitive) the difference between so-called present and aorist formations is not one of 
time reference but of aspect. The “present subjunctive” is not a present-time subjunc- 
tive, nor does the “aorist subjunctive” refer to past time. Both indeed can refer to future 
events. “Aorist” forms, built to the aorist or punctual stem, refer to simple events, while 
“present” forms, built to the present or durative stem, refer to ongoing or repeated events. 

As well as the eventive formations discussed so far, PIE possessed a stative, which is 
the origin of the Greek formation known as the perfect tense. Originally it denotes a state 
in which the subject finds itself - characteristic examples are oi8a ‘I know’, X£0vr|K£ ‘he 
is dead’ (vs eventive OvtjoKei ‘he dies’ and (oOTjeOavs ‘he died’), eoxqKa ‘I am standing’ 
(vs taxapat ‘I (go and) stand (somewhere)’). As such the stative originally stood outside 
the tense and mood system, and in Mycenaean we still find forms such as ^efuk^woha, 
perf. ptcp. of xeu/co ‘make’, with what from a later Greek perspective is active morphol- 
ogy but passive sense, ‘finished off, completed’; in fact, the meaning is ‘in a state of 
completion’. Similarly, in Homer SeSpe means ‘is ablaze’, from 8aico ‘kindle’. Even by 
the time of Mycenaean, however, middle-passive forms of the perfect have been created, 
e.g. dedemena ‘bound’, perf. of Seco, indicating that the perfect has begun to align itself 
with the voice system of the eventives. Semantically a present state can often be viewed 
as the result of a completed action in the past - ‘he died and therefore is dead' - and 
so the perfect came to denote past action with continued relevance in the present time, 
‘he has died' . In this sense it is properly speaking an eventive present perfect and has 
become aligned to the tense system. It is therefore no surprise that corresponding past 
and future forms were created, the pluperfect £X£0vf)K£i ‘he had died’ and future perfect 
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XcXuoopai ‘I shall have ransomed/I shall have been released’ (these forms are almost all 
middle -passive, xeOvf^ei ‘he will be dead/have died’ being a rare exception). In the Koine 
the perfect became interchangeable with the aorist as a simple past tense ‘I did’ (compare 
the development of the perfect in Latin, and the renewed periphrastic perfect in French). 

TABLE 5.12 FORMS OF THE PERFECT AND PLUPERFECT IN CLASSICAL ATTIC 

oTSa ‘know’ h'iO) ‘loose’ 


Perfect Pluperfect Perfect Pluperfect 






Active 

Middle 

Active 

Middle 

sg- 

1 

oi5a 

iiSn 

A.e-A,D-K-a 

A,8-Ad-(j.<xi 

8-Xe-A.i3-K-T| 

s-A£-Aa3-|j.tiv 


2 

olaGa 

fiSrioGa 

Xe-Xv-K-ac, 

A,8-Ad-o<xi 

8-A.8-Aa3-k-t|(; 

S-A.8-A.D-O0 


3 

Ol8s 

f\bzi 

Xe-Xv-k-e 

A,s-A.o-xai 

8-At:-At3-k-81 

8-A.S-A.D-T0 

du. 

2 

lOTOV 

rjoxov 

Xs-Xv-K-axov 

A.8-A.D-O0OV 

8-A,8-Aa3-K-81TOV 

8-A.S-A.D-O0OV 


3 

lOTOV 

fjOTT|V 

Ae-Ao-K-axov 

A.8-AD-O0OV 

8-A,8-XD-K-8lTr|V 

S-A,8-A,D-O0T|V 

pi. 

1 

IOJI8V 

fjajxsv 

A,8-Ad-k-(X|I8v 

A.8-A,o-|j.e0a 

8-Xe-A,l3-K-8l|I8V 

s-A.s-A.D-jis0a 


2 

10X8 

rjoxe 

A,8-A,o-K-axe 

A.8-A.D-O08 

S-A,8-A,l3-K-81XS 

S-As-A.D-008 


3 

loaoi 

fiSeoav, rjaav 

Xs-Xv-K-dci 

A-s-Ad-vtoii 

8-X8-A-i3-K-e(i)aav 

8-A.8-A.D-VT0 


Although it lacks the reduplication characteristic of the PIE stative, oi8a ‘I know’ most 
closely represents the original with o-grade of the root in the sg., zero-grade in the du. and 
pi. (Table 5.12). In the forms which are productive in Greek, however, the endings have 
been heavily remodelled. Reduplication remained the hallmark of the perfect system. 
The so-called strong perfect usually has the root in the o-grade and no special suffix (e.g. 
XeXoma, perf. of Leuico ‘leave’). The “weak” or kappatic perfect adds -k- to the present 
stem (e.g. LeLuKa, perf. of Xvco); the origin of this formation is obscure, but it is already 
represented in Homer in verbs with stems in long vowels. The middle is an internal Greek 
development formed from the reduplicated stem (usually zero-grade) with primary athe- 
matic middle endings. 

The pluperfect, again, is a Greek innovation, but it may have originated at a time when 
the perfect had a purely stative function to express a state in past time. The stem is the 
perfect stem with augment. The endings, apparently, are the perfect endings prefixed with 
-e-, but the details are obscure. 

The perfect stem, like the present and aorist stems, forms subjunctives and optatives, 
albeit rarely. They may be internal Greek developments (certainly perfect-stem opta- 
tives are vanishingly rare in Vedic). The active forms are built on the same model as the 
thematic subjunctives and optatives (LsLukco, LsLuKoipi); the passives are periphrastic 
(LeLupevoq &, LeLupevoq evq). 

Greek has a fully developed system of infinitives and participles, with present, future, 
aorist and perfect stems giving both active and middle forms. Additionally, the future and 
aorist have distinctively passive forms. In the accusative and infinitive indirect statement 
construction present and aorist infinitives have present and past time reference respec- 
tively, while in other constructions the difference is one of aspect only. Participles encode 
time relative to the action of the main verb (aorist for anterior, present for contemporary, 
future for posterior action). 
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SYNTAX 
Word order 

Having a richly articulated inflexional morphology Greek has relatively free word order. 
Nonetheless, in classical Attic prose we can detect a preference for SOV order (Dover 
1960: 25), although this is often disrupted for pragmatic reasons (Dik 1995). 

In Mycenaean we have in main clauses examples of both SOV (krikewiai onaton 
hek h onsi ‘the barley-women have an onaton (a type of lease)’) and SVO ( Kokalos ape- 
doke elaiwon tosson Ehumedehi ‘Kokalos gave so much olive oil to Eumedes’); and in 
subordinate clauses of SVO ( hote wanaks f'eke Augewan damo-ko-ro ‘when the king 
appointed Augewas as damo-ko-ro''). 

Various words including sentence-connective particles and atonic, clitic pronouns 
originally occupied the “Wackernagel’s Law” position immediately after the first 
constituent of the sentence. The interplay between the requirements of SV order and 
Wackemagel’s Law frequently meant that clitic pronouns were often either widely 
separated from their governing verb or placed following the verb but out of second 
position. In the Hellenistic period the tension was resolved by putting the verb in first 
position so that clitic pronouns naturally followed in second position, and this led to a 
development of VS word order even in those sentences which lacked clitics (Horrocks 
1990; 2010: 108-109). 


Mood, complementation and subordination 

The subjunctive seems originally to have been used to refer to future events, while the 
optative expressed the category irrealis. Traces of these functions are preserved in the 
use of the subjunctive in the protasis of future open conditions (eav tocutoi Jioifjq, KaX&q 
Jioirjoeic; ‘if you do this you will do well’) and of the optative in the protasis and apodosis 
of future remote conditions (si tocutoi Jtoiolr|<;, Ka Mac, av wotolr|q ‘if you were to do this, 
you would do well’) and in wishes for the future (dOs eMlot ‘would that he would come’). 
Over time, however, the subjunctive encroached on the modal uses of the optative, which 
became increasingly marginalised. In the Koine it survives only in fossilised expressions 
such as pf| yevoiro ‘God forbid!’. 

Greek has a range of methods for embedding clauses as the complements of verbs, 
both finite and non-finite. Finite declarative and interrogative complements (i.e. expres- 
sions of indirect statement and question) are introduced by an overt complementiser (on, 
&q both ‘that’, ei ‘whether’ etc.) or an interrogative, and ordinarily have their verb in the 
indicative. Unlike English, Greek has no rule of sequence of tenses: the verb in the com- 
plement clause takes the tense of the verb in the equivalent direct statement or question: 
so 5ifjA.0e Xoyoq on dtccucei amove, Kupo<; (Xenophon Anabasis 1.4.7) ‘a report spread 
that Cyrus was pursuing them’, lit. ‘is pursuing’ because the report was ‘Cyrus is pursu- 
ing them’; but after a past tense matrix verb (traditionally called historic sequence ) the 
verb in the subordinate clause may become optative, retaining, so far as possible, the 
tense of the direct form (in violation of the normal principle that the optative does not 
express time reference but only aspect): so syvcoaav on Kevoq 6 (po(3oq ejq (Xenophon 
Anabasis 2.2.21) ‘they realised that their fear was (lit. is) groundless’ with err] for eon. 
Thus, Greek is often said to have a rule of sequence of mood rather than of tense. This 
sequence-of-mood rule was optional, however, and is abandoned in the Koine. The neg- 
ative in such declarative clauses is ou. 
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Finite clauses can also be used for non-declarative complements. Verbs of effort, 
such as £7upe^ox>pat ‘take care that’, are complemented by clauses introduced by oncoq 
with the future indicative. The negative is pifj. By the same sequence -of-mood rule men- 
tioned above the indicative may, in historic sequence, be replaced by the future optative: 
E7npE>.ougr|v ojtcck ; (pi)) xocuxa ;toii)a£i (notoiq) ‘I took care that he should (not) do this’. 
Verbs of fear for the future are followed by pi) (negative pf| on) with the subjunctive 
(optionally the optative in historic sequence): (poPoupou |tq (on) yevqxat ‘I fear it will 
(not) happen’. 

Non-finite complements use either an infinitive or a participle. Declarative infinitive 
complements follow verbs of thinking and some verbs of saying. Such infinitives express 
tense as well as aspect - the present infinitive standing for a present or imperfect indic- 
ative, future infinitive for future indicative, aorist for aorist, perfect for perfect - and the 
negative is ox>. If the subjects of the infinitive and of the matrix verb are co-referential, 
the subject of the infinitive is left covert and anything in agreement with it is nomina- 
tive. If the two subjects are not co-referential the subject of the infinitive is overt and 
accusative. (Note that this is a substantial difference from Latin, where the subject of an 
accusative and infinitive construction is compulsorily expressed except when the matrix 
verb is passive.) 

Non-declarative infinitives are used after verbs of wishing, commanding, ability etc. 
Most of these are control structures (subject control in (eycb;) Pou^opat [PRO; xocuxa 
Jioietv] ‘I want to do this’, object control in Ke^euco auxoVj [PROj xauxa notstv] ‘I order 
him to do this’). Such infinitives express aspect but not tense (the present is used for 
ongoing or repeated actions, the aorist for simple actions; the perfect is rare, and the 
future not permitted). 

Participles, similarly, may be used in declarative and non-declarative complements. 
Declarative participial complements are marked for factivity, and as such follow verbs 
of perception and knowledge. The verb cpaivopat ‘seem, appear’ selects either a factive 
participle complement (cpaivopai cov = ‘it is obvious that I am’) or a non-factive infinitive 
complement (cpaivopat stvat = ‘I appear to be’, a raising structure). 

Adjectival and adverbial subordinate clauses follow the same range of strategies. Rel- 
ative and temporal clauses are introduced by relative pronouns and temporal conjunc- 
tions respectively (oq ‘who’, oxe ‘when’ etc.) and usually have indicatives. Indefiniteness 
(‘whoever’, ‘whenever’) is expressed by replacing the indicative with the subjunctive 
along with the modal particle civ (optionally in historic sequence by the optative without 
oiv). Temporal clauses referring to the future are treated usually as indefinite and behave 
the same way. Causal clauses introduced by conjunctions such as Sioxi ‘because’ also 
have their verb in the indicative; but if the speaker wishes not to vouch for the veracity 
of the reason expressed, the clause becomes virtually one of reported speech and may, in 
historic sequence, take the optative. 

An important class of finite adverbial clause is that introduced by tva, &q or onaq with 
a verb in the subjunctive (optionally the optative in historic sequence), used to express 
purpose or intention. 

Infinitives are rarer in non-complement subordinate clauses than in complements. 
When the temporal conjunction npiv means ‘before’ rather than ‘until’ (so always in Attic 
after verbs in the affirmative) it governs an infinitive. Result clauses introduced by d>oxe 
have an indicative if the consequence actually did (or did not, when the negative is on) 
follow, and an infinitive if the intention was that the consequence should (or should not, 
pi)) follow. 
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Participles, on the other hand, are abundant and are able to substitute for a wide range 
of adjectival and adverbial finite clauses, including relative and temporal clauses and 
expressions of manner, either with or without introductory particles such as axs or &q, both 
‘because’, Kaurep ‘although’. Where the subject of such a participle is not co-referential with 
another constituent of the sentence it defaults to the genitive case, a construction known as 
the genitive absolute. See Ruppel 2013 for a discussion of this construction and its origins. 

It will be seen that other than in future remote conditions and in wishes the optative had 
become an optional substitute for the subjunctive or indicative in certain constructions. 
It is no surprise, then, that it was lost in the Koine. Other patterns of complementation 
were also simplified. The complementary participle lost ground to the accusative and 
infinitive construction after verbs of perception and knowledge; and the declarative accu- 
sative and infinitive itself lost ground to finite complements introduced by oil and &q. In 
control structures the infinitive also had a tendency to be replaced by finite subjunctive 
complements introduced by iva. This in turn lost its force ‘in order that’ and became 
grammaticalised first as a generic complementiser for subjunctive clauses and then, in a 
phonetically reduced form va, as a marker of the subjunctive itself. 

Prepositions 

The class of words known to traditional grammars as prepositions, denoting such notions 
as spatial relations and selecting for nominal arguments in particular grammatical cases, 
originate as independent adverbials. Spatial relations were, at an early stage of the lan- 
guage, encoded solely in case morphology - acc. for ‘motion towards’ (allatival sense), 
gen. for ‘motion away from’ (ablatival sense) and loc., later dat., for ‘place where’ (loc- 
atival sense). Traces of this system are preserved in Mycenaean, which uses bare forms 
with dat./loc./instr. morphology to express locatival sense (dat.-loc. Sp''agiansi, instr. 
Sp h agiamp h i, both ‘at Sphagianes’), as well as in poetry. Adverbials were used to fur- 
ther specify or nuance the precise relation involved, e.g. Jiapa, which in one set of uses 
implies the presence of some person, so Jiapa xtva ‘into someone’s presence’, Jiapa xtvoq 
‘away from someone’s presence’, Jiapa xtvt ‘in someone’s presence’. 

Where these adverbials occurred immediately before the verb they became regarded as 
prefixes or preverbs. Their original autonomy is attested by the phenomenon of tmesis in 
Homer, where what from a classical perspective is a prefix appears to have become sepa- 
rated from its verb. This must reflect a very ancient stage indeed, as there are no examples 
of tmesis in Mycenaean. 

Where the adverbial occurred immediately before a noun, however, it is easy to see 
how it could become regarded as a head governing a case form, i.e. a preposition. Some, 
where the sense was appropriate, came to govern all three local cases; others only two 
(e.g. sv + dat. = ‘in (locative)’, + acc. = ‘into (allative)’ - although in this latter sense it 
was recharacterised in Attic-Ionic and West Greek as ev-q, whence eiq); and yet others 
only one (e.g. and, ‘away from’ with gen. only). 


Word formation 

An old pattern of derivation has the e-grade in verbs and the o-grade in derived nouns. 
When accented on the root they are often (result of) action nouns, on the suffix often 
agents: e.g. Xeyco ‘speak’, Xoyoq ‘speech’; xpecpco ‘nourish’, xpoipo:; ‘nurse’. Many have 
an -a suffix, e.g. xpoipf) ‘nourishment’. 
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There is a rich derivational morphology. Buck (1955a: 315-348), although rather 
old-fashioned, collects a catalogue of morphs. Of particular note, the PIE agent noun 
suffix *-ter-/-tor- tends in Attic to be replaced by (e.g. ccoifip ‘saviour’, prjicop 
‘speaker’, but Att. Kpftr|q ‘judge’ = dialectal Kpixrjp). The -euq suffix is very common in 
occupational terms (irateix; ‘horseman’ etc.). From PIE *-yo- and *-yeh 2 , Gr. -iov forms 
neuter diminutives (natSlov ‘small child’) and -ia feminine abstracts (croquet ‘wisdom’). 
Action nouns are derived with *-tis, becoming -oiq (mostly abstract, e.g. Ttioriq ‘trust’) 
and -par- (tending to become concrete, 7ipaypa ‘thing’ from irpaiTco ‘make, do’). The PIE 
adjectival suffix *-yos forms adjectives of a very general type (n^ouatoq ‘wealthy’ from 
7tA.ouToq ‘wealth’), and becomes a standard way of forming patronymics in some dialects. 
Its e-grade *-eyo- forms adjectives of material (xpuasoq, Att. xpherouq ‘gold’). The suffix 
-IKO-, although as yet unattested in Mycenaean and rare in Homer, has become the most 
productive formant of general (mostly denominal) adjectives in Attic prose. 

Greek also forms compounds extremely freely. An extensive survey is provided by 
MeiBner and Tribulato (2002), to which the reader is referred. 

FURTHER READING 

The most recent comprehensive reference works are those of Bakker (2010), who includes 
chapters on all aspects covered here and many others, and Colvin (2014). Additionally, 
for the dialects, see Colvin 2007; Buck 1955b contains a useful collection of dialect 
inscriptions with commentary but is rather dated. For the history of Greek from Great 
Attic onwards, see Horrocks 2010. On the decipherment of Linear B see Chadwick 1958 
and Fox 2013, which expands on Kober’s contribution. Ventris and Chadwick’s seminal 
Documents in Mycenaean Greek (1956; 2d ed. 1972) remains an essential introduction 
to the Linear B documents and their language; a third edition is at the time of writing in 
preparation under the editorship of John Killen and Anna Morpurgo Davies. Until then 
the most current survey of the state of the discipline is provided by the three -volume 
Companion edited by Duhoux and Morpurgo Davies (2008, 2011, 2014). On Cypro- 
Minoan see Ferrara 2012-13 and Steele 2013; on the Cypriot dialect and its writing sys- 
tem see Egetmeyer 2009. For the pronunciation of Greek see Allen 1987a and Sturtevant 
1940, and for the historical developments in phonology Lejeune 1972. On morphology 
see Chantraine 1991. On the syntax of the verb see Rijksbaron 2002. On nominal forma- 
tion see Chantraine 1933. 
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PART 6 


ITALIC 

Rex Wallace 


INTRODUCTION 

As a linguistic term, “Italic” refers most commonly to a group of closely related Indo- 
European languages spoken throughout the Italian peninsula in the 1st millennium BCE 
(Map 6.1). The major language in this group, that is to say, the one for which there is 
a significant documentary corpus, is Latin. It is also the only Italic language - in fact 
the only language of ancient Italy, apart from Greek - that did not become extinct. The 
Romance languages, spoken today throughout Western Europe and the Americas, are its 
modem descendants. 

Given the prominence of Latin both linguistically and culturally, and given its impor- 
tance for Indo-European studies, it is the focus of this chapter. However, data from other 
members of the Italic branch will be discussed whenever appropriate. 


The languages of ancient Italy 

At the beginning of the 1st millennium, Latin was but one of many languages spoken in 
ancient Italy (see Map 6.2). Celtic-speaking tribes settled in the north, in the region of 
Lake Maggiore, Lake Como, and the Canton of Ticino. The Veneti inhabited Venetia, 
from the mouth of the Po River as far as the territory of Istria. The Raeti, who spoke one 
of the two major non-Indo-European languages in ancient Italy, lived in the sub-Alpine 
regions around Lake Garda and the Brenner Pass. Speakers of the other major non-Indo- 
European language, Etruscan, occupied Etruria, but pockets of Etruscan speakers could 
be found in Latium, Campania, and Aemilia. The Falisci inhabited a small slice of land on 
the west side of the Tiber River known as the Ager Faliscus. Their major center was Fale- 
rii Veteres (modem Civita Castellana). The Sabellic languages were spoken in an area 
that extended from Umbria along the spine of the Apennines as far south as Bruttium and 
Lucania. Messapic-speaking peoples settled Apulia. Dialects of ancient Greek - Ionic, 
Doric, and Attic - were spoken at numerous colonial outposts in southern Italy. 


Italic within Indo-European 

Within the Italic branch, Latin was most closely related to Faliscan. The Sabellic lan- 
guages (once referred to by scholars as Osco-Umbrian) formed the other branch of Italic 
(Rix 1994). The most important among the Sabellic languages were Oscan, the language 
of the inhabitants of Samnium and Campania; Umbrian, the language of the inhabitants 
of ancient Umbria; and South Picene, the language of the people who inhabited the south- 
ernmost parts of Picenum along the Adriatic coast. Other members of this branch are 
known, but only from a handful of inscriptions. 
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MAP 6.1 THE REGIONS OF ANCIENT ITALY 

Venetic may have been Italic as well, perhaps having its own branch. The amount of 
evidence that can be used to determine the position of Venetic is limited and difficult to 
evaluate. Phonological developments, such as the change of Proto-Indo-European (PIE) 
aspirates to fricatives in word-initial position and to voiced fricatives or stops in medial 
position, point to an Italic connection, but the morphological evidence, such as it is, does 
not contribute to a decision one way or the other. The best approach is to leave the ques- 
tion open and await the acquisition of additional evidence. 



ITALIC 319 



MAP 6.2 THE LANGUAGES OF PRE-ROMAN ITALY 

Italo-Celtic 

Correspondences, both phonological and morphological, are shared by members of the 
Celtic and Italic branches: the thematic genitive singular in -T (Ogham Irish maqqi ‘son’, 
Latin uirT ‘man’, Faliscan titi /titi:/ ‘Titus’ [personal name]); the a-subjunctive (Old Irish 
■bera, Latin ferat ‘carry’, Oscanputlad ‘be able’); the superlative suffix *-(i)smmo- (Old 
Irish tressam ‘strongest’, Latin maximus ‘greatest’); and the development of *CRHC to 
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CraC (Old Irish gran, Latin granum ‘grain’ < *grHnom). The probative value of some 
correspondences, e.g., the genitive in -T, which is found also in Messapic, is not easy 
to assess (Clackson & Horrocks 2007: 31-34, Weiss 2009: 465^-66, Fortson 2010: 
276-277). As a result, scholars’ opinion on the question of an Italo-Celtic unity remains 
mixed. 


Origins of Latin 

At the beginning of the historical period Latin was spoken by the inhabitants of Latium 
(see Maps 6. 1 and 6.2). This region was less extensive than the modem province of Lazio. 
It was bounded to the north by the Tiber River, to the east by the Sacco and Liri river 
valleys, to the south by the Garigliano River, and to the west by the Tyrrhenian Sea. 

Archaeological evidence points to the development of a regionally distinct material cul- 
ture, the so-called Latial culture, in Latium during the late Bronze Age, ca. the 13th 1 1th 
centuries BCE (Holloway 1994: 13-14). Evidence for habitation at Rome in the earliest 
stages of the Iron Age, ca. the 10th and 9th centuries BCE, is found on the Palatine and 
Capitoline Hills, and in the Forum area. The evidence for habitation is slightly later at other 
sites in Latium, for example, at Ardea, Satricum, Gabii, Tibur, and Praeneste. Although it 
cannot be proved, it is likely that the people who produced the artifacts attributed to “Latial” 
culture in the late Bronze and early Iron Ages were speakers of Latin. 

Speakers of the Italic languages probably entered the Italian peninsula from the north- 
east by crossing over the Alps. When this happened cannot be determined, but the idea 
that Italic-speaking peoples had begun to settle in Italy before the middle of the 2nd 
millennium BCE, if not earlier, is not likely to be wrong. The linguistic geography of 
pre -Roman Italy leads us to surmise that Italic speakers spread southward from northeast- 
ern Italy along the spine of the Apennines and fanned out into the more hospitable valleys 
and coastal plains of central and southern Italy. 



FIGURE 6.1 FIBULA PRAENESTINA (OIL 1 2 .3). Reproduced by permission of the Center for Epigraph- 
ical and Palaeographical Studies, the Ohio State University 
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The spread of Latin 

Apart from Greek, Latin was the only language of ancient Italy that did not become 
extinct. Speakers of other languages succumbed to the military, political, and cultural 
domination of Latin. The Romans captured the Faliscan capital of Falerii Veteres in 241 
BCE and forcibly removed all of its inhabitants to a less defensible position. Faliscan 
speakers switched to Latin during the 2nd century BCE. Etruscan and the Sabellic lan- 
guages, Oscan and Umbrian, were still spoken at the beginning of the imperial period, but 
it is unlikely that native speakers of these languages survived beyond the final decades of 
the 1st century CE. Even the languages - both Indo-European and non-Indo-European - 
spoken at the geographic extremes of the peninsula disappeared soon after Roman 
legions and Roman administration gained a foothold in their territories. Venetic, Raetic, 
and Celtic, languages spoken in the Cisalpine territories of northern Italy, did not survive 
much beyond the reign of Augustus. Messapic inscriptions are not attested after the 1st 
century BCE. Ancient Greek was the only language spoken on the peninsula in antiquity 
that was not supplanted by Latin. It survived primarily because of its prestige value for 
Romans and because its speakers maintained a vibrant, though often contentious, rela- 
tionship with the homeland. 

In the course of a few centuries, then, Latin went from being the language of Latium 
to the language of a vast empire that included Europe and parts of Africa and Asia. The 
paths of diffusion of the language coincided with Roman military conquests. By the end 
of the war against Hannibal and the Carthaginians, Rome had annexed the islands of 
Sicilia, Sardinia, and Corsica, and set up outposts in southern Spain. Annexation of Illyr- 
icum (which was situated along the Dalmatian coast), of northern Africa, and of Greece 
followed shortly thereafter. The final pieces of the Roman imperial conquest, Britain and 
Dacia, were in place by the end of the 1st century CE. 

Although the Romans never developed an official language policy, conquered peoples 
eventually abandoned their native languages for Latin. The Roman ruling classes, who 
were responsible for civil administration, finances, and military matters in the colonies 
and conquered territories, left most institutions and religious practices in the hands of 
the natives. But the fact that Latin was perceived as the language of prestige, and thus 
the language of economic, political, and military advancement, eventually made it the 
language of choice for most speakers. The adoption of Latin took place via periods of 
bilingualism during which the natives spoke their own languages at home in private with 
family and friends, and Latin in the public arena. Ultimately, great numbers of speakers, 
within Italy and outside of Italy, gave up their native languages for Latin. Pre-existing 
languages generally disappeared without a trace, although some scholars have argued that 
it is possible to detect features of the substratum languages by means of changes appear- 
ing in the Romance languages. 

Ironically, prestige is the reason why Latin did not succeed in replacing ancient Greek 
as the language of everyday use in the eastern part of the empire. For the Roman elite, the 
Greek language enjoyed an especially elevated status. Members of the Roman aristoc- 
racy, when they could afford to do it, sent sons to Athens for study as part of a rigorous 
program of education. 


Latin and the Romance languages 

Latin is the linguistic parent of the Romance language group and as such is the progenitor 
of the modem Romance languages. French, Italian, Portuguese, Rumanian, Spanish, and 
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the remaining Romance varieties are the modern continuants of regional Latin varieties 
that were spoken by traders, commercial entrepreneurs, soldiers, administrators, and their 
families at the colonial settlements in foreign territories incorporated into the Roman 
Empire. 


Medieval Latin and later Latin 

Latin did not cease to be written even as regional varieties of the spoken language diverged 
and ultimately emerged as Romance languages, nor did it cease to be spoken in the aca- 
demic arena. Latin was adopted as the lingua franca of scholarly discourse during the medi- 
eval period, and it survived as such well into the formative period of the Renaissance, 
finally giving way to treatises and tracts written in the native tongues of their authors. With 
the rise of linguistic nationalism, Latin was restricted more and more to the sphere of reli- 
gious discourse and learned publications. Indeed, the fonner is the primary venue in which 
Latin survives today. The Vatican continues to publish its encyclicals in Latin. 


Varieties of Latin 

Although it is impossible to say when Latin and the other Italic languages began to 
diverge, we can pinpoint the date of the earliest inscriptions. They go back to the middle 
of the 7th century BCE. And, although we cannot say precisely when Latin ceased to be 
Latin and the Romance languages became recognizably such, we can point to the earliest 
text attested in a Romance language, e.g., the Strasbourg Oath in Old French, which dates 
to 842 CE, and assume that the “transition” from Latin to Romance happened somewhere 
between the 7th and 9th centuries CE. If we accept this as the transition period between 
Latin and the Romance languages, then the recorded history of the Latin language ranges 
over some 1,250 years. 

Because the history of Latin covers a vast expanse of time, it is possible to recog- 
nize changes in the language at all levels of linguistic structure over the course of its 
development from the language of the earliest inhabitants of Latium to the language of 
the inhabitants of the military outpost at Vindolanda in Great Britain. Indeed, since the 
chronological period for Latin is so great, it is convenient to recognize stages in the 
historical development of the language. Even though these chronological divisions are 
arbitrary, they serve as guideposts that facilitate discussion. Most historians of the Latin 
language accept some version of the following schema: 

Very Old Latin (VOLat.), ca. 65CM-00 BCE 
Old Latin (OLat.), ca. 400-100 BCE 
Classical Latin (Class. Lat.), ca. 100 BCE-200 CE 
Late Latin (Late Lat.), ca. 200-600 CE 

Looking across the chronological divisions of the language, it is possible to identify dif- 
ferences in pronunciation, in the formation of words, and in sentence construction. For 
example, the dative singular of o-stem nouns and adjectives ended in a long diphthong in 
Very Old Latin, e.g., duenoi /dweno:j/ ‘good (man)’, m. dat. sg. By the Old Latin period, 
however, this long diphthong had changed to a long vowel; other changes transformed 
the initial syllable of duenoi from *dwe- to bo-, yielding Classical Latin bono /bono:/. In 
Old Latin the syntax of CMm-clauses, regardless of their function, called for a verb in the 
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indicative mood. By the Classical Latin period, however, the mood of the verb in clauses 
with causal and concessive function was the subjunctive. In Late Latin the preposition de 
‘from’ takes on the functions of the genitive case; that is to say, it can be used to indicate 
possession, part for whole, and the “objective genitive”. 

While it is important to recognize chronological differences within the history of Latin, 
it is equally important to recognize differences based on geography, social class, and 
style or register. As an example of regional variation we point to the dative singular 
of d-stem nouns. In the Latin of the 3rd century BCE, speakers of some Latin dialects 
outside of Rome pronounced the dative singular ending with a long a, rather than with a 
diphthong, which was the standard in Rome, e.g., fortuna /fortu:na:/ ‘luck’ (Praeneste), 
but Roman Latin mensae /me:nsae/ ‘house’. In the Latin of Pompeii - but only in graffiti 
and thus perhaps only in the speech of sub-elite social orders - syllable-final short i was 
syncopated when standing between /w/ and It/, e.g., aberaut ‘has lost’, cf. CL aberrauit. 
This development is not found in elite Latin texts of the 1st century CE, which suggests 
that it was restricted, at least at its inception, to sub-elite Latin dialects. The language 
variety employed for literary composition, which is commonly referred to as Classical 
Latin, was a Kunstsprache. It drew on features from a number of other varieties of Latin, 
for example, Old Latin, and the language of law and religion. The writers of this variety 
also appropriated or elaborated on linguistic and stylistic features borrowed from ancient 
Greek authors (Clackson & Horrocks 2007: 183-228). 

The Latin language, then, as we would expect of any language, covered a spectrum of 
varieties, written and spoken, geographic and social, formal and informal. Some scholars 
adopt a more severe approach, viewing the differences between elite and sub-elite vari- 
eties of Latin in terms of diglossia, in which two co-existing linguistic systems are rec- 
ognized, one for the language of literature and official communication, the other for the 
language of everyday speech. The idea is that these two systems were already divergent 
by the end of the Roman Republic, and that they continued to diverge over the course of 
the history of the empire and beyond. The evidence does not corroborate this position. 

Documentary evidence 

Latin has come down to us through the medium of written texts. There are two primary 
documentary sources. Arguably, the most important source comes in the form of manu- 
scripts of literary texts. Unfortunately, few Latin manuscripts have survived from antiq- 
uity. Most are late, dating as they do from the 10th to the 15th centuries CE. Thus, the 
texts of Roman authors are attested at least a thousand years after they were originally 
composed. For this reason the orthography of the manuscripts, particularly the manu- 
scripts of the Old Latin poets, may not be as reliable as one would wish it to be. Editorial 
changes made by later copyists, who sometimes altered Old Latin spellings to conform to 
Later Latin models, have made their way into the manuscript tradition. 

Roman literary compositions are the nucleus of this form of documentation. The lit- 
erary productions of the most distinguished Roman poets and prose writers, e.g., Vergil, 
Catullus, Ovid, and Horace for poetry, and Caesar, Cicero, Livy, and Tacitus for prose, are 
known to us with few exceptions from manuscripts. But so also are the works of authors 
such as Columella and Vitruvius, both of whom composed important technical treatises, 
the former on agriculture, the latter on architecture. 

The second important source of documentation is epigraphic. Epigraphic documen- 
tation comes in the form of inscriptions, graffiti, and dipinti (painted texts) that have 
been discovered over the length and breadth of Roman territory. For the most part, the 
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inscriptions that have survived were incised or chiseled on non-perishable material - 
ceramic, stone, and metal. 

Some inscriptions were produced in order to stand the test of time: epitaphs preserving 
the memory of the deceased, texts of laws and decrees incised on bronze and earmarked 
for public display, texts commemorating generous contributions by patrons to their com- 
munities, dedications intended to accompany an offering to a deity, and the “signatures” 
of ceramic makers or their owners. Other texts survived on account of favorable circum- 
stances. The writing on the wooden tablets of the Pompeian banker Lucius Caecilius 
Iucundus survived because of historical accident, namely, the eruption of Mt. Vesuvius in 
79 CE. Lead tablets incised with curses against competitors or against rivals in love sur- 
vived because, in order to be efficacious, they had to be devoted to underworld demons 
and were therefore hidden away in tombs, buried underground, or tossed into wells. The 
leaf-tablets from Vindolanda survived because they were abandoned in drainage ditches 
that were subsequently covered with clay, which deprived the bacteria that would have 
destroyed them of the oxygen necessary to multiply. 

ALPHABET, ORTHOGRAPHY, AND PHONOLOGY 

The Latin alphabet was borrowed from the Etruscans, probably from the inhabitants 
of Caere, a settlement located across the Tiber River a short distance from Rome. The 
Latin abecedarium cited in (1) is the result of several alphabetic reforms that took place 
between the 7th and 3rd centuries BCE. 

(1) ABCDEFGH1KLMNOPQRSTUX 

The letter <g> was added to the alphabet Latin speakers inherited from the Etruscans. 
It first appears in the 2nd half of the 3rd century BCE and, oddly enough, occupied the 
position once held by <z>, which seems never to have been used to write Latin. The letter 
<f>, which originally had the value /w/, is the result of the simplification of the digraph 
<fh>, an older spelling for the sound /f i. In the early imperial period the Greek letters <y> 
and <z> were added to the script to spell sounds found in Greek loan words. They were 
placed at the end of the abecedarium. An attempt to add additional consonantal signs to 
the alphabet was made by the emperor Claudius, and though two of his signs were used 
in inscriptions, they disappeared from the writing system soon after his death. 

For the most part, the consonantal letters stood in a one-to-one relationship to the con- 
sonantal phonemes in the sound system, but in a few instances the spelling of sounds was 
less optimal. The letters <c> and <k> stood for the voiceless velar /k /, e.g., capio ‘seize’ 
and kalendae ‘Kalends’. Occasionally, <q> spelled /k/, especially if the following vowel 
sign was <u>, e.g., pequnia ‘money’. This three-way spelling of Dd is the remains of an 
orthographic rule inherited from the Etruscans. The digraph <qu> spelled a voiceless 
labiovelar; <gu> probably spelled the voiced counterpart of <qu> (on which see below). 
Finally, the letter <x> stood for the cluster /ks/, e.g., arx ‘citadel’ /arks/. 

Vowel length was rarely represented orthographically. A few attempts at indicating 
vowel length, for example, writing vowels double (paastores ‘shepherds’), marking 
length with a diacritic (murum ‘wall’), and employing etymological spellings such as 
ei for /i:/ ([uJeiuam ‘living’), were used sporadically, but they were not adopted as the 
standard. Thus, in most instances, a single vowel sign stood for both a long vowel and 
its short counterpart, e.g., <a> = /a:/ and /a/. The glides /j/ and /w/ were also spelled with 
vowel signs, e.g., iam ‘now’ = /jam/ and uir ‘man’ = /wir/. In the Classical Latin spelling 
system, the signs <i> and <u> represented three phonemes. 
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Metrical evidence indicates that words like peior ‘worse’, maior ‘more’, and aid ‘say’ 
had a heavy first syllable. When standing between vowels, the letter spelled a double or 
long consonant /majjor/. 

In the oldest Latin inscriptions the direction of writing was variable. Some inscrip- 
tions were written in left-to-right direction, others in right-to-left direction. A few 
inscriptions were incised in the boustrophedon style. By the 4th century BCE, how- 
ever, left-to-right direction was the standard. Punctuation in texts was minimal. Very 
Old Latin inscriptions were often written scriptio continua. Later, punctuation points 
regularly separated words, but sentence-signs and paragraph-signs were rare, even in 
long inscriptions. 


Phonology 

The phoneme inventory of Latin is displayed in Table 6.1. 

The voiced labiovelar, which was restricted to the environment after nasals ( ninguit 
‘it’s snowing’), is the only phoneme whose status is controversial. It is listed as part of 
the phonological inventory based on phono tactic and morphophonemic evidence. The 
treatment of qu and gu as clusters /kw/ and /gw/ does not fit with the metrical evidence, 
which points to a unitary phoneme, at least as far as qu is concerned. Interestingly, the 
metrical evidence also suggests that the cluster /kw/, as in equus ‘horse’ < *ekwos, fell 
together with the original labiovelar. 

The velar nasal [q] is sometimes considered a separate phoneme based on the evidence 
of minimal triplets: amm ‘river’, anm ‘year’, and agm ‘lamb’ [aqni:]. However, it is 
possible to treat [q] as an allophone of the phonemes Ini and Igl. Following this analysis, 
the velar nasal in longus [loqgus] is the result of the assimilation of Ini, a dental nasal, 
to the velar position of the following stop. The velar stops /k/ and Igl were pronounced 
as the corresponding nasal when standing before a dental nasal. Thus, ignis ‘fire’ was 
pronounced [iqms] and dignus ‘worthy’ as [diqnus]. 

The glottal fricative /h/ was restricted in large part to word-initial position, and it was a 
feature of elite Latin. But even in the elite variety of the language, word-initial h behaved 
as an outlier. It did not “make position” in metrical texts, and members of the educated 
classes, as is clear from poem 84 of Catullus, could be guilty of hypercorrect pronuncia- 
tions. Although h was written in medial position in some words, it is doubtful that it was 
pronounced, h did not prevent vowel contraction, e.g., mi for mihi ‘to me’, and it did not 
block rhotacism of -5, e.g., diribed for *dishabeo. 

TABLE 6.1 THE PHONEMES OF CLASSICAL LATIN 


Consonants 

stops 

P 

b 

t 

d 

k 

g 

k w 

gw 


fricatives 

f 


s 






h 

nasals 

m 


n 







lateral 

trill 

glides 


1 

r 

j 



w 



Vowels 







diphthongs 



I i 





u 

u 

ae 

ej 

uj 

e 

£ 



0 

0 


aw 

ew 

oe 



a 

a 





ow 
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The vowels in the system were distinguished by length. The short members were lax 
and more central than their long counterparts. The number of diphthongs in the system 
was for all intents and purposes restricted to /ae/ and /aw/, /oe/ was present in a handful of 
words, e.g., poena /poena/ ‘punishment’, perhaps under Greek influence, cf. notvf). The 
diphthongs /ej/, /uj/, etc., were the result of contractions and were restricted to a small 
number of words, e.g., deinde ‘then’, cui ‘to whom’. 

In Classical Latin, epigraphic and metrical evidence suggest that word-final -Vm sur- 
faced as a long nasalized vowel. In Old Latin inscriptions of the 3rd century BCE, final -m 
was often omitted in writing, e.g., OL optumo ‘best’ = CL optimum, a spelling that makes 
sense if the nasal was lost and the preceding vowel was nasalized. The etymologically 
appropriate spelling was restored in the 2nd century BCE and became standard in literary 
Latin and bureaucratic prose. In poetry, words ending in - Vm were subject to elision if 
the following word began with a vowel just as if they ended in a long vowel. And words 
ending in - Vm “made position”, that is to say were metrically heavy, if followed by a 
word beginning with a consonant. Nasalized vowels were also found before -ns, both pri- 
mary and secondary, and -nf- (nasal + fricative), e.g., Tnfans ‘child’, mensa ‘table’, consul 
‘consul’. Note the common abbreviation cos. for consul. 


Diachronic developments 

From a diachronic perspective the most distinctive phonological development, and 
one that set Italic apart from its sisters, was the treatment of the PIE voiced aspirates 
(Stuart-Smith 2004). 

In word-initial position, the aspirates changed to voiceless fricatives. The labial and 
dental aspirates merged as /f/; the velar aspirate, reflecting an earlier merger of PIE *g h 
and *g h , became /h/, presumably via an intermediate stage as *x. Comparative data are 
assembled in (2). 


(2) PIE aspirates in initial position 

*b h : Latin frater ‘brother’, m. nom. sg., Oscan fratriim, m. gen. pi., Umbrian frater 
‘brothers’ (in a religious fraternity), m. nom. pi. < *b h reh,ter-; Latin fero ‘I carry’, 1 
sg. pres, act., Marrucinian ferenter ‘they are carried’, 3 pi. pres, pass., Umbrian fertu 
‘he should bring’, 3 sg. imp. act. < *b h er-; Latin far ‘grain’, n. nom. sg., Faliscan far, 
Oscan far, Umbrian far < *b h ars 

*d h : Latin facio ‘I make’, 1 sg. pres, act., Oscan fakiiad ‘let him sacrifice’ 3 sg. pres, subj., 
Umbrian fa<;ia ‘he should sacrifice’, 3 sg. pres, subj., Volscian faQia, 3 sg. pres, subj., 
Venetic vhagsto ‘he made’, 3 sg. perf. < *d h h,-k-, cf. Latin fecit, 3 sg. perfi, based on full 
grade *d h eh,-k-; Latin janum ‘shrine’, n. nom. sg. < *d h h[S-no-, Oscan fiisnu ‘temple, f. 
nom. sg., Umbrian fesnaf(e) ‘to the temple’, f. ace. pi. < *d h eh,s-neh 2 - 

*g h /g h : Latin hortus ‘garden’, m. nom. sg., Oscan hurz ‘grove’ (as sacred area), m. nom. 
sg. < *g h ortos ‘enclosed area’; Latin horitur ‘incites’ 3 sg. deponent < *g h r-ye/o-, 
Oscan heriiad ‘he should wish’, 3 sg. subj. act., Umbrian heriest ‘he will wish’, 3 
sg. fut. act. < *g h er-ye/o- 

*g wh : Latin formus ‘warm’, m. nom. sg. < *g wh ormos; de-fen-do ‘I ward off’, 1 sg. pres, 
act. < *g wh ende/o- (root *g wh en- ‘destroy’) [no Sabellic examples] 


In medial position the lines of development were more complex, and they served to dis- 
tinguish Latin from the rest of the Italic group. The labial and dental aspirates developed 
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into voiced fricatives [[3] and [3], which may be the stage attested by Venetic, if Io.u.ze- 
rocpo.s. ‘children’, dat. pi., is phonetically [low3ero(3os]. Medial [[3] and [3] merged as [|3] 
in Faliscan and in the Sabellic languages. In Rome and environs, [|3] and [3] survived and 
eventually became the corresponding voiced stops Pol and /d/. The velars (PIE palatals 
and velars), on the other hand, developed to /h/ in all Italic varieties, perhaps via a voiced 
velar fricative. The aspirated labiovelar moved along a different trajectory in Latin and 
the Sabellic languages: *g wh became u /w/ in Latin but / /(3/ in Sabellic. Examples of the 
development of aspirates in medial position are in (3). 


(3) PIE aspirates in medial position 

*b h : Latin albus ‘white’, m. nom. sg., Umbrian alfir, n. abl. pi. < *(h[)alb h os, cf. Oscan 
alafaternum ‘inhabitants of Nuceria Alfatema’, m. gen. pi.; Latin tibi ‘to you’, dat. 
sg., Oscan t(e)fei, dat. sg., Umbrian tefe, dat. sg., South Picene tefei, dat. sg. < 
*teb h ey 

*d h : Latin medius ‘middle’, m. nom. sg., South Picene mefiin loc. sg. + postposition -en, 
Oscan mefiai, f. loc. sg. < *med h yo-; Latin ruber ‘red’, m. nom. sg., Umbrian rufru 
< *h 1 rud h ro- 

*g h /g h : Latin ueho ‘I transport’, 1 sg. pres, act., Umbrian arveitu, ‘let him bring’, 3 sg. 
pres. imp. < *weg h e/o-; Oscan feihuss ‘walls’, m. acc. pi. < *d h eyg h -, cf. Latin jingo 
‘I fashion, shape’, 1 sg. pres, act.; Latin mihi ‘to me’, dat. sg., Umbrian mehe, dat. 
sg. < *meg h ey 

*g wh : Latin foueo ‘I heat up’, 1 sg. pres. act. < *d h og wh eye/o-; Umbrian vufru ‘votive’, m. 
acc. sg. < *h[WOg wh -ro- 

The development of the PIE labiovelars, *k w and *g w , is another change that distinguished 
the Sabellic languages from Latin and Faliscan (and from Venetic too). *k w survived 
in Latin and Faliscan in most environments, e.g., Latin -que ‘and’, Faliscan -cue, cf. 
Venetic -kve < *-k w e. *g w survived too, but only when it followed a nasal, e.g., Latin 
unguen ‘salve’, n. nom. sg. < *h 3 eng w - ‘anoint’; in other environments, it changed to u 
/w/, e.g., Latin uenio ‘come’, 1 sg. pres. act. < *g w m-ye/o- ‘sets out’. In contrast, *k w and 
*g w merged with the labial stops in Sabellic, e.g., Oscan pis ‘who’, m. nom. sg. < *k w is; 
Oscan benust ‘will have come’, 3 sg. fut. perf. < *g w em- (the dental nasal by analogy 
with the present). For the development of the aspirated labiovelar, see above. 

In Latin intervocalic *s changed to r, e.g., Latin generis ‘family’, n. gen. sg. < *gene- 
ses. This change, generally referred to as rhotacism, took place also in Umbrian, in final 
as well as medial position, e.g., Umbrian farariur ‘of the grain’, m. nom. pi. < *-asiy5s. 
Oscan seems to have preserved the intermediate stage of development. Medial *s was 
voiced but not rhotacized, e.g., egmazum ‘property’, f. gen. pi. 

The liquids and nasals developed for the most part unchanged. Syllabic liquids devel- 
oped to oR, e.g., mors ‘death’, f. nom. sg. < *mrtis; syllabic nasals developed to eN, 
e.g., Latin decern ‘ten’ < *dekm, cf. Umbrian desen-duf‘ twelve’ (‘ten-two’). In Sabellic, 
syllabic nasals in word-initial syllables appear as aN, e.g., tanginud ‘decision’, f. abl. sg. 
This may be a secondary development, perhaps under accent. 

The PIE laryngeals were lost in pre- and post-consonantal positions in Italic. If a laryn- 
geal followed a syllabic resonant, the result was Ra, e.g., Latin grata-, Oscan bratas 
‘favor’, f. gen. sg. < *g w rH-teh,; Latin note ‘son’, m. nom. sg., Paelignian cnatois, m. dat. 
pi. < *gnh,-to-. Laryngeals standing between consonants were vocalized to a, e.g., Latin 
datus ‘given’, nom. m. sg. < *dh,-to-, animus ‘spirit’, nom. m. sg. < *h 2 enh,-mo- (medial 
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-i- here by “vowel weakening”, on which see below, under “Accent”), cf. Oscan anams 
‘breath’. Original *VHC sequences yielded long vowels via loss of the laryngeal and 
compensatory lengthening of the preceding vowel, e.g., Latin Jeriae ‘holidays’, f. 
nom. pi., Oscan fiisnam ‘sanctuary’, f. ace. sg. Both nouns were derived from the root 
*d h eh,s- ‘divinity’. 

In word-initial syllables, PIE short and long vowels survived in Latin in most environ- 
ments. Long vowels, regardless of source, were shortened in word-final syllables closed 
by any consonant but - 5 , e.g.,portat ‘he carries’ > portat, portor ‘I’m carried’ > portor, 
etc., but portas ‘you carry’. These changes led to morphophonemic alternations in Latin 
verb paradigms (see Table 6.10). 

Apart from *ay and *aw, the PIE diphthongs developed into long vowels in all posi- 
tions in Latin, e.g., *ey > 1 (*deyko > died ‘say’); *oy > u (*oynos > unus ‘one’); *ew, 
*ow > u (*dewko > dued ‘lead’). 


Accent 

At some point in the prehistory of Latin, the pitch-accent system of PIE was replaced by 
a system in which the vowel in the initial syllable of a word carried a stress accent. 

The effects of this stress accent are abundantly evident in Latin. Short vowels were 
subject to phonetic processes known collectively as “vowel weakening”. Quality dis- 
tinctions were eliminated in non-initial syllables. In open syllables short vowels changed 
to e.g .,facio ‘make’ beside efficio ‘ construct ’. In closed syllables, original a changed 
to e, and original o to u; original e and i remained, e.g ,,factus ‘made’ beside effectus 
‘constructed’, euntis ‘going’, m. gen. sg. < *eyontes. Diphthongs in non-initial sylla- 
bles, which did not otherwise become long vowels (see above, “Phonology”), were also 
affected by “weakening”. *ay and *aw ultimately changed to F and Ci, e.g., caedo ‘I cut’ 
beside incido ‘I cut into’, and claudo ‘I shut’ beside concludo ‘I confine’. 

Over the course of the history of Latin, unstressed vowels in open medial syllables, 
usually, but not exclusively, in the environment of sonorants or fricatives, were subject to 
syncopating processes. Examples follow: *deksiteros ‘right’ > dexter, *uirotuts ‘manli- 
ness’ > uirtiis; and *kedate ‘give here’ > cette. 

By the 3rd century BCE, the Latin “law of the penult” had replaced the word-initial 
accent. Penultimate syllables, if heavy, were accented. If the penultimate syllable was 
light, then the accent shifted to the antepenultimate syllable, if there was one, e.g., affec- 
tus, portamus, dukimus, but duett, animum, facio, etc. 

The Sabellic languages also show the effects of a word-initial accent. Several rounds 
of syncope eliminated short vowels in medial and final syllables. In Proto-Sabellic, short 
vowels in word-final syllables were lost before -s, e.g., Oscan hurz ‘grove’, m. nom. sg. 
< *g h ortos, Oscan Pakis ‘Pacius’, m. nom. sg. < *pakiyos, Oscan humuns ‘men’, m. 
nom. pi. < *homones. Subsequently, short vowels in medial syllables were syncopated, 
e.g., Oscan actud ‘let him do’, 3 sg. act. imp. < *aget5d, Oscan factud ‘let him make’, 3 
sg. act. imp. < *fakitod. 

In the Sabellic languages the accent may have remained on the initial syllable of words. 
The double spelling of vowels in Oscan is found with few exceptions in word-initial syl- 
lables, and this points to the continuation of vowel length in this position, presumably 
under accent. 

Whether or not the change from a pitch accent to a stress accent is to be assigned to 
the Proto-Italic period cannot be determined. Some scholars prefer to see it as a later 
development that originated among languages spoken in central Italy, which then spread 



ITALIC 329 


out from the innovative language via contact. There is something to be said for this view. 
In the non-Indo-European language Etruscan, vowels in medial syllables were synco- 
pated, which suggests that the vowel in the initial syllable was stressed, e.g., Old Etruscan 
tiiruce ‘dedicated’ > New Etruscan turce. 

NOMINAL AND PRONOMINAL MORPHOLOGY 

Nominal forms were inflected for the categories of gender (m., fl, and n.), number (sg. 
and pi.), and case (nom., voc., ace., gen., dat., and abl.). 

The PIE case system was reduced from eight to seven in Proto-Italic; the ablative and 
instrumental merged as the ablative. The number of cases was further reduced in Latin 
by the merger of the locative and the ablative, although the locative singular endings -ae 
and -T are attested in a few nouns and in the names of towns and cities, e.g., hum! ‘on 
the ground’, dorm ‘at home’, Romae ‘in Rome’, etc. In Oscan, Umbrian, and the other 
Sabellic languages, the locative case remained a more vibrant part of the system, e.g., 
Oscan else! teref ‘in this land’. The locative appears to have survived in Venetic too, to 
judge from the noun phrase dekomei diei ‘on the 10th day’, cf. Latin die septum T ‘on the 
7th day’. 

The dual was lost as a morphological category in Italic, but a few forms, e.g., Latin 
ambo ‘both’ and duo ‘two’, preserve the dual nominative/accusative ending. Remnants of 
the PIE collective survived in Latin o-stem plurals, such as loca ‘places’ beside the plural 
locT, and perhaps indirectly in nouns such as pila ‘ball (of hair)’ beside pilus ‘a hair’. 

Nominal stems had the grammatical feature of gender. Certain stem formations were 
associated with certain genders. For example, d-stems (*-eh 2 -stems) were overwhelm- 
ingly feminine; o-stems were masculine and neuter; -mew-stems were neuter, and so 
forth. Sometimes nouns of the same formation could be either masculine or feminine, 
depending on the gender of the referent, e.g., pater ‘father’, frater ‘brother’, and mater 
‘mother’. 

The case endings 

In some noun classes, perhaps within late PIE, contraction of the stem-final vowel and 
the vowel of the inflectional ending led to forms where it was not easy to distract stem 
and ending. Consider the dative singular of o-stems in which the ending -o (VOL -oi) was 
the result of the contraction of *-o-ey > *-oy and then the loss of diphthong-final y, e.g., 
VOL duenoi ‘good [man]’, CL bond ‘good’. The same is true also for many cases in the 
d-stem (< *-eh,) declension. The situation was further complicated by cross-paradigmatic 
borrowings of endings and, in particular, by the borrowing of endings from pronominal 
inflection. The genitive plural of d-stem and o-stem nouns and adjectives is a case in 
point. The inherited genitive plural ending -um survived in a few o-stem nouns, e.g., 
deum ‘gods’, m. gen. pi., but most o-stems inflected with -rum (with lengthening of the 
stem vowel), an ending that was modeled on the genitive plural of d-stem nouns, e.g., 
casarum — * dedrum. But the ending -rum, from earlier *-som, was not part of the original 
d-stem paradigm; it was borrowed from pronominal inflection. 

The endings for Classical Latin consonant-stem nouns are given in Table 6.2. The 
inherited athematic endings, although altered by sound change in many cases, survived 
in the singular; the Latin ablative was formally the locative ending *-i. In the plural the 
nominative ending -es was borrowed from the /-stems, where the full-grade stem and the 
ending fused together to produce a long vowel (*-ey-es > -es). The Latin accusative plural 
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ending -es developed regularly from *-ns. The dative/ablative ending 
-i from the /-stems. 

TABLE 6.2 CLASSICAL LATIN CONSONANT-STEM ENDINGS 

-ibus had its initial 


Singular 



Plural 


m./f. 

n. 

m./f. 

n. 

nominative 

-s 

-0 

-es 

-a 

vocative 

-s 

-0 

-es 

-a 

accusative 

-em 

-0 

-es < *-ns -a 

genitive 

-is 


-um 


dative 

-i 


-ibus 


ablative 

-e 


-ibus 



The endings of the Classical Latin o-stem declension nouns 
in Table 6.3. 

TABLE 6.3 CLASSICAL LATIN O-STEM ENDINGS 

and adjectives are 

listed 



Singular 


Plural 



m. 


n. 

m. 

n. 

nominative 

-s 


-m 

-T 

-a 

vocative 

-e 


-m 

-T 

-a 

accusative 

-m 


-m 

-os 

-a 

genitive 

-T 



-drum 


dative 

-0 



-is 


ablative 

-0 



-is 



Historically, the endings of the nominative, accusative, and dative singular were the 
same as the athematic endings, although the dative is the result of the contraction of 
stem vowel and ending, as noted above. The thematic vocative singular was ending- 
less; word-final -e is the e-grade of the thematic vowel, e.g., puere ‘boy’ (Plautus), cf. 
Umbrian tefre ‘Tefer’. In Very Old Latin and in Faliscan the o-stems were inflected 
with two genitive singular endings: -sio (VOL ualesiosio ‘Valerius [personal name]’, 
m. gen. sg.; Old Faliscan uo<I>tenosio ‘Voltenos [personal name]’, m. gen. sg.) and 
-F (Old Faliscan titi ‘Titus’, m. gen. sg.). The functional distinction between the two, 
if any, is not clear. The ending -sio does not appear to have survived much beyond the 
6th century BCE in Latin, although an Old Latin form attested on an inscription from 
Ardea may continue -sio (titoio /titojjo/ ‘Titus’, Ardea). The personal name Mettoeo 
Fufetioeo ‘Mettius Fufetius’, which appears in Ennius (Ann. 2, 139 [Warmington]), was 
probably modeled on the Greek epic genitive -oio. The ending -T is not a Latino-Faliscan 
innovation; it is also found in Celtic and Messapic. The PIE thematic declension had a 
distinct ablative singular ending *-5d. The ending is found in Old Latin, e.g., gnaiuod 
‘Gnaeus [personal name]’ and poplicod ‘of the people’, and in Oscan, e.g., sakaraklud 
‘sanctuary’. Classical Latin -o is the result of the loss of word-final -d following a long 
vowel. The expected nominative plural ending is -os < *-o-es. This ending is found in 
Sabellic languages, e.g., Oscan nuvlanus ‘inhabitants of Nola’, m. nom. pi., where it 
was extended to pronominal forms, e.g., Oscan iusc ‘they’, m. nom. pi. Latin o-stems, 
on the other hand, had the pronominal ending *-oy, which developed regularly to -T, thus 
uirf ‘men’, m. nom. pi. The dative/ablative ending -is is, from a PIE perspective, the 
instrumental ending *-oys. 
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Neuter 2nd declension nouns had the same ending for the nominative, vocative, and 
accusative. In the singular, the ending was -m; in the plural, the ending was -a. 

Declensional classes 

In Latin, nouns were organized into five classes called declensions - a practice that is fol- 
lowed in most descriptions of Sabellic nominal forms as well. The five Latin declensions 
reflect in large part the stem types attested in other ancient Indo-European languages. The 
d-stems, historically *-eh 2 -stems, are the 1st declension; o-stems are the 2nd declension; 
consonant-stems and /-stems are grouped into the 3rd declension; and M-stems are the 4th 
declension. The e-stems, which are an Italic innovation, are 5th. Examples of inherited 
inflectional types are given in Tables 6.4 and 6.5; e-stems are in Table 6.7. 


TABLE 6.4 CLASSICAL LATIN NOUN PARADIGMS, MASCULINE AND FEMININE 
GENDERS 


singular 

0 -stems 

o-stems 

C-stems 

/-stems 

w-stems 

nominative 

mens a 1 

lupus 3 

rex 5 

puppis 1 

tribus 8 

vocative 

mens a 

lupe 

rex 

puppis 

tribus 

accusative 

mens am 

lupum 

regem 

puppimS 

-em 

tribum 

dative 

mensae 

lupd 

regi 

puppi 

tribui 

ablative 

mens a 

lupo 

rege 

puppT, -e 

tribu 

genitive 

mensae 

lupi 

regis 

puppis 

tribus 

locative 

Romae 2 

hum f 4 

run 6 

— 

— 

plural 

a- stems 

0 -stems 

C-stems 

/-stems 

w-stems 

nominative 

mensae 

lupi 

reges 

puppes 

tribus 

vocative 

mensae 

lupi 

reges 

puppes 

tribus 

accusative 

mens am 

lupos 

reges 

puppis 

tribiis 

dative 

mensis 

lupis 

regibus 

puppibus 

tribibus 

ablative 

me ns is 

lupis 

regibus 

puppibus 

tribibus 

genitive - 

mens drum 

lupdrum 

regum 

puppium 

tribuum 

1 ‘table’, 2 ‘at Rome’ 

, 3 ‘wolf’, 4 

‘on the ground’, 5 

‘king’, 6 ‘in the country’, 

1 ‘stem of a boat’, 

8 ‘tribe’. 


Declensions 2, 3, and 4 also included nominal forms that were neuter gender. The 
inflectional patterns differed from the masculine and feminine in the nominative, voca- 
tive, and accusative, singular and plural. Partial paradigms are in Table 6.5. 

TABLE 6.5 CLASSICAL LATIN NOUN PARADIGMS, NEUTER 


singular 

0 -stems 

C-stems 

/-stems 

u- stems 

nom./voc./acc. 

iugum 1 

genus 2 

mare 3 

genii 4 

plural 





nom./voc./acc. 

iuga 

genera 

maria 

genua 


1 ‘yoke’, 2 ‘kind’, 3 ‘sea’, 4 ‘knee’. 


Masculine gender, 2nd declension nouns and adjectives come in a couple of sub- 
types. The regular type ends in -us < *-os in the nominative singular, e.g., lupus ‘wolf’, 
filius ‘son’. Nouns and adjectives whose stems ended in *-ro- lost the stem vowel in the 
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nominative singular by regular sound change, e.g., uir ‘man \puer ‘boy’, sacer ‘sacred’, 
etc. Nominative case forms such as ager ‘field’ and sacer ‘holy’ developed in the follow- 
ing manner: *agros > *agrs > *agers > ager. These changes account for the alternations 
in - 7 ' 0 -stem paradigms, e.g., ager, m. nom. sg., but agrT, gen. sg., etc.; sacer, m. nom. sg., 
but sacrT, gen. sg., etc.; cf. VOL sakros, m. nom. sg. 

The Latin 3rd declension encompasses PIE consonant-stems and /-stems. Numerous 
declensional subclasses can be recognized based on the final sound of the stem; inflec- 
tional peculiarities are associated with each class. 

1 . Stems ending in stop consonants. This class includes a small group of root nouns of the 
agent or action type, e.g., rex, regis ‘king \pes, pedis ‘foot’, dux, ducis ‘leader’, liix, lucis 
‘light’. Compounds whose final member was a root noun also belong here, e.g., auceps, 
aucipis ‘birdcatcher’ < *awi-kaps, as do nouns fonned by the derivational suffixes end- 
ing in -tat and -tut, e.g., nouitas, nouitatis ‘newness’, and uirtus, uirtutis ‘manliness’. 

2. 5-stems. The neuter 5-stems rhotacize medial -5, thus genus ‘family’, n. nom./acc. sg., 
but generis, gen. sg., generf, dat. sg. Some nouns, like genus, generis, preserve the 
original -o/e ablaut in the suffix (*o > u in closed final syllables). In other nouns, ablaut 
was eliminated in favor of the vowel of the nominative/accusative singular, e.g., corpus 
‘body’, corporis. Masculine and feminine 5-stems were in the process of shifting to the 
7"-stem declension in Classical Latin thanks to the analogical introduction of the -r of 
the oblique stem into the nominative singular, e.g., bonds, honoris — * honor, honoris. 

3. 7"-stems. Nouns of familial relationship preserved remnants of ablaut, e.g., pater, 
m. nom. sg., but patris, gen. sg. Agent nouns fonned by means of the suffix -tor 
made up the largest constituency in this class, e.g., uictor, uictoris ‘winner’. 

4. /-stems. Latin had a few /-stem nouns, e.g., sal ‘salt’, salts, gen. sg., and sol ‘sun’, 
solis, gen. sg. The word for ‘sun’ was in origin a heteroclitic // 77 -stem. The noun 
compound consul ‘consul’ also belongs to this type. The root is *sel- ‘take’. 

5. 77 -stems. De -verbal neuter nouns ending in -men, -minis were very common in Latin, 
and they were productive in Classical Latin, e.g., carmen ‘song’, carminis, gen. sg. 
Animate nouns in -mo, -monis belong to this class as well, e.g., sermo ‘speech’, m. 
nom. sg., sermonis, gen. sg. A small number of nouns preserved traces of ablaut, e.g., 
card ‘flesh’, f. nom. sg., carnis, gen. sg., cf. Umbrian karu, f. nom. sg., karne, dat. 
sg. Nouns of the uirgd-type also have a distinct stem uirgin- outside of the nomina- 
tive singular, e.g, uirginis, f. gen. sg. 

6 . 777-stem. Latin has a single 777-stem, hiems ‘winter’, which, unlike 77-stems, has a 
sigmatic nominative singular, /hiemps/. The p was epenthetic. The noun belonged to 
the PIE root class. 

7. Heteroclitic 7'/77-stems. Latin preserves a few neuter 77 ' 77 -stems, although the class is 
moribund and the nouns that do survive show various types of paradigmatic level- 
ing, e.g., femur ‘thigh’, feminis, gen. sg., but also femoris, gen. sg. (form attested in 
Cicero!); iecur, iocur ‘liver’, nom. sg., iecoris, iocineris, iecinoris, all gen. sg. 

8. /-stems. The number of “true” /-stems, that is to say, nouns whose paradigms pre- 
served /-stem inflection throughout, is relatively small (see Table 6.3). Many /-stems 
have been attracted into consonant-stem inflection to varying degrees. 

From an IE perspective, the /-stems did not inflect in the manner of consonant-stems. 
However, sound change and analogy moved /-stems, particularly those whose stem vowel 
was syncopated in the nominative singular, e.g., mors, mortis ‘death’, gens, gentis ‘clan’, 
etc., in the direction of consonant-stem inflection, which in turn moved consonant-stem 
nouns in the direction of /-stems, e.g., cTuitatium ‘states’, f. gen. pi., cTuitatTs ‘states’, 
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acc. pi. The result is a class of 3rd declension nouns in Classical Latin whose inflection 
lies between that of “pure” consonant-stems and “pure” /-stems, and which is sometimes 
referred to, though somewhat infelicitously, as “mixed-stem” inflection. Nouns of this 
class typically had variant forms of the accusative singular, -em and -im; ablative singu- 
lar, -e and -F; accusative plural, -as and -Is; and, less frequently, the genitive plural, -um 
and -mm. Examples of this inflection are cited in Table 6.6. 

TABLE 6.6 CLASSICAL LATIN ‘MIXED’ DECLENSION 


singular 


nominative 

gens ‘family’ 

Jons ‘spring’ 

mors ‘death’ 

pars ‘portion’ 

vocative 

gens 

Jons 

mors 

pars 

accusative 

gentem 

fontem 

mortem 

partem, -im 

dative 

genti 

font i 

morti 

parti 

ablative 

gente 

fonte, -i 

morte 

parte, -i 

genitive 

gentis 

fontis 

mortis 

partis 

plural 

nominative 

gentes 

fontes 

mortes 

partes 

vocative 

gentes 

fontes 

mortes 

partes 

accusative 

gentis, -es 

fontis 

mortes 

partes, -is 

dative 

gentibus 

fontibus 

mortibus 

partibus 

ablative 

gentibus 

fontibus 

mortibus 

partibus 

genitive 

gentium 

fontium, -um 

mortium 

partium, -um 


Oscan maintained the distinction between /-stems and consonant-stems in most case 
forms, including the nominative plural, e.g., tris ‘three’ < *treyes, but meddiss ‘magis- 
trate’, m. nom. pi. < *meddikes, with syncope of short *-e in the final syllable. Even so, 
these stem classes do exhibit cross-paradigmatic borrowings. For example, the /-stem geni- 
tive singular ending -eis was adopted in consonant-stem inflection in all Sabellic languages. 

The 5th declension is an Italic development, though few forms, apart from Umbrian ri 
‘matter’, dat. sg., are attested for this class in Sabellic. In Latin, the 5th declension nouns 
dies ‘day, res ‘property’, spes ‘hope’, and fides ‘trust’ come historically from different 
PIE nominal classes, but sound change and analogical change have led to fully elaborated 
paradigms. Examples in are Table 6.7. 


TABLE 6.7 CLASSICAL LATIN L-STEMS 


singular 

e- stems 


nominative 

dies ‘day’ 

res ‘property’ 

vocative 

dies 

res 

accusative 

diem 

rem 

dative 

diei 

rei, rei 

ablative 

die 

re 

genitive 

diei, die, dii 

rei, rei 

plural 

nominative 

dies 

res 

vocative 

dies 

res 

accusative 

dies, dis 

res 

dative 

diebus 

rebus 

ablative 

diebus 

rebus 

genitive 

dierum 

rerum 
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Diachronic developments 

The Italic languages shared one major innovation in the nominal system, and that was 
the development of ablative singular forms ending in a long vowel + -d. The ending was 
modeled on the o-stem ablative -od, and it was imported into the a-stems, /-stems, and 
M-stems, which yielded ablatives distinct from their genitive forms. The 5th declension 
nouns adopted this ending as well. A few consonant-stem ablatives were generated fol- 
lowing this model, e.g., OL [c]onsoled ‘consul’, m. abl. sg., leged Taw’, f. abl. sg.; in 
other cases consonant-stems opted for the ablative of the /-stem inflection, e.g., OL opid 
‘resources’, f. abl. sg., bovid ‘cow’, m./f. abl. sg. 


Adjectives 

The most common type of adjective declension paired masculine and neuter forms of 
the 2nd declension with feminine forms of the 1st, e.g., bonus, -a, -um ‘good’. In con- 
sonant-stem declension, distinctions in gender were not formally expressed in the nom- 
inative singular, e.g., ferens ‘carrying’, ferox ‘fierce’, duplex ‘double’, ingens ‘huge’ 
( ingentis , gen. sg.), uetus ‘old’ ( ueteris , gen. sg.), etc. For most /-stems, masculine and 
feminine nominatives were distinct from the neuter, e.g .,facilis ‘easy’, m./f., facile n. In 
paradigms such as acer, acris, acre ‘sharp’, a three-way distinction in gender was created 
in the nominative singular. *akris developed regularly to acer in Old Latin; acris was 
reintroduced, based perhaps on the model of facilis, etc. Interestingly enough, in Old 
Latin acris is found as a masculine form and acer as a feminine form. The match-up of 
gender and form found in Classical Latin is the result of standardization on the part of 
elite Latin writers. 

Latin did not have w-stem adjectives; they were transferred to the /-stem inflection, e.g., 
suauis, suaue ‘sweet’ <■ — PIE *swadu-, cf. Greek ljSuq. Present active participles in -«/-, 
originally consonant-stems in their inflectional pattern, were attracted into the orbit of the 
/-stem inflectional type. The ablative singular ends in -/unless the participle is used as a 
substantive, e.g., portantT ‘carrying’, m. dat. sg. The genitive plural is -ium, e.g., portan- 
tium ‘carrying’, m. gen. pi. 

Gradable adjectives formed the comparative by means of the suffixes -ior < *-yos 
(m./f.) and -ius < *-yos (n.), e.g., melior, melioris, ‘better’, m./f., melius, melioris, n. As 
was the case for s-stem masculine and feminine nouns, so in the comparative the oblique 
stem suffix, whose medial -r was the result of rhotacism, was introduced into the nomi- 
native singular. The superlative suffix was -issimus, -a, -um with 1st and 2nd declension 
inflection. The superlative of a few /-stems, whose stem vowel was syncopated, ended 
in -limus due to assimilation of -Is to -//, e.g., *fakilisomos > *fakilsomos > facillimus 
‘most easy’. 


Pronouns 

Pronominal declension includes personal pronouns, which were not inflected for gender; 
anaphoric/demonstrative pronouns, which were inflected for gender; and interrogative 
and relative pronouns, which were also inflected for gender. 

The personal pronouns were idiosyncratic in their inflection, and this is reflected in the 
paradigms in Table 6.8. Case forms of the 1st and 2nd person personal pronouns were 
built on multiple stems, e.g., egS, me-/me, nos-/nos. Some case endings, for example, the 
dative singular endings -hi and -bi, and the dative/ablative plural -bTs, were unique to 
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these paradigms. The genitive singular and plural forms were drawn from the possessive 
adjective paradigms. The Old Latin genitives mis ‘of me’ and tls ‘of you’ are the inherited 
enclitic forms *mey and *tey, extended by the -s of the genitive. The reflexive pronoun 
had the same forms in the singular and plural. 


TABLE 6.8 LATIN PERSONAL PRONOUNS 


singular 

1 st person 

2nd person 

Reflexive 

nominative 

ego (OL ego) 

tu 

— 

accusative 

me (OL med) 

te (OL ted) 

se (OL sed) 

dative 

mihi (OL mihi) 

tibi (OL tibi) 

sibi (OL sibl) 

ablative 

me (OL med) 

te (OL ted) 

se (OL sed) 

genitive 

me! (mis) 

tul (tls) 

sul 

plural 

nominative 

nos 

uos 


accusative 

nds 

uos 


dative 

nobis 

uobls 


ablative 

nobis 

uobls 


genitive 

nostrum , nostrl 

uestrum, uestrl 



The personal pronouns attested in Sabellic languages match up tolerably well with those 
in Latin, e.g., South Picene eku /egd/, nom. sg., Umbrian mehe, dat. sg., Umbrian imom 
/meom/, ace. sg.; Oscan tiium, nom. sg., tlf[el], t(l)fel, dat. sg.; South Picene tefel, dat. 
sg.; Umbrian tefe, dat. sg.; Paelignian uus, nom. pi.; Oscan slfel; Paelignian sefei, dat. sg. 
Paelignian uus ‘to you’, dat. pi., points to a prehistoric *uofos. If this was the Proto-Italic 
form of the dative/ablative, then the endings of Latin nobTs and uobls were refonned to 
*nobeis and *uobeis based the dative singular. In Very Old Latin and Old Faliscan, accu- 
sative and ablative singulars ended in -d, e.g., VOL med ‘me’, ace. sg., Old Faliscan med, 
ace. sg. Why the accusative forms ended up with word-final -d is not easily explained. In 
contrast to Latin and Faliscan, the particle -om was added to the accusative singular forms 
in Sabellic, e.g., Umbrian mlom ‘me’, tiom ‘you’. The 1st person accusative singular was 
remade in Venetic by contamination with the nominative, *eg5 : *me — » ego : mego. 

Gender-bearing pronouns represent a variety of distinct stem types. The stems *e, *ey, 
and *i united to form the paradigm of anaphoric pronouns is, ea, id ‘he, she, it’, in Latin, 
and izic ‘he’, luk ‘she’, ldi'k ‘it’ in Oscan. The Latin demonstrative hie, haec, hoc ‘this’ 
(close to speaker) is not attested as such in any other language, though it is often thought 
to be from the same etymological source as the Sanskrit particle gha ‘certainly’, iste, ista, 
istud ‘this’ (close to addressee) was formed from a particle *es- + demonstrative *to-/ 
ta-. Umbrian este, ‘this’, n. ace. sg., South Picene estas, f. nom. pi. (?), and Presamnite 
estam, f. ace. sg., are cognate. The /-vocalism in Latin is due to the influence of is. ille, 
ilia, illud ‘that’ (distant from speaker) is the Classical Latin form that replaced OLat. 
olle/ollus, cf. Oscan ullels, ‘that’, m. gen. sg. The /-vocalism here may be attributed once 
again to is and perhaps to iste as well. These paradigms shared an inflectional peculiarity 
by which the genitive and dative singular forms were the same regardless of gender, e.g., 
huius, huic; illius, illl; istius, istT. 

The deictic particle -c(e) was often added to inflected pronominal forms in Latin and 
Sabellic. In Latin, demonstrative pronouns to which the particle had been added were 
incorporated into the paradigms in some case forms, e.g., hie, haec, hoc ‘this, these’, 
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nom. sg.; hunc, hanc, hoc, acc. sg. The same was true also for Oscan and Umbrian, e.g., 
Oscan iuk ‘she’ < *eya-ke. 

The paradigm of the Latin relative pronoun is cited in Table 6.9. Some forms (quern, m. 
sg. acc.) go back to the PIE interrogative/indefinite stem *k w i-, others (quod, n. sg. nom./ 
acc.) to the interrogative stem *k w o-. The PIE relative pronoun *yo- was lost in Italic, and 
the relative paradigm was composed of the stem *k w o- and the stem *k w i-. The use of the 
interrogative/indefinite pronouns with relative function is to be attributed to Proto-Italic; 
the paradigms in Oscan and Umbrian are similarly formed. In Classical Latin the relative 
pronoun was distinct from the interrogative and indefinite only in the nominative singu- 
lar. The masculine and feminine relative pronouns were augmented by the deictic particle 
-f, e.g., Latin quT ‘who’, m. nom. sg., < *k w o-I, cf. Oscan pui. 

TABLE 6.9 CLASSICAL LATIN RELATIVE PRONOUN 



relative 

masculine 

feminine 

neuter 

nominative 

qul 

quae 

quod 

accusative 

quem 

quam 

quod 

dative 

cm 

cul 

cul 

ablative 

quo 

qua 

quo 

genitive 

cuius 

cuius 

cuius 

plural 

nominative 

qul 

quae 

quae 

accusative 

quds 

quds 

quae 

dative 

quibus 

quibus 

quibus 

ablative 

quibus (quls) 

quibus 

quibus 

genitive 

quorum 

quarum 

quorum 


VERB MORPHOLOGY 

Verbs were inflected for tense (pres., impfl, fut., perfl, pluperfl, fut. perfl), mood (ind., 
subj., imp.), voice (act., pass.), person (1st, 2nd, 3rd), and number (sg., pi.). 

The Italic languages, excluding Venetic, for which there is too little evidence, reor- 
ganized the verb system inherited from PIE. In Italic, the primary division was into two 
basic stems, generally referred to as the imperfective and the perfective. Three tense 
formations were constructed to each stem: to the imperfective, there was a present, an 
imperfect, and a future; to the perfective, a perfect, a pluperfect, and a future perfect. This 
organizational schema is set out for Latin and Oscan in Table 6.10. 


TABLE 6.10 ORGANIZATION OF VERB SYSTEM 


Latin 

imperfective 

perfective 


present 

imperfect 

future 

damus ‘we give’ 
dabamus ‘we were giving’ 
dabimus ‘we will give’ 

perfect 
pluperfect 
future perfect 

dedimus ‘we gave’ 
dederamus ‘we had given’ 
dederimus ‘we will have given’ 

Oscan 

imperfective 

perfective 


present 

imperfect 

future 

didet ‘he gives’ (Vestinian) 
fufans ‘they were’ 
didest ‘he will give’ 

perfect 
pluperfect 
future perfect 

deded ‘he gave’ 
(unattested) 

tribarakattust ‘he will have built’ 
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In the imperfective, verbs were organized into four present-tense paradigmatic classes, 
commonly referred to as conjugations: conjugation 1, d-stems; conjugation 2, e-stems; 
conjugation 3, /-stems; and conjugation 4, /"-stems. Conjugation 3 had two subtypes: a 
regular 3rd conjugation, in which the stem vowel -/ did not appear before endings of the 
1st singular and 3rd plural (see below, rego ‘guide’); and the 3 id conjugation, in which 
the stem vowel -i appeared in all forms of the present (see below, facio ‘make’). The 
Latin conjugational classes, present tense, are displayed in Table 6.11. Verbs like fern ‘I 
carry’, uolo ‘I wish’, sum ‘I am’, eo ‘I go’, etc., which do not fit neatly into one of the five 
patterns, are described as “irregular”. 


TABLE 6.11 LATIN CONJUGATION CLASSES 



a-stems 

e- stems 

z-stems 

z-stems (-io) 

z-stems 

1 . 

portd 

moneo 

regd 

facio 

sentid 

2. 

portas 

mones 

regis 

fads 

sentls 

3. 

portat 

monet 

regit 

facit 

sentit 

1 . 

portamus 

monemus 

regimus 

facimus 

sentimus 

2. 

portatis 

monetis 

regitis 

facitis 

sentitis 

3. 

portant 

monent 

regunt 

faciunt 

sentiunt 


The imperfect and future tenses were formed on the imperfective stem. The imper- 
fect tense suffix was -ba for all conjugations, e.g., 1st conjugation portabas ‘you were 
carrying’, 2nd conjugation monebas ‘you were warning’. Third and 4th conjugation 
verbs add the imperfect tense suffix to a stem augmented by -e, e.g., 3rd conjugation 
regebas ‘you were guiding’, 3 id conjugation faciebas ‘you were making’, 4th conjuga- 
tion sentiebas ‘you were feeling’. The future tense had two formants: (1) -b/bi-, which 
was added to the present stems of conjugations 1 and 2, e.g., portabo ‘I will carry’, 
monebis ‘you will warn’; and (2) -a (1st sg. only)/-*?, which was added to the present 
stems of conjugations 3 and 4, e.g., ducam ‘I will lead’,/dc/as ‘you will make’, sen- 
tiemus ‘we will feel’ . In Old Latin, imperfects of 4th conjugation verbs could be formed 
from the stem un-augmented by -e, e.g., exaudibat ‘he was listening’, seruibas ‘you 
were preserving’. -M>/-futures could also be formed to verbs of the 4th conjugation, 
e.g., dormibd ‘I will sleep’. These formations are rare in Classical Latin literature. The 
future attested in Faliscan corresponds to the -M>/-formation in Latin, e.g., 2nd conju- 
gation carefo ‘I will be lacking’. 

The stem of the perfective active was formed in a number of ways from a synchronic 
point of view: (1) by suffixation of -u /w/, e.g., portauit ‘he carried’, monuit /monuwit/ 
‘he warned’ < *monewed, sentTuit ‘he felt’; (2) by reduplication, dedit ‘he gave’, beside 
present tense dat, cf. Faliscan pe:parai ‘I produced’, fifiked ‘he fashioned’, fi[fi]qod 
‘they fashioned’; (3) by lengthening of the stem vowel, uenit ‘he came’, rupit ‘he burst’, 
beside present tense uenit, rupit, (4) by altering the quality of the stem vowel, accompa- 
nied by lengthening, e.g., fecit ‘he made’, to facit ‘he makes’; (5) by suffixation of -5, e.g., 
dixit /di:ksit/ ‘he said’, beside dicit ‘he says’. In some cases, the imperfective and perfec- 
tive stems were distinguished only by the personal endings, e.g., present tense uerto ‘I 
turn’, but perfect tense uerti ‘I turned’. 

Perfective active stems with the suffix -u were generally built from the imperfective 
stem of secondary verbs, most of which were denominatives ending in a long vowel, e.g, 
curd- ‘to take care to’ — * cura-u-. Other perfective formations were built to verbal roots 
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or to verbal stems distinct from the stem of the imperfective, so that the morphologi- 
cal relationship between the imperfective stem and the perfective stem was not always 
predictable, particularly outside of the 1st and 4th conjugations, e.g., mone- ‘warn’, but 
monu- /monuw-/ built from the stem *mone-. 

Alongside the regular a-conjugation perfects, e.g., portauisti ‘you carried’, there were 
also “syncopated” or “contracted” perfect forms, e.g., portastT. Such forms were com- 
mon in Republican Latin literature, particularly in poetry, where they provided metrically 
convenient alternatives to longer forms. But despite the labels, these forms were not the 
result of syncope or contraction. Rather, they originated as sigmatic (aorist) forms to 
denominatives, and they were eventually incorporated into the paradigms of the M-per- 
fects, e.g., portastT = porta-s-stT; portarunt< *porta-s-ont. 

The pluperfect suffix for active voice was -era-, the future perfect suffix was -er-l-eri-. 
Both suffixes were added to the perfective stem, e.g ,,portau-era-s ‘you had carried’, dTx- 
era-s ‘you had said’ ; portau-er-o ‘I will have carried’, monu-eri-s ‘you will have warned’. 

Mood 

In addition to the indicative mood, Italic languages had subjunctive mood forms corre- 
sponding to all tenses in the system with the exception of the future and future perfect. 
Forms for each conjugational class are listed in Table 6.12. 


TABLE 6.12 SUBJUNCTIVE MOOD (2 SG. ACT.) 



a-stems 

e- stems 

/-stems 

i-stems(-io) 

z-stems 

present 

portes 

moneas 

dicas 

facias 

sentias 

imperfect 

portares 

moneres 

diceres 

faceres 

sentires 

perfect 

portauerls 

monuens 

dixeris 

jeceris 

sentiuens 

pluperfect 

portauisses 

monuisses 

dixisses 

fecisses 

sentiuisses 


The suffixes marking subjunctive mood are the following: present tense -a for conjuga- 
tions 2, 3, and 4, and present tense -e for conjugation 1; imperfect tense -re; perfect tense 
-erf-; and pluperfect tense -isse-. As the forms in Table 6.11 shows, for conjugation 1 the 
subjunctive suffix -e replaces the stem vowel -a characteristic of the present indicative. 
The final vowel of the perfect tense suffix, to judge from the variation found in Classical 
Latin poetry, could be either long or short. 

The imperative, which was formed on the imperfective stem, was signaled by distinct 
endings rather than by a special mood suffix. Latin had two sets of imperative endings, 
one for the “regular” imperative and one for the so-called future imperative. Active voice 
formations are 2 sg. age vs. 2/3 sg. agito; 2 pi. agite vs. 2/3 pi. agitote. Functionally, the 
so-called future imperative was restricted primarily to legal and juridical contexts. 


Voice and personal endings 

In Latin, verbs were inflected for active and passive voice. Middle voice as a grammatical 
category had for the most part disappeared, although a small number of verbs permitted 
middle voice readings, e.g., lauatur ‘washes herself, bathes’. The so-called deponents con- 
tinued PIE media tantum verbs, e.g., sequitur ‘he follows’, cf. Greek sreeiat, Sanskrit sacate. 

There were three sets of personal endings for verbs (see Table 6.13). One set was for 
active voice, and another set was for passive voice and for deponents. The active voice 
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had two endings for the 1st singular active: -6 for present, future, and future perfect, and 
-m for imperfect, pluperfect, and all tenses of the subjunctive. First singular -m in the 
present tense of the verb ‘be’, sum, esom ‘I am’ is a remnant of the primary athematic 
ending *-mi, which has otherwise been lost in Italic. The perfect tense had its own special 
set of forms in the active. 


TABLE 6.13 PERSONAL ENDINGS 



active 

passive/deponent 

perfect active 

1 sg. 

-d, -m 

-or, -r 

-l 

2 sg. 

-s 

-re, -ris 

-istl 

3 sg. 

-t 

-tur 

-it 

lpl. 

-mus 

-mur 

-imus 

2 pi. 

-tis 

-mini 

-istis 

3 pi. 

-ntl-unt 

-ntur, -untur 

-erunt, -erunt, -ere 


The perfect, pluperfect, and future perfect passives were periphrastic formations made 
up of the perfect passive participle and a finite form of the verb ‘be’. The participle 
was inflected for gender, number, and case; the inflectional features were determined 
by agreement with the subject of the verb. Person and number of the subject were also 
marked on the verb ‘be’. Examples are cited in Table 6.14. 

TABLE 6.14 PERFECT PASSIVES 


perfect 
pluperfect 
future perfect 


portata es ‘you (f.) were carried’ 
portatl eramus ‘we (m.) had been carried’ 
portatus era ‘I (m.) will have been carried’ 


The paradigms listed in Table 6.11 are evidence for a regular alternation in the length 
of the stem-final vowel typical of all Latin verbal paradigms. The morphophonemic alter- 
nations were the result of sound changes that took place in Old Latin. The first change 
shortened long vowels when they stood before word-final obstruents, except for -s, e.g., 
portat> portat,faciam > faciam, but port as, port amus, etc. The second change shortened 
long vowels when they stood before another vowel (so-called vocalis ante vocalem cor- 
ripitur), e.g., *moneo > moneo, *moneas > moneas, *sentles > senties. From a synchronic 
point of the view, the stem-final vowels in conjugations 1, 2, and 4 are to be treated as 
long in underlying forms. In conjugations 3 and 3io the morphophonemic alternation 
between stem-final -i and stem-final -e is best viewed synchronically as a change of -i to 
-e before -r. This morphophonemic change explains the short -e rather than the short -i 
in forms such as caperes, 2 sg. impf. subj. In the underlying representation this verb was 
/kapires/. From a diachronic perspective, caperes goes back to *kapeses. 

Diachronic developments 

The present-tense conjugation classes are the result of the convergence of different PIE 
formations resulting from sound changes and analogical restructurings. 

The 3rd conjugation verbs continue PIE or post-PIE thematic formations of various 
sorts, e.g., Latin agit ‘leads’ < *ageti, elicit ‘says’ < *deyketi, bibit ‘drinks’ < *pibeti, 
poscit ‘demands’ < *prksketi. In Old Latin, the thematic vowels *e and *o changed to i 
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and u, e.g., *dewketi > *dowket > ducit, *dewkonti > *dowkont > ducunt. Athematic for- 
mations, such as nasal infix verbs (e.g., iungit ‘joins’), reduplicated presents (e.g., gignit 
‘gives birth to’), and a few presents to roots ending in a laryngeal (e.g., sonit ‘sounds’ 
< *swenati < *swenh,ti) were added to the thematic conjugation too, though by diverse 
prehistoric routes. 

The 3rd conjugation verbs of the capio- type and 4th conjugation verbs of the sen- 
tio- type were *-ye/yo- formations built directly to verb roots. The assignment of a verb 
to either the 3rd or 4th conjugation was determined by the prosodic structure of the 
root. Prosodically heavy roots and roots ending in a sonorant developed an epenthetic -i 
between the final consonant of the root and the *-ye/yo- suffix. Sound changes then split 
the verbs into distinct paradigmatic classes, e.g., conjugation 3 id, *kapiesi > *kapes > 
capis ‘you seize’, but 4th conjugation, *wenyesi > *weniyesi > uenis ‘you come’. 

A majority of 2nd conjugation verbs were in origin either iterative/causative forma- 
tions with o-grade of the root and a suffix *-eye/o-, or stative formations in *-eh 2 , e.g., 
noces ‘you cause harm to’ < *nokeyesi, sedes ‘you are sitting’ < *sedeh,si. 

The 1st conjugation verbs included denominatives to d-stems, e.g., cura- ‘make sure 
of’ < *koysa-ye/o-; factitives to thematic stems, e.g., noua- ‘makes new’ < *neweh 2 - 
ye/o-; and *-ye/yo- formations to roots ending in a laryngeal, e.g., *swenh,-ye/o- > *swe- 
na-ye/o- > sona- ‘sounds’. 

The Italic perfect tense indicatives are the result of the merger, both formally and 
functionally, of the PIE aorist and the PIE perfect. For most verbs, either an aorist or a 
perfect was selected as the representative of the Italic perfect, although in earlier periods 
of Latin both aorist and perfect formations are attested for a few verbs. So, for example, 
in Old Latin the verb pared ‘I refrain from’ had multiple perfect tense forms: pepercT, 
parcuT, pars T ‘I refrained from’. The verb facio made a reduplicated perfect and a root 
aorist in Very Old Latin: vhe:vhaked ‘he made’ and feced. Latin, Oscan, and Umbrian 
sometimes selected the same perfective form, e.g., Latin dedit ‘gave’, beside Umbrian 
dede. But more often the perfective formation in Latin does not match that of Oscan or 
Umbrian, e.g., Latin dixit ‘said’, sigmatic aorist, beside Umbrian dirsicust, reduplicated 
perfect; and Latin pepulit ‘he struck’, reduplicated perfect, beside Umbrian a(m)pelust 
‘will have struck’, root aorist. 

Although the overall structure of the tense system is the same in Italic, the exponents 
of some “tense” suffixes differed between Latino-Faliscan and Sabellic. For example, the 
Latin and Faliscan future tense for conjugations 1 and 2 was based on the suffix *-b h e/o-, 
whereas Sabellic continued an s-future, in all likelihood, the PIE desiderative, e.g., Oscan 
deiuast ‘he will swear’. The Latin future perfect was a thematic formation added to the 
perfective formant -is, -eri- < *-is-e/o-, whereas the Sabellic formation is athematic -us-, 
e.g., Oscan trfbarakattust ‘he will have built’. The Latin pluperfect suffix was -er-a- < 
*-is-a-, which was built with the same d-suffix found in the imperfect of the verb ‘be’, 
eras ‘you were’ < *es-a-s, and in the imperfect suffix -ba-. The pluperfect is not attested 
in Sabellic. 

The Latin imperfect formations, both indicative *-b h a- and subjunctive *-se-, have 
exact correspondences in the Sabellic languages, e.g., Oscan fufans ‘they were’, 3 pi. 
impf. act., Oscan patenslns ‘should open’, 3 pi. impf. subj. The present subjunctives 
of the 1st conjugation have their e-vowel from the full-grade athematic optative suffix, 
e.g., *noweh 2 -yeh,- > *nouae- > noue-, a stem that was generalized to plural forms in the 
paradigm. This subjunctive formant then spread from 1st conjugation athematics to the 
large class of denominatives in this conjugation, e.g., cure-, cf. Oscan deiuaid ‘he shall 
swear’, 3 pi. pres. subj. < *deywa-e-. The origin of the d-subjunctive of conjugations 2, 
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3, and 4 remains a mystery, but the formation is Italo-Celtic in origin and is found in all 
branches of Italic, e.g., Faliscan douiad ‘let him give’, 3 sg. pres, subj.; Oscan putfad 
‘be able’, 3 sg. pres, subj.; Oscan deicans ‘let them say’, 3 pi. pres, subj.; Oscan fakiiad 
‘let them make’, 3 sg. pres, subj.; and Umbrian tursiandu ‘let them be frightened away’, 
3 pi. pres. subj. 

The distinction between PIE primary and secondary personal endings was, for the most 
part, eliminated in Latin thanks to the loss of -i in the primary endings. In Latin, a dis- 
tinction remained in the 1 st person singular active, where, as noted above, the ending -m 
appeared in past tense formations and the subjunctive, and the ending -o appeared in the 
present, future, and future perfect. The 3rd person singular active secondary ending -d < 
*-t survived in Very Old Latin and Old Latin, e.g., VOL fheivhaked, OL fecid, but was 
replaced by primary -t < *-ti before the beginning of the Classical period. The perfect 
active endings continued the PIE endings for the most part, though they were extended 
by addition of *-i, the hie et nunc particle, in the singular (-T < *-ay, -sti < *-stay, -it < -I(t) 
< *-ey(t)) and 3rd person plural (-ere < *-eri). Third singular -ei was re-characterized by 
the addition of -t, which developed to -ft in Old Latin (OL fueit) and then to -it in Clas- 
sical Latin. In addition to -ere, two additional 3rd plural endings are attested: (1) -erunt, 
which is a blending of -ere and thematic aorist -unt < — *-ond; and (2) -erunt, which is 
from earlier *-is-ont, the thematic aorist ending added to the perfect formant -is-. The 
endings of the Faliscan perfect paradigm appear to be a fusion of thematic aorist and 
perfect endings, e.g., 1 sg. perfect pe:parai ‘I have produced’; 3 sg. aorist Mked ‘he 
has fashioned’; 3 pi. aorist f[if]iqod ‘they have fashioned’. In the Sabellic languages, the 
primary and secondary endings of the active did not merge, apart from the 2nd person, 
and perhaps the 1st person plural. Thus, in Oscan and Umbrian the endings are 1 sg. -u 
(Umbrian stahu ‘I stand’, 1 sg. pres, act.) vs. -um (Oscan manafum ‘I entrusted’, 1 sg. 
perf. act.); -t (Oscan stalt ‘it stands’, 3 sg. pres, act.) vs. -d (Oscan deded, 3 pi. perf. act.); 
-nt (Oscan sent 3 pi. pres, act.) vs. -ens (Oscan dedens 3 pi. perf. act.). In the 3rd plural, 
South Picene and Presamnite continued the thematic aorist ending *-ond < *-ont. These 
languages did not share the change of *-nd to -ns, which is found in Oscan and Umbrian, 
e.g., South Picene adstluh ‘they set up’, 3 pi. perf. < *-ond, Presamnite fofpofl ‘they 
were’, 3 pi. perf. < *-ond. 

The endings of the passive voice were built on the active by addition of ~(o)r or, in the 
case of the 1st plural, the replacement of -5 by -r, e.g., -mur < *-mor. The 2nd singular 
ending -re goes back to the PIE secondary middle ending *-so. The alternative 2nd sin- 
gular ending -ris is a later formation; final -s is the active singular ending. The 2nd plural 
-mini is sometimes taken to be the nominative masculine plural participle; another expla- 
nation points to PIE *-d h we, which received a nasal extension in the manner of Sanskrit 
2nd plural active -thana, that is to say, *-d h we-ney > *-beney > *-bney > *-mney > -mini. 
Neither possibility is convincing. 

The middle endings in the Sabellic languages did not correspond in all particulars to 
those in Latin. In Oscan the 3rd singular and 3rd plural suffixes were -ter and -nter. In 
Umbrian the -ter and -nter are used in primary inflection; -tur was used in secondary 
inflection. 

Non-finite formations 

The verb system also traditionally includes, in addition to the forms inflected for person 
and number, a number of non-finite forms. These include participles (portans , portantis 
‘carrying’), infinitives (portare ‘to carry’, portauisse/portasse ‘to have carried’), supines 
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( perditurn ‘to lose’), gerundives ( uTtandae ‘worthy of living’), and gerunds (faciendo ‘act 
of making’). 

The present active participle was formed by addition of the suffix -nt- to the imperfec- 
tive stem for conjugations 1 and 2, e.g ,,portans < *portants, portantis, and by addition of 
-ent- for conjugations 3 and 4, e.g faciens, facientis . Gender distinctions were in large 
part eliminated. The masculine and feminine forms were the same throughout, and the 
neuter was distinct only in the accusative singular and the nominative/accusative plural, 
e.g facientem, m./fi, vs. faciens, n.; facientes, m./fi, vs . facientia, n. The neuter form 
facientia shows another innovation of the present participle declension in Latin: /-stem 
inflection in the ablative singular and genitive plural. 

Present middle participles in -minus < *-mh 1 no- are preserved in a few words, e.g., 
femina ‘woman’ < *d h eh,(i)- ‘give suck’, alumnus ‘nursling’ < *al- ‘nourish’, but these 
stand outside of the verb system. 

The PIE verbal adjective suffix *-to- is the source of the Latin perfect participle, e.g., 
dictus ‘said’, ductus Ted’. Roots and stems ending in dentals show the regular change of 
*-d-t- and *-t-t- to -ss-, e.g ,,passus from the root pat-,patior ‘to suffer’, with subsequent 
simplification to -5 if the syllable before the suffix was heavy, *kayd-to- > *kaysso- > 
caesus ‘cut’. Oscan, Umbrian, and other Sabellic languages attest numerous examples 
of this formation, e.g., Umbrian screhto ‘written’, Oscan scriftas ‘written’ < *skreyb h -; 
cf. Latin scnptus. 

The future active participle was a Latin development. In most cases, the suffix -urus, 
-a, -um was added to the stem of past participle, e.g., dict-urus ‘about to say’. 

The PIE perfect active participle is not attested in Latin, but it may have survived in a 
few Sabellic formations, e.g., Oscan sipus ‘knowing’ < *sepwos, Volscian sepu < *sep- 
wod, though other explanations of these forms are possible. 

The gerundive is an Italic innovation, but its origins remain uncertain. The suffix -ndo- 
/-endo- was added to the present stem in Latin, e.g., Latin portandum ‘to be carried’, faci- 
endum ‘to be done’. In Oscan and Umbrian -nd- assimilated to -nn, e.g, Oscan upsannum 
‘to be done’, m. ace. sg., and Umbrian pelsans ‘to be buried’, m. nom. sg. 

The supine in -turn is the accusative singular of a tu-stcm verbal noun. It was found 
predominantly in construction with verbs of motion, e.g., Umbrian avef anzeriatu etu 
‘go to observe the birds’, Old Latin abiit ambulatum ‘she went away to take a walk’. In 
Latin the supine in -tu is found after adjectives such as facilis ‘easy’, e.g ,,facilis dictu 
‘easy to say’. 

The present infinitives in Italic have roots in PIE nominal formations. The present 
infinitive in Latin, which ends in -re, is in origin an s-stem locative *-si. The present 
passive infinitive of verbs belonging to conjugation 3 derive from root nouns inflected for 
the dative case, e.g., cap T ‘to be seized’ < *kapey. Present passive infinitives of conjuga- 
tions 1, 2, and 4 are modeled on the active infinitive, e.g., portare ‘to carry’ vs. portarl 
‘to be carried’. An alternative passive ending -ier, which must go back to an instrumental 
case form augmented by the passive suffix -r, is attested in Old Latin and occasionally 
later, e.g., OL figier ‘to be fashioned’. The present infinitive in Sabellic languages was 
an o-stem accusative, e.g., Oscan ezum ‘to be’, moltaum ‘to fine’. The present passive 
infinitive ends in -fi in Umbrian, e.g., pihafi ‘to be purified, which probably comes from 
an instrumental *-d h yeh|. Oscan had the same suffix, but it was outfitted with the charac- 
teristic sign of the passive, e.g., sakaraflr ‘to be consecrated’. 

The Latin perfect active infinitive was modeled on the present. The ending -se was 
added to the perfect stem that ended in the formant -is-, e.g., dTxisse ( dTks-is-se ). The 
corresponding passive is a periphrastic form, e.g., dictum esse. The system of infinitives 
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in Latin was filled out by future active and passive forms, e.g., active dicturum esse ‘to 
be going to say’ and passive dictum m ‘to be going to say’. Corresponding forms are not 
attested in the Sabellic languages. 

SYNTAX 

Nuclear sentences consist of a verb and its dependents. As in older IE languages, Latin 
and the other Italic languages marked the relationships between dependents and verbs 
by case endings. Grammatical subjects were inflected in the nominative, direct objects 
in the accusative, unless the lexical properties of the verb specified a different case, and 
indirect objects in the dative. Non-obligatory constituents, typically adverbial in nature, 
were expressed by a particular case or by a prepositional phrase, e.g., noctu ‘at night’, 
decern dies ‘for ten days’, cum exercitii suo ‘with his own army’. The subject of the verb 
could be omitted if it was recoverable from context. Direct objects could be omitted as 
well, if context permitted. Agreement was found in two areas of the syntax: (1) verbs 
were inflected for the person and number of the subject; and (2) adjectival modifiers 
were inflected for the gender, number, and case of their head nouns. The inflection of 
dependents, omission of subjects, and agreement phenomena are at play in the following 
passage from Caesar’s De Bello Gallico (6.13.1-8). In (10c) the verb concurrit agrees in 
number, 3rd singular, with the subject numerus. In (lOe) the adjective summam agrees in 
gender, number, and case with its head noun, auctoritatem. And again in ( lOe) the subject 
of constituunt is recoverable from the second clause in (lOd). 

(10) (a) In omm Gallia edrum hominum qui aliquo sunt numero atque honore gen- 
era sunt duo. (b) de his duobus generibus alterum est druidum, alterum equitum. 
(c) III! rebus diufms intersunt. sacrificia publica ac priuata procurant. religiones 
intervretantur . (d) ad hos magnus adulescentium numerus disciplinae causa con- 
currit. magndque hi sunt apud eos honore. (e) nam fere de omnibus controuersTs 
publicTs prTuatTsque constituunt. (f) his autem omnibus druidibus praeest unus, qui 
summam inter eos habet auctoritatem. 

(a) ‘In all of Gaul, there are two classes of men who are of some rank and honor, (b) 
Of these two classes, one is Druid, the other knight, (c) The former are concerned 
with religious matters; they take care of public and private sacrifices; they explain 
religious phenomena, (d) A number of young men come to them [Druids] for train- 
ing, and these [Druids] stand in great honor among them, (e) For they [Druids] 
make decisions about almost all public and private disputes, (f) One man stands at 
the head of all of these Druids, who has the greatest authority among them.’ 

Constituent order 

The unmarked arrangement of the major constituents in a sentence in Classical Latin is 
subject-object-verb (SOV), but that order it is not obligatory. SOV order is even more 
frequent in subordinate clauses where textual or pragmatic factors are less likely to have 
resulted in the movement of a constituent to the front or the end of the clause. In (10c), 
(lOd), and (lOe), the underlined verbs, apart from the copular verb sunt, occupy the final 
position in their clauses. In (lOf) the verb habet (underlined) has been displaced from 
final position by the direct object auctoritatem, which has been moved to the end of the 
clause for emphasis, so-called right dislocation. 
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Classical Latin prose permits the distraction of constituents, usually resulting from the 
movement of elements to be emphasized either to the front or to the end of the sentence. 
In (lOf) the noun auctoritatem has been moved out of the object-noun phrase ( summon 
auctoritatem ) and placed at the end of the sentence. In (lOd) the constituents of the noun 
phrase magno honore were distracted and placed at the beginning and end of the clause. 

Classical Latin poets did not adhere to the SOV standard as rigorously as prose writers 
for reasons that have to do in part with metrical necessity and in part with stylistics. The 
poets employ distraction more aggressively, and constituents at all levels of structure may 
be pulled apart for emphasis, focus, or artistic effect such as alliteration and assonance. 
In some instances the parts of constituents were “scrambled” with one another so that 
the continuity of every constituent in the sentence is interrupted. The following is from a 
poem by Horace. Distracted constituents are indexed by superscripts. 

(11) me tabula b sace 7' a | uotTua 0 paries* indTcat uuida c \ suspendisse potentf \ 

uestimenta c marls deo A (Hor. Carm. 1.5.13-16) ‘the sacred wall with (my) 
votive tablet proclaims that I have hung my soaked garments to the powerful god 
of the sea’. 

The SOV arrangement favored by Latin prose authors is often thought to have preserved a 
stage of the language that was changing from “head final” to “head first”. This idea is sup- 
ported by the fact that heads occur before modifiers in other constituents; that is to say, the 
unmarked order is Noun-Adjective (but not for “subjective” adjectives of the bonus type), 
Preposition-Noun Phrase, and Antecedent-Relative Clause. Furthermore, in the plays 
of Plautus, in main clauses, the order VO is statistically about as common as that of OV 
(Adams 1976). This view also accords well with the fact that in sub-literary texts of the first 
few centuries CE, VO order was the default. It is noteworthy that the variable order of con- 
stituents attested in Plautus contrasts with the order found in bureaucratic prose of roughly 
the same time period, which is predominantly SOV (Clackson & Horrocks 2007: 27-29). In 
the decree of the Roman Senate regarding participation in Bacchic cults, which was issued 
in 186 BCE, the order of the major constituents is consistently SOV. This suggests that SOV 
order was becoming, if it had not already become, one of the defining characteristics of elite 
Latin prose, thus distinguishing it from sub-elite varieties of the language. 

The PIE word-order rule known as Wackernagel’s Law, by which enclitic elements 
were positioned after the first word or constituent of a sentence, is attested to some degree 
in Latin. Sentential particles, e.g., enim, uerd, igitur, and autem, as in (lOf) above, reg- 
ularly occupy the 2nd position in a Latin sentence. But it was more common in Latin 
for words and constituents that were highlighted or emphasized to serve as the hosts for 
enclitic elements such as 1st and 2nd person pronominal forms or the verb esse ‘be’. 
Since these highlighted constituents were often moved into the 1st position in their clause 
(or phrase), unaccented pronominal forms and unaccented forms of the verb ‘be’ gener- 
ally came to occupy the 2nd position (Adams 1994a and 1994b). 

The unmarked order of constituents in the Sabellic languages was probably also SOV, 
although one must point out that many of the inscriptions are short and formulaic, and 
modeled on epigraphic types inherited from Greek and Latin. 

Subordinate clauses 

Latin poetry and prose, and Latin administrative prose, were home to a rich array of com- 
plex clause structures, some of which were elaborated throughout the Classical period 
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under the influence of comparable structures in Greek literature (Clackson & Horrocks 
2007: 184—198). A short survey of these structures follows. 


Complementation 

Sentential complements, that is to say, clauses that functioned as one of the arguments of 
a verb, had two main structures in Latin. The structure of the complement depended to a 
significant extent on the meaning of the verb. Some verbs governed a finite clause, others 
a non-finite clause containing an infinitive and an accusative subject. 

1. The Accusative + Infinitive construction was the rule after verbs of saying 
( dTcere ), thinking (putare , arbitrarT ), knowing ( scire , cognoscere), understanding 
( intellegere ), and perceiving ( audTre ‘hear’, uidere ‘see’, sentTre ‘perceive’). The 
accusative was the subject of the infinitive and was rarely omitted from surface 
structure, even when it was co-referential with the subject of the main clause, as it 
is in (12). 

(12) dTcebas quondam [solum tp nosse Catullum \ \ Lesbia (Catul. 72.1-2) 

‘You used to say [that you knew only Catullus], Lesbia.’ 

In the passive counterpart of this construction, cited in (13), the underlying subject 
of the infinitive EpamTnondas has been moved out of the Accusative + Infinitive 
clause to be the subject of the main clause and for this reason is inflected in the 
nominative. In such sentences modifiers of the subject were also inflected in the 
nominative. 

(13) EpamTnondas [fidibus praeclare cecinisse] dTcitur (C ic. Tusc. 1.4) 
‘Epaminondas is said [to have played brilliantly on the lyre].’ 

2. After verbs and verb phrases of “emotion”, e.g., gauded ‘rejoice’, indoled ‘regret’, 
or moleste fero ‘find annoying’, two structures were permitted: Accusative + Infini- 
tive and quod ‘that’ + indicative. 

3. Interrogative particles (num ‘whether’, -ne ‘if’), adverbs (ubi ‘where’), and pronom- 
inal forms (quis, quid ‘who, what’) introduced indirect questions. The verb was in 
the subjunctive mood. 

(14) senex dTcitur quaesisse num illud carmen desipientis uideretur (Cic. Sen. 22) 
‘The old fellow is said to have asked whether this seemed the poem of a man 
out of his mind.’ 

4. Indirect commands were introduced by verbs of ordering ( iubere , imperare), forbid- 
ding ( uetare ), permitting ( sinere ), exhorting ( hortari ), requesting ( rogare ), resolving 
(i decernere ), and warning (monere). Most verbs took subordinate clauses of the finite 
type. Such clauses were introduced by ut(T)lne, and the verb was in the subjunctive 
mood. The verbs iubere ‘order’ and uetare ‘forbid’, however, took the Accusative + 
Infinitive construction. Compare sentences (15) and (16). 

(15) hortatur eos [ne animo deficiant] (Caes. Civ. 1.19) 

‘He encourages them [not to lose heart].’ 

(16) [Labienum iugum montis ascendere ] iubet (Caes. Gal. 1.21) 

‘He orders [Labienus to ascend the ridge of the hill].’ 
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5. Complement clauses after verbs of fearing have a structure similar to indirect com- 
mands, as do complements following verbs of hindering, though the conjunctions 
that introduced them were distinct. The conjunctions ne / ne non introduced comple- 
ment clauses after verbs of fearing ( tirnere , uereri), and quin introduced complement 
clauses after verbs of hindering ( impedire ‘impede, obstruct’, prohibere ‘prevent, 
forbid’). 

The range of complement clause types was more limited in sub-elite Latin of the imperial 
period. The number of verbs that took the Accusative + Infinitive construction decreased; 
complements introduced by quod or quia extended the contexts in which they appeared 
and eventually become the norm. The Accusative + Infinitive construction survives only 
after verbs of perception in Romance languages. 

Adverbial clauses 

Other subordinate clause types functioned as adverbial modifiers. These clause types 
covered a range of functions, including time, cause, purpose, concession, comparison, 
and result. Adverbial clauses in Latin were, with one exception to be discussed below, 
finite clauses. 

1 . Purpose clauses were introduced by the conjunction ut ‘so that’. The verb was in the 
subjunctive mood. If the clause was negated, the conjunction was ne. Result clauses 
were similar in structure to purpose clauses, but if the clause was negated, it was 
introduced by ut non. Even if the result clause referred to an actual event, the mood 
of the verb was subjunctive. 

2. Temporal clauses were introduced by the following conjunctions: cum, ut, ubi 
‘when’; dum ‘while’; dum, donee, quoad ‘as long as’; dum, donee, quoad ‘until’; 
postquam ‘after’; and antequam, priusquam ‘before’. The mood of the verb was 
indicative in most cases, but in clauses that referred to events that were anticipated 
or generic, the subjunctive was making inroads in Classical Latin. 

3. Causal clauses introduced by quod, quia, quoniam, and quando, ‘because, since’ 
generally took a verb in the indicative mood, but the conjunction cum required a 
subjunctive. 

4. Conditional clauses were introduced by sT ‘if’. If the relationship between the verbs 
in the sF-clause and the main clause was real, the verbs were in the indicative mood. 
If the relationship between the verbs in the clauses was considered possible or was 
counterfactual, the verbs were in the subjunctive mood. 

5. The ablative absolute construction consisted of a noun phrase as subject and a par- 
ticiple as predicate, and both were inflected in the ablative case. Nominal structures 
were also possible, in which case the subject phrase was followed by a predicative 
adjective or noun. The absolute was in origin a circumstantial clause that stood in a 
variety of adverbial relationships to the main verb. Ablative absolutes were rarely 
introduced by conjunctions, so the subordinate relationship was determined by the 
meaning of the predicate and the sentential context. The participle expressed relative 
time. 

Prototypical absolute constructions were short, consisting of a noun and a participle (17). 
In Old Latin this was almost always the case, but during the Classical period, at the 
hands of authors who were influenced by comparable structures in Greek, this clause 
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type became longer and more complex in its structure. Imperial prose writers such as the 
historian Tacitus sometimes specified the subordinate role of the absolute by adding a 
subordinating conjunction (18). 

(17) Caesar [acceptTs littens \ nuntium mittit (Caes. Gal. 5.46) 

‘[After the letters were received], Caesar sends a messenger.’ 

(18) alii \ quamu Ts ejfugid patente] interiere (Tac. Ann. 15.38.7) 

‘Some died, although an escape route was open.’ 


Relative clauses 

In Classical Latin relative clauses typically followed the antecedent noun phrase, partic- 
ularly when they were restrictive, but other arrangements were possible. Relative clauses 
were sometimes placed inside the phrase containing the antecedent, as in (19), or placed 
in initial position for reasons having to do with style, as in 20, which reflects an order 
common in legal texts. Antecedent phrases and relative pronouns are marked in bold. 

(19) ab ifs [quos mfserat \ exploratoribus 0 et ab RemTs cognouit . . . (Caes. Gal. 2.5.4) 
‘he learned from those scouts that he had sent and from the Remi . . . ’ 

(20) [quibus ita est interdictum ] hi numero impiorum ac sceleratorum habentur (Caes. 
Gal. 6.13.7) 

‘[to whom it has been banished in this manner], these (folks) are considered in the 
number of the impious and the wicked.’ 

Other relative clause structures are found in Classical Latin. The correlative type, exem- 
plified in (21) and (22), has comparable structures within Italic and more widely within 
Indo-European. In this type the relative clause occupied the 1st position in the sentence, 
and the antecedent was incorporated into the relative clause. The antecedent was then 
referenced in the main clause by a pronominal fonn, as in (21), or by the antecedent and 
its pronominal modifier, as in (22). 

(21) [quae pars cTuTtatis calamitatem populd Romano intulerat ] ea prmceps poenas 
persoluit (Caes. Gal. 1.12) 

‘ [Which part of the state had brought disaster on the Roman people], that (part) first 
paid the price. ’ 

(22) [qijem AGRUM EOS UENDERE heredemque sequi licet] IS AGER UECT1GAL NEI S1ET (CIL 
F.584.5) 

‘Which property it is permitted for them to sell and to follow an heir, that property 
shall not be subject to taxation.’ 

Relative pronouns were sometimes subject to a syntactic process known as “attraction”. 
Relative pronouns could be attracted into the case of their antecedents, as in (23), where 
the accusative object of consuestT is attracted to the genitive case of eorum. In (24) the 
antecedent urbem was attracted into the case of relative pronoun quam (attractio inuersa 
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‘inverse attraction’). Inverse attraction is well attested in Old Latin, but is rare in Classi- 
cal Latin. As a result, Classical Latin practice could well be an archaism. 

(23) sf aliquid agas eorum [quorum consuesti ] (Cic. Fam. 5.14) 

‘If you should do any of the things that you were accustomed to do.’ 

(24) urbem [quam statud] uestra est (Virg. Aen. 1.573) 

‘The city [that I am founding] is yours.’ 

Sequence of tenses 

In indirect questions and indirect commands the tense of the verbs in the subordinate 
clauses was governed by the tense of the main verb. Verbs that were present or future 
governed a subordinate verb that was present or perfect; verbs that were past tense (imper- 
fect, perfect, pluperfect) governed a subordinate verb that was imperfect or pluperfect, as 
in (25). Note that the tense of the verb in the original question in (25) was perfect tense 
(‘Why did you all come to my house?’). This syntactic rule, which is also known as back 
shifting, is attested in the Sabellic languages. 

(25) (Lentulus) quaesTuit a GallTs quam ob rem domum suam uenissent (Cic. Cat. 3.11) 
‘He (Lentulus) asked the Gauls why they had come to his house.’ 

Other subordinating structures 

Subordinate ideas could also be conveyed by non-finite structures, including participles, 
gerundives, and supines. A selection of the possibilities is discussed below. 

Purpose could be indicated by a supine when in construction with verbs meaning 
‘come’ and ‘go’ (26). After the preposition ad, purpose could be indicated by a gerundive 
phrase that agreed with its head noun in gender, number, and case (27). Gerundives indi- 
cating subordinate ideas were the rule after verbs such as curare ‘to take care (that such 
and such happens)’ (28). 

(26) legatos ad Caesarum mittunt [ rogdtum auxilium ] (Caes. Gal. 1.11.3) 

‘They send legates to Caesar to ask for assistance.’ 

(27) [ad bella suscipienda] Gallorum alacer est animus (Caes. Gal. 3.19.6) 

‘The character of the Gauls is keen to undertake wars.’ 

(28) [pontem in Aran faciendum] curat (Caes. Gal. 1.13.1) 

‘He takes care to build a bridge over the Arar.’ 

Participles serving as attributes of noun phrases conveyed a wide range of subordinate 
ideas. In (29) the phrase a grum Gallicum diuidenfi had the function of a temporal clause 
introduced by ‘while’. In (30) the bracketed phrase, which is an attribute of Dionysius, 
had a causal function. 

(29) C. Flaminio restitit [agrum Gallicum diuidenfi \ (Cic. Sen. 11) 

‘He resisted Gaius Flaminius [while he [Flaminius] was apportioning Gallic 
territory].’ 
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(30) Dionysius [cultros metuens tonsores] candente carbone sibi adurebat capillum 
(Cic. Off. 2.25) 

‘Dionysius, because he was afraid of the barber’s shears, used to singe his hair with 
a piece of white-hot coal.’ 


LEXICON AND WORD FORMATION 

From a historical perspective the Latin lexicon is made up of strata of vocabulary, the 
deepest of which can be traced back to the parent language. Latin preserved IE vocab- 
ulary in a wide range of semantic spheres, e.g., kinship terms {pater ‘father’, mater 
‘mother \frater ‘brother’, soror ‘sister’), basic numbers ( duo ‘two’, tres ‘three’, quattuor 
‘four’, quinque ‘five’, etc.), body parts (pes, pedis ‘foot’, manus ‘hand’), domesticated 
animals (sus ‘pig’, ouis ‘sheep’), weather terms (nix, niuis ‘snow’, pluit ‘it’s raining’), 
and so forth. 

A few words, primarily of a religious nature, can be assigned to the Proto-Italic period, 
e.g., Latin sacer ‘sacred’, nom. sg. m., Umbrian sacru ‘sacred’, nom. sg. n.; Latin sacrem 
‘sacrificial victim’, ace. sg., Umbrian sakre, ace. sg., Oscan sakrim, ace. sg.; Latin 
sanctum ‘inviolate’, Oscan saahtum; and Latin plus ‘devout’, Oscan pilhiul, dat. sg. 
A smaller number of words belong to the secular sphere, e.g., Latin uti ‘to use’, Oscan 
ulttiuf ‘usage’, nom. sg.; Latin cena ‘meal, banquet’, Oscan kersnu ‘banquet’, Umbrian 
gesna ‘meal’. In addition, a few words that are IE in origin underwent semantic devel- 
opments peculiar to Italic. PIE *deyk- ‘to point out, show’, for example, acquired the 
meaning ‘say’ in Latin and Oscan. 

Divergences existed in the Italic lexicon as well. In Latin the words filius ‘son’ and filia 
‘daughter’ replaced the PIE words for ‘son’ and ‘daughter’, but they were preserved in the 
Sabellic languages, e.g., Oscan puklul, ‘son’, m. dat. sg., South Picene puqloh, m. dat. 
sg. < *putlo-; Oscan futlr ‘daughter’, f. nom. sg., futrel, f. dat. sg. < *d h ugh 2 ter. 

Borrowings in Latin cut across the chronological periods of the language. A few words 
have come from a Sabellic language of central Italy, e.g., bos ‘cow’, rufus ‘red’, and 
popina ‘cook shop’. Further afield, Celtic incursions into the peninsula beginning in the 
6th century BCE provided a small number of words, e.g., carrus ‘wagon’. In the 3rd 
century BCE, as contact with Greek speakers intensified, Greek loans began to enter the 
language in greater numbers. The plays of the Roman comedian Plautus are filled with 
Greek loans in semantic spheres ranging from business and finance to medicine and edu- 
cation. Among the Roman elite who took up the study of philosophy, oratory, literature, 
and philology, Greek was the source of many technical terms. The words were borrowed 
with varying degrees of accommodation to the Latin phonological and morphological 
systems, depending on the date of the borrowing, on the education of the borrower, and 
on the reason for the borrowing. For some Greek technical terms, new Latin words were 
coined rather than borrowed, e.g., essentia ‘substance’ for Greek ouaia; for others, exist- 
ing Latin words were semantically extended to cover Greek concepts, e.g., casus ‘acci- 
dence’ for Greek 7iTcocn.c;. 

Throughout the history of Latin, productive morphological processes augmented the 
lexical inventory. Abstract nouns and result nouns were derived from verb roots and verb 
stems by suffixation of -men (fragmen, fragminis ‘fragment’), -mentum (fragmentum , 
fragment T ‘fragment’), -ion- ( occTdio , occidionis ‘slaughter’), and -turn- ( aedificatid , 
aedificationis ‘construction’). The suffixes -(i)tat- ( bonitas , bonitatis ‘goodness’) and 
-(i)tut- ( uirtiis , uirtutis ‘manliness’) also formed abstracts from adjective stems. Agent 
nouns were made with the suffix -tor, which was added to verb roots and verb stems, 
e.g., amator, amatoris ‘lover’, scriptor, scnptoris ‘writer’. The adjective suffix -anus. 
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which originated as a re-segmentation of adjectives derived from 1st declension nouns, 
was very productive, e.g., 3rd declension urban us ‘of the city’, 2nd declension mundanus 
‘of the world’, on the model of 1st declension Romanus ‘of Rome’. The verb suffix -a 
was extracted from 1st conjugation verbs and was used to form denominatives from con- 
sonant-stems, e.g., nominare ‘to call by name’, built on the stem nomin- ‘name’. The 
suffix -(i)ta, which originally formed iterative/intensives, eventually replaced the corre- 
sponding simple formations, e.g., cantare ‘to sing’ to canere, iactare ‘to throw’ to iacere, 
dictare ‘to say’ to dTcere. Compound verbs were formed by addition of prefixes to verb 
stems, e.g., ducere ‘to lead’, ediicere ‘to lead out’. Compound noun formations were 
not as productive in Latin as they were in IE languages like Greek, but compounds such 
as agricola ‘farmer’, auceps ‘bird catcher’, aurifex ‘goldsmith’, and iudex ‘judge’ may 
point back to a more productive period. In classical literature, compounds began to reap- 
pear in greater numbers, but they were created on models from Greek literature, e.g., 
armiger ‘bearing arms’. 


FURTHER READING 

Editions of Latin literary texts are published in numerous text series. The Oxford Clas- 
sical Texts series is the standard. Latin epigraphic materials are collected in the Corpus 
Inscriptionum Latinarum. Rix (2002) is an editio minor of Sabellic inscriptions. Venetic 
inscriptions published before 1987 are in Prosdocimi 1987. Venetic inscriptions recov- 
ered after 1987 are published annually in Studi Etruschi, in the section on Italic epigraphy 
(Rivista di Epigrafia Italica). 

Clackson 2011 covers the Latin language from all angles. Rosen 1999 focuses on the 
development of literary Latin. Historical Latin syntax is treated in the multi-volume work 
edited by Baldi and Cuzzolin (2009). Allen’s Vox Latina (1978 2 ) is the basic introduction 
to Latin phonology. A number of excellent reference works in the comparative/historical 
tradition have been published in the last 20 years. Baldi 1999, Clackson and Horrocks 
2007, Meiser 1998, Sihler 1995, and Weiss 2009 are particularly useful. For the pre- 
history of the Latin verb, see Meiser 2003. For discussion of VOL and OL inscriptions 
Vine 1993 and Wachter 1987 are essential. The following etymological dictionaries cover 
Latin and the other Italic languages: Emout and Meillet 1985, Untermann 2000, and de 
Vaan 2008. For Latin dialects from regional and social perspectives, see Adams 2007 and 
2013. Bilingualism in the Roman world is the topic of Adams 2003. 

Fortson 2010, chapter 13, is a concise and informative introduction to Italic and the 
Italic languages (Venetic is discussed in Chapter 20, “Fragmentary Languages”). Wallace 
2007 is an overview of the Sabellic languages. 
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PART 7 


CELTIC 


Patrick Sims-Williams 


INTRODUCTION 

The surviving Celtic languages fall into two groups: (a) the Brythonic (or Brittonic or 
British) group (Temes 2011); and (b) the Gaelic (or Goidelic or Irish) group. The two are 
very distinct and have been mutually unintelligible for well over a millennium. To the 
Brythonic group belong Welsh, spoken widely in Wales, and Breton, spoken in the west 
of Brittany. Cornish, the language of Cornwall, which was very similar to Breton, died 
out as a natural language in the eighteenth century. To the Gaelic group belong Irish (or 
Irish Gaelic), spoken mainly in the west of Ireland, and Scottish Gaelic, spoken mainly 
in the west of Scotland. Manx, the Gaelic language of the Isle of Man, died out as a 
natural language in the twentieth century. Celtic languages are also spoken in the Amer- 
icas, as a result of the modem diaspora of Celtic-speaking peoples, so that, for example, 
there are Welsh/Spanish bilinguals in Patagonia in Argentina, and Scottish Gaelic/English 
bilinguals in Nova Scotia in Canada. 

All these surviving Celtic languages (including Breton!) are known collectively as 
Insular Celtic languages, as opposed to the ancient Continental Celtic languages, e.g. 
Gaulish, Galatian, Celtiberian, etc., which were all dead by ad 500 and mostly much 
earlier. The term “Insular” refers to the two islands of Ireland and Britain. From these the 
Gaelic and Brythonic languages spread: in about the fifth century ad Scotland was settled 
by emigrants from Ireland (the Scotti), while Armorica was settled by emigrants from 
southern Britain, becoming known as Brittany (Breton Breiz < *Brittia). The theory that 
the Breton language includes a substratum of indigenous Annorican Celtic is unproven 
but not impossible. 

The Brythonic dialects began to diverge into West British (> Welsh) and South-West 
British (> Cornish and Breton) in about the fifth century ad, but probably remained mutu- 
ally intelligible for several centuries. The Gaelic dialects began to diverge in about the 
tenth century ad, but there was a common literary language to the end of the Middle 
Ages, and even today Irish and Scottish Gaelic are much more similar than are Breton and 
Welsh, which have long been mutually unintelligible. The geographical reason is obvi- 
ous: there has always been easy travel between Scotland and northern Ireland. Another 
factor is that Welsh has been in contact with English, whereas Breton has been in contact 
with French. These contacts have influenced the syntax, morphology and phonology of 
the languages; for example, there are nasalized vowels in Breton and French, but not in 
Welsh and English. 

Within most individual surviving languages there are marked dialectal differences, so 
that Breton speakers from north and south may find it easier to communicate in French, 
while Gaelic speakers from north and south may prefer to talk English together. Nearly 
all adult Celtic speakers are bilingual. 

A general impression of the divergence of the Celtic languages may be gained from 
comparing the ordinal numerals in Gaulish (McCone 1994: 208 = 2005: 300, Lambert 
2002: 105-109, Hamp 2012) with those in Old Irish and Middle Welsh; see Table 7.1. 
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TABLE 7.1 COMPARISON OF THE ORDINAL NUMERALS 


Gaulish (first century ad) 

Old Irish (eighth century) 

Middle Welsh (thirteenth century) 

1 

cintux((?)mos) 

cetn(a)e 

kyntaf 

2 

al(l)os 

tan(a)ise, aile 

eil 

3 

trito(s), tr(itios) 

tris 

trydyd, f tryded 

4 

petuarios 

cethramad 

pedwyryd, f. pedwared 

5 

pinpetos 

coiced 

pymhet 

6 

suexos 

se(i)ssed 

chwechet 

7 

sextametos 

sechtmad 

seithvet 

8 

oxtumetos 

ochtmad 

wythvet 

9 

namet(os) 

nomad 

nawvet 

10 

decametos 

dechmad 

decvet 


Celtic 

The term “Celtic”, as applied to the Insular Celtic-speaking peoples and their languages, is a 
modem one. These peoples did not refer to themselves and their languages as “Celtic” until 
recently. For example, the medieval Irishmen were Goidil (an opprobrious name derived 
from Brythonic, cf. W gfydd ‘wild’), and their language was Goidelach, and medieval 
Welshmen regarded themselves as Brython (< Lat. Brittones ) or Cymry (< *kom-brogT 
‘co-countrymen’) and their language was Cymraeg. (The Welsh themselves do not use the 
English name Welsh < OEng. w(e)alh ‘foreigner, or slave, mostly speaking a Romance or 
Gallo-Brythonic language’. This may derive ultimately from the Continental ethnic name 
Volcae.) There is no evidence that medieval Brythonic and Gaelic speakers recognized their 
special linguistic kinship; this was discovered by early comparative philologists such as 
the Scot George Buchanan (in his Rerum Scoticarum Historia, 1582) and the Welshman 
Edward Lhuyd (in his Archaeologia Brittanica, 1707). The modem Romantic idea of a pan- 
Celtic ethnic unity and “Celtic national character” has had some influence even in the Celtic- 
speaking countries, but really derives from foreign works such as Ernest Renan’s La Poesie 
des races celtiques (1854) and Matthew Arnold’s The Study of Celtic Literature (1866). 

The linguistic term “Celtic” derives from the usage of ancient and early medieval Greek 
and Latin writers, who only use it of Continental Celtic languages; for example, as late 
as the ninth century, Heiric of Auxerre, Vita S. Germani 1.353, explains the place-name 
Augustidunum as meaning ‘Augusti mons’ in Celtica lingua. Celtica lingua, in fact, seems 
to have been equivalent to Gallica lingua, the term used, for example, in the sixth cen- 
tury by Venantius Fortunatus, who explained the old Gaulish place-name Vernemetis as 
“fanum ingens” (‘great temple’) ( Carmina I.ix.9-10). Because close ethnic and linguistic 
similarities between Gaul and Britain had been noted by writers from Tacitus ( Agricola , 
11) down to the Renaissance, it seemed reasonable to early modem scholars to apply the 
term “Celtic” to Brythonic as well. Thence it was extended to the languages of Ireland and 
Scotland, on comparative philological grounds, even though the Insular Celtic languages 
may never have been called “Celtic” in Antiquity (Sims-Williams 1998; cf. Blom 2009). 

Despite its dubious origin, the term “Celtic” remains a useful label for a distinct family 
of IE languages. The phonological and lexical similarities between Gaulish, Brythonic 
and Gaelic are illustrated by the above forms Augustidunum (i.e. Gaul. *-dunon) and 
Vernemetis (i.e. Gaul. *Wer-nemeton). With the first compare OIr. dim ‘fort’, OBret. 
din gl. ‘arx’, Com. dvn, OW din (note /u:/ > /u:/ > T\J in Brythonic). These Celtic cog- 
nates are distinct from the Germanic cognates such as OEng. tun (> Eng. town) in not 
showing the effect of Grimm’s Law (/d / > /t/; see p. 20, 392-393). With the intensive 
prefix Ver- ‘ingens’ compare OIr. for < *wor- < *wer- and OWCB guor- < *wor- 
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< *wer- (Subdivision, p. 358). These cognate forms differ from IE cognates such as 
Gr. u;isp in showing the well-known Common Celtic loss of IE /p/, i.e. *uper > *wer-. 
Lastly, Gaul, nemeton ‘fanum’ is probably related to Lat. nemus, Gr. vepo<; ‘grove’, but 
the dental formation is only paralleled within Celtic, for example, OIr. nemed [n'cpbS] 
(where p represents a nasal bilabial fricative and cf. p. 367) glossing sacellum ‘sanctuary’ 

< *nepe|tan < *nemeton, and Old Breton personal names in -nemet [neped] and Old 
Welsh ones in -nimet [naped], both < British *nemeton. This formation is already seen in 
the Celtic name *Nemetios in an Etruscan inscription of the fifth century bc at Genoa: MI 
NEMETIES ‘I am [the tomb] of Nemetios’ (de Simone 1980). 

Origins 

The original “homeland” of the Celtic speakers is unknown - but in any case the sim- 
plistic concept of a “homeland” is of limited validity, both in terms of ethnogenesis and 
in terms of language origins: a people may comprise diverse ethnic elements, and a lan- 
guage may derive from various sources (e.g. the English are ethnically both Celtic and 
Germanic, and the IE element in English comes via Latin and French as well as from 
Germanic). There are three main approaches to the problem. 

Archaeological approaches have failed to locate the first speakers of Celtic. Archaeolo- 
gists used to associate Celtic speech with either the early, so-called Hallstatt Iron Age cul- 
ture or the later, so-called La Tene, Iron Age culture, or with both (see the maps in Powell 
1980: 48, 115; cf. Mallory 1989: 96-107). While it is likely that Celtic speakers were found 
in these Hallstatt/La Tene areas, as in later centuries, we can rarely equate language with 
material remains: a correlation between La Tene finds and Celtic speech may work quite 
well in eastern Europe (Falileyev 2014), but it does not work in Spain or northern Italy, 
where Celtic inscriptions fail to correlate with La Tene material. More recent archaeolo- 
gists, dissociating Celtic from the above Iron Age cultures, have speculated that a form of 
IE was already spoken in north-western Europe by 4000 bc and gradually developed into 
Celtic in situ (this phase began with Renfrew 1987: 249; cf. Meid 1989, Mallory 1989: 
274). This extreme view is possible only provided that (it is a massive proviso) we can 
believe that communications over the four millennia bc would have allowed many parallel 
developments to take place in the Late IE and Early Celtic dialects. The fact is that we can 
only speculate about when Indo-European and then Celtic were first spoken on the Atlantic 
seaboard, including Britain and Ireland. The latest possible date for Celtic reaching Ireland 
is the first century ad (just before the Celtic names attested in Ptolemy’s Geography), and 
this has indeed been proposed as near the actual date (Schrijver 2014: 81). Archaeology 
can neither disprove nor confirm this proposal, and so far no light has been cast on these 
problems by ancient (or modem) DNA (Sims-Williams 2012a). 

A second approach to the problem is to examine the references to Keltoi, Celtae, Galatae, 
etc. by Greek and Latin writers. There are several problems here: (a) These ethnic labels, 
which are of uncertain etymology (Sims-Williams 2011), may not always correspond to our 
modem linguistic use of the term Celtic (see p. 353 above), (b) Mediterranean observers 
may not have distinguished clearly between barbarian peoples such as Celtae and Ger- 
mani, and probably simplified the true situation, especially at first (just as even today the 
French tend apply the term Anglais to all the inhabitants of Britain), (c) There are very few 
references to “Celts” before 400 bc, the period when their violent expansion into Italy, the 
Balkans, Greece and Turkey began to bring them to the serious attention of Mediterranean 
writers. Earlier Greek writers associate them variously with Spain (the Marseilles periplus 
in the sixth century bc, if correctly reported in Avienus’ Ora maritima), with the hinterland 
of Marseilles ( Massilia ) (Hecataeus of Miletus, c. 500), and with the far west of Europe, 
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including the source of the Danube (!) in Pyrene = (?) Pyrenees (Herodotus, c. 450). (See 
Powell 1980: 11-15, Schmidt 1986a: 15, Tovar 1986: 79-80, Sims-Williams 2016.) Obvi- 
ously these Greek writers were chiefly familiar with the Celts nearest to them, but there may 
well have been peoples calling themselves “Celts”, or who were called “Celts” by others, in 
northern Europe too. As we cannot prove this, however, the early ethnographical testimo- 
nies are of limited help in defining the “Celtic” area of Europe before 400 bc. It used to be 
thought significant that Avienus’ names for the inhabitants of Ireland ( Hierni , see Conso- 
nantism, p. 353) and Britain ( Albiones cf. MW el\yd ‘world’ < *albyo-, cognate with Lat. 
albus ‘white’, etc.) had specifically Celtic etymologies. We cannot be sure that these names 
originated in the British Isles, however, and anyway their linguistic Celticity is uncertain 
(Sims-Williams 1998: 20-21, Schrijver 2014: 78). 

The third approach to the problem is to examine the distribution of apparently Celtic 
personal, place- and ethnic names (Sims-Williams 2007a, Raybould & Sims-Williams 
2009, Falileyev et al. 2010) and the distribution of Celtic-language inscriptions, coin 
legends, graffiti and so on. Here, too, there are serious obstacles: (a) such data are of 
course chiefly recorded in the vicinity of Mediterranean cultures, where writing systems 
developed (on the orthographies of the Celtic languages see Russell 1995: 197-230 and 
de Hoz 2007); and (b) they mostly belong to the period shortly before the Christian era 
or still later, too late, that is, to shed light on the areas in which Celtic speech originated. 



MAP 7.1 LATIN PROVINCIAL LAPIDARY INSCRIPTIONS BEARING CELTIC PERSONAL 
NAMES 

Source: Marilynne E. Raybould and Patrick Sims-Williams, Introduction and Supplement to the Corpus of 
Latin Inscriptions of the Roman Empire Containing Celtic Personal Names , CMCS Publications, Aberyst- 
wyth, 2009, map 2 (sample = compound names, the most recognizable type of Celtic name) 
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MAP 7.2 DENSITY OF CELTIC-LOOKING ANCIENT PLACE-NAMES 

Source: Marilynne E. Raybould and Patrick Sims- Williams, Introduction and Supplement to the Corpus of 
Latin Inscriptions of the Roman Empire Containing Celtic Personal Names, CMCS Publications, Aberys- 
twyth, 2009, map 16 (created by M. Crampin on the basis of data in Sims- Williams 2007a: 164-165) 


Position within IE 

The position of Celtic among the IE languages was approached in various ways during 
the twentieth century (cf. Schmidt 1979: 197). (1) Pokorny, Wagner and others empha- 
sized Celtic’s divergence from the IE model and explored the possibilities of substra- 
tum or “areal” influences from non-IE languages, for example, Semitic and Hamitic, 
mostly on the vague basis of typology (see Greene 1966, Eska 2010b, Broderick 2013). 
Regrettably, too little is known about the early history of the non-IE languages with 
which Celtic-speakers were in contact, for example, the language of the non-Celtic Pic- 
tish inscriptions in Scotland (Jackson 1980) or Aquitanian/Basque in France and Spain. 
(2) In reacting to this trend, Dillon, Watkins, Meid and others stressed the degree to which 
Celtic could be explained by “forward reconstruction” from IE (Watkins 1962: 7); but 
at the same time they regarded Celtic as particularly archaic, preserving hypothetical IE 
features lost in other less “marginal” IE dialects (e.g. the original distribution of primary 
and secondary verbal endings, Personal Endings, p. 372). (3) Reacting in turn to this, 
scholars such as Rix and Cowgill argued that the peculiarities of Celtic could mostly be 
explained as internal developments and did not require significant modification of the 
traditional IE model reconstructed on the basis of Indo-Iranian and Greek (see McCone 
1986: 222-223). (4) Other linguists still insisted “on the high importance of Old Irish 
and other genetically western branches for the reconstruction of proto-Indo-European” 
(Hamp 1987). Probably there is truth in all these approaches. 
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Celtic cannot be grouped with any other branch in an IE subdialect, unless arguably 
with Italic. The long-held theory of “Italo-Celtic” has had periods out of favour (see 
Watkins 1966, Schmidt 1991) but remains resilient (see Eska 2010a: 22, Weiss 2012, 
Schrijver 2015). Some of the similarities, such as the retention of the mediopassive in -r 
(shared with Hittite and Tocharian), can be regarded as shared archaisms. Others, such 
as the apparent equivalence of the Latin future in -bo and the Old Irish / : future, may be 
illusory (Future Stem, p. 375). Others, such as the o-stem genitive singular in -T (also 
in Messapic, but not found in Celtiberian (p. 360) or Osco-Umbrian), and superlatives 
of the type Osc. nessimas ‘proximae’: Gaul, neddamon (gen. pi.), OIr. nessam, OBret. 
nesham ‘nearest’, may be due to diffusion over the long period during which Italic and 
Celtic speakers lived side by side (cf. Eska 2010b: 546). (In fact, the Italic influence on 
Celtic has never ceased, owing to continual borrowings from Latin, which have had a 
fundamental influence on Celtic word formation, as well as supplying many loan words.) 
The shortage of striking Italo-Celtic phonological developments (e.g. Watkins 1966, Zair 
2012: 271) has been seen as a strong argument against the Italo-Celtic theory. 

There is much less evidence for any deep connection between early Celtic and Germanic, 
despite the fact that they were spoken in close proximity by the time of Julius Caesar and 
Tacitus. There are no shared innovations in phonology and morphology, and the number of 
significant lexical correspondences was formerly exaggerated (see Campanile 1970, Evans 
1981, Polome 1983, Schmidt 1984, 1986b: 205-206, Schumacher 2009). According to the 
minimal approach of Evans (1981: 248), only about a dozen early Germanic words are 
certainly borrowed from Celtic, for example, Goth, reiks ‘ruler’: Gaul, -nx < IE *regs (cf. 
Chapter 1); Goth, eisarn ‘iron’: Gaul. Isamus (personal name), OIr. iarnn, W haearn (< 
*(h)3i(h)am-) < *isamo-; OHG ledar ‘leather’: OIr. lethar, W lledr < Celt. *letro- < IE 
*pl(e)-tro- (cf. Lat. pellis, etc.). The Insular Celtic languages were, of course, influenced 
by Germanic through English at a later stage, and there is also a slight Old Norse element. 

The fact is that Celtic shares some isoglosses with almost all other IE languages, 
and these can be selected to support many different theories (cf. Schmidt 1985, 1986b: 
202-206, Stalmaszczyk & Witczak 1995). Similarly, scholars wishing to stress the 
peripheral and/or archaic nature of Celtic can and have noticed certain remarkable reten- 
tions, for example, masculine and feminine forms of the numerals ‘three’ and ‘four’ in 
Modern Welsh: tri chi ‘three dogs’, fair cath ‘three cats’; pedwar ci ‘four dogs’, pedair 
cath ‘four cats’. Generalizations about the affiliations of Celtic and about its archaic 
or innovative tendencies are usually subjective, lacking a statistical basis. Statistical 
attempts to compare the IE languages on the basis of “core vocabulary” (Elsie 1990: 318) 
or of occurrences under Pokomy’s roots (Bird 1982: 1 19-120) are open to objections, but 
do agree on Celtic’s closeness to Germanic and (to a lesser extent) to Latin/Italic, and a 
phonological link between the three, attributed to some version of Dybo’s Law, has been 
seen in the pre -tonic vowel shortening exemplified by Olr.fer, OEng. wer, Lat. uir ‘man’ 
versus Skr. vira- (e.g. Matasovic 2012). Some form of the hypothesis of “Late Western 
Indo-European” (Meid 1968: 53) could explain such convergence. A more selective and 
rigorous comparison favours an especial closeness to Italic (see Eska 2010a: 22). 


Subdivision 

The internal subdivision of the Celtic languages has not yet been agreed by scholars; 
either the data are inadequate, or the reality is more complex than a simple genetic model 
would allow. 

A traditional division distinguishes “P Celtic”, in which IE *k w > *p, and “Q Celtic”, 
in which *k w remained: Gaelic and Celtiberian are Q Celtic, whereas Brythonic, Lepontic 
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and most (but perhaps not all) Gaulish material is P Celtic. In view of the originally allo- 
phonic nature of the *k w /*p alternation in Celtic (Consonantism, p. 363), most linguists 
would now agree that “the ‘isogloss’ between P-Celtic and Q-Celtic is structurally triv- 
ial” (Watkins 1966: 32 n. 7, following Hamp 1958: 211). Nevertheless, the P/Q division 
has had a great influence on archaeological speculation, and “P” and “Q” remain useful 
labels for the important and valid distinction between Brythonic (or possibly “Gallo- 
Brythonic”) and Gaelic. 

Some linguists use the term “Insular Celtic” (see Introduction, p. 352) in a more than 
geographical sense, emphasizing a special relationship between Gaelic and Brythonic 
(Greene 1966, McCone 1986: 262; cf. Evans 1988: 219). It is true that these groups share 
certain important developments - such as the development of morphophonemic mutations 
of initial consonants (Consonantism, p. 364), the system of absolute and conjunct verbal 
endings (Personal Endings, p. 372) and periphrastic constructions with preposition + 
verbal noun such as OIr. oc precept, W yn pregethu ‘preaching’ (Sims-Williams 2015) - 
but it could be argued that these developments might also have occurred in Gaulish if this 
were attested after ad 500 (Sims-Williams 1984: 147-148). Logically, it is impossible to 
establish a genetic relationship between two dialects on the basis of a shared innovation 
occurring after all other dialects have died out. An opposing approach, which still leads to 
the same negative conclusion, is to argue that a feature like the absolute/conjunct system 
occurs in Insular Celtic but not (apparently) in Continental Celtic because the Continental 
dialects were less archaic, having drifted towards Greek and Latin, or even Late IE inno- 
vations (Meid 1986: 120-121). The fundamental problem, however, is the chronological 
disjuncture between our evidence of Insular and Continental Celtic. A single example 
illustrates this. We noted above (Introduction, p. 353) that *wer (< *uper) became *wor 
in Brythonic and Gaelic, whereas Ver- is attested in Gaulish. Is this innovation an Insular 
Celtic isogloss (Schrijver 1995: 129, 464)? The change *wer > *wor (due to influence 
of *wo < *upo ‘under’ or simply to rounding after /w/, cf. Evans 1967: 279) must in 
fact be late in Brythonic, since in ad 725 Bede, Chronica niaiora 434, records the name 
of a fifth-century Briton as Uertigernus (> later Uortigern, Gwrtheym). As the change 
*wer > *wor did not occur before the fifth century ad in Brythonic, it is possible that it 
would also have occurred in fifth-century Gaulish as well, if this was attested - for there 
is evidence of shared phonetic developments between British and Late Gaulish (Fleuriot 
1978). Moreover, /wer/ > /wor/ might occur quite independently; for example, there is 
evidence for it in Spain (Tovar 1986: 89). 

The term “Gallo-Brythonic” emphasizes the similarities in language (e.g. name forma- 
tions) between Gaulish and British, which were already noted by ancient writers (Intro- 
duction, p. 353). It is debatable whether these similarities are due to the many ethnic 
movements across the English Channel (already mentioned by Julius Caesar) or go back 
to a genuine “Gallo-Brythonic” dialect of Celtic, genetically distinct from Gaelic, Celti- 
berian, Lepontic and so on (cf. Evans 1988: 220, Fleuriot 1988, Lambert 1994: 17-19, 
Schrijver 1995: 463^-65). The evidence for both the “Gallo-Brythonic” and the “Insu- 
lar Celtic” hypotheses can be undermined by postulating “areal” developments (Sims- 
Williams 2007b: 1-42, Stifter 2008, Eska 2010a: 24—25, Lambert 2010). 

An ambitious attempt at a Celtic family tree by Schmidt (1988: 235), combining the 
ideas of “P Celtic” and “Gallo-Brythonic” with a distinction between an “ em/en - lan- 
guage” (i.e. Gaelic), which diverged very early, and the “am/an- languages” (i.e. Celti- 
berian and most or all the P Celtic languages), failed to convince (cf. Tovar 1986: 84 
n. 3, Evans 1983: 29-31, McCone 1991:22,48-50, 161). It was based on the treatment of 
IE /m/ and /n/ in initial position and before plosives, e.g. IE *mb h i ‘about’ > CIb. amPi-, 
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Gaul., ambi-, W am versus OIr. imb (see further de Bernardo Stempel 1987: 38, 51, 
121). A secondary Gaelic development of /-an/ > /-en/ was suggested by Cowgill (1975: 
49), and some scholars postulated a Proto-Celtic allophone [re] before nasals (e.g. Hamp 
1965: 225, Joseph 1990: 126 n. 10). McCone (1996: 67-79) argued convincingly that /m/ 
and /n/ gave CC /am, an/ and that raising of the /a/ to /e/ or /i / occurred in Primitive Irish 
(cf. Schrijver 1993). 

The earliest connected written material in Celtic languages comes, as would be 
expected, from the Mediterranean world - Italy, France and Spain - in the second half of 
the first millennium bc (slightly earlier in the case of some northern Italian inscriptions: 
Uhlich 2007: 379). For other areas at this period there are only place-, personal and ethnic 
names (e.g. Sims-Williams 2007a, 2012b), but even these are of some value. For exam- 
ple, judging by the onomastic evidence, the language of the Galatians in Asia Minor was 
similar to Gaulish (Eska 2013a). Again, there is obvious Gaulish material in the “Thra- 
cian” and other eastern onomastic corpora (Falileyev 2009, 2013, 2014). Here, however, 
I shall concentrate on the Celtic languages from which more than names and odd words 
are known (see also Eska & Evans 2010). 

Lepontic is the name given to the language of inscriptions first found within a 50 km 
radius of Lugano, in north-western Italy and Switzerland, and arguably distinct from 
Cisalpine Gaulish (Lejeune 1970, Karl & Stifter 2007: 45-73, Uhlich 2007, Stifter 2008: 
285-286, Eska 2010a: 24, Eska & Evans 2010: 35). They date from the sixth to the first 
century bc and are written in the Lugano alphabet. This alphabet does not distinguish 
between /p t k/ and /b d g/ and avoids double letters; for example, the Lepontic personal 
name ANOKOPOKIOS corresponds to Gaul. Andocombogios. Here /nd/ > /nn/ is a Lep- 
ontic peculiarity, but the suppression of the nasal in KO(m)P may be purely graphic. Like 
Gaulish (Consonantism, p. 364), Lepontic distinguishes between two sibilants, both seen 
in ISOS [?i‘sos] < *istos ‘that (man)’ in an inscription from Vergiate, now in the archae- 
ological museum in Milan (Lejeune 1970: 444-452, Eska & Mercado 2005): PELKUI 
PRUIAM TEU KARITE ISOS KALITE PALAM = ? Belgiii bruwyam Dewu garite, i'sos 
kalite palam = ?‘Dewu enclosed the construction for Belgos; he erected the stone.’ The 
word for ‘construction’, the accusative of *bruwya, recalls Gaul, briva ‘bridge’ < *b h rewa, 
but the closest cognates are in Germanic, for example, OS bruggia ‘bridge’ < *b h ruw-yo-. 
The word possibly meaning ‘stone’, pala, which is common in Lepontic inscriptions, is 
of unknown etymology. The verbal stem *gar- may derive < *gr-: IE *g h er- ‘to enclose’, 
as in OIr. gort ‘field’, W garth ‘enclosure’ (but cf. Hamp 1991); and *kal- may derive < 
*kl-: IE *kelH- ‘to raise, rise’, cf. Gaul, celicnon ‘building’, Lat. collis, OEng. hyll ‘hill’ 
and possibly Ce/ft~‘?ex alted ones’. The dental preterites are obscure, but seem to be con- 
firmed by the Gaulish verbs KarniTu (3 sg.), KarniTus (3 pi.) found farther south in Gaul- 
ish inscriptions in Lugano script at Todi and at San Bemadino di Briona (Eska 2007/8). In 
the Todi inscription, which is bilingual, KarniTu corresponds to LOCAVIT ET STATVIT. 

Celtiberian is the only Hispano-Celtic language known from inscriptions as well as 
proper names. (The Celticity of the Lusitanian and “Tartessian” inscriptions is gener- 
ally rejected.) The “Celtiberian” inscriptions come from north-eastern Spain, in exactly 
the same region that the Celtiberi are located by ancient writers (de Hoz 2007, 2010-). 
The earlier inscriptions, such as the most famous one from Botorrita 20 km south of 
Zaragoza (c. 100 bc), are written in the native Iberian script (Eska 1989, Eichner 1989: 
23-55 (= Karl & Stifter 2007: 9^14), Meid 1994a, Prosper 2008, de Bernardo Stempel 
2010). This did not distinguish voiced and voiceless stops, like the Lugano alphabet, 
and it had the added ambiguity of using syllabic characters for these sounds (Pa, Ca, 
Ta, Pe, Ce, Te, etc.); e.g. the Ti symbol could denote /ti/, /di/, /t/ or /d/. Its s = /s/, but 



360 PATRICK SIMS-WILLIAMS 


5 = /z/ or /8/ by lenition of *s and *d according to Villar (1995); for further theories see 
Stifter 2008: 271-273. Inevitably, then, interpretations of Celtiberian writings are dis- 
puted. Later inscriptions are in the Roman alphabet, which should be less ambiguous; 
however, opinions still diverge. For example, Kodderitzsch (1985) offered the following 
reading and interpretation of the first- or second-century ad inscription from Penalba 
de Villastar (cf. Villar 1991): ENIOROSEI VTA TIGINO TIATVNEI [-ME11] ERECAIAS 
TO LVGVEI ARAIANOM COMEIMV ENIOROSEI EQVEISVIQVE OGRIS OLOGAS 
TOGIAS SISTAT LVGVEI TIASO TOGIAS ‘To Enior(o)sis and to Tiatu of Tiginos we 
bestow furrows, and to Lugus a field; to Enior(o)sis and to Equaesos, Ogris submits 
the protections of the fertile-land, to Lugus the protections of the scorched-land.’ Meid 
(1994a and 1994b) and others, however, offer completely different interpretations (see 
Jordan Colera 2004: 375-390; de Bernardo Stempel 2008). Thus, Meid (1994a: 36-37) 
translates: ‘To the mountain-dweller as well as to . . ., to Lugus of the Araians, we have 
gone on a field-procession (reading TRECAIAS ); for the mountain-dweller and horse- 
god, for Lugus, the head of the community has put up a covering (i.e., a house or a hall), 
(also) for the thiasus a covering’. And Prosper (2002) has ‘In Orosis and as far as the 
Tigino reaches, to Lugu we consecrate the fields. In Orosis and in Equeiso, the hills as 
well as the ploughed fields and the houses are consecrated to Lugu, that is, the houses of 
the bounded area’. Notable phonological points here include: 

1. the loss of /p/ (if Kodderitzsch ’s etymologies are correct) in (a) er{e)caia- ‘furrow’ 

< IE *perk- (cf. W rhych, Lat. porca, Eng .furrow); (b) ol(o)ga- ‘fertile-land’ < IE 
*polg( h )a, *polka (cf. Gallo-Latin olca > Fr. ouche, Eng. fallow < Gmc *falga); and 
(c) tiaso ‘burnt land’ < *teposo- (cf. OIr. tee ‘hot’ < *tepe-, Lat. teped ); 

2. the retention of IE /k w / in -que ‘and’ (Lep. -pe, arch. OIr. -ch) - this is used here 
alongside the connective uta, cognate with Skr. uta and OBret. ut (Eska 1990, 
Sims-Williams 2007b: 234-235); 

3. the apparent (orthographical?) retention of IE ley/ in com-{m)ei-mu, ‘we bestow’ 

< IE *mey- according to Kodderitzsch (cf. Skr. mavate ‘exchanges’, Lat. munus < 
*moy-nes-, OIr. moln ‘treasure’ < *moy-ni-, MW mwyn ‘value’ < *mey-no-) or < IE 
*h,ey- ‘go’ according to Meid, Prosper and de Bernardo Stempel; 

4. the retention of final Iml (not > Ini) in the acc. sg. ar(a)ianom (‘field’, according 
to Kodderitzsch, cf. OIr. airim, MW ardaf ‘I plough’ < *ary-, or, according to de 
Bernardo Stempel, a neuter adjective related to W lawn ‘just’) - Meid’s gen. pi. 
ethnonym in -om (not -Mm!) is unlikely; 

5. in sistat ‘puts’ < *sistati (or 3 pi. *sistanti according to Prosper) the apocope of pri- 
mary *-i, as in Lat. sistit (Personal Endings, p. 372-373) - although some analyse 
sistat as if = *sistaz < old secondary *sistat; 

6. epenthesis in er(e)caias, ol(o)gas, ar(a)ianom, and Enior(o)sei (Eska 1996). 

In morphology note the o-stem genitives in -o ( Tigin-o , tias-o) rather than the -T found 
elsewhere in Celtic and Italic (Position, p. 357; cf. McCone 1994: 96 = 2006: 83, Eska 
1995, Schmidt 1995: 252-253). 

Gaulish inscriptions begin in about the third century bc, but the bulk of material comes 
from the first century bc to the second century ad (Stifter 2012). The earlier material 
is mostly written in Greek script, with a few texts in the Lugano script from Cisalpine 
Gaul. The later material is mostly in Roman letters. To the inscriptions on stone which 
have long been known can now be added important long texts on metal plates, nota- 
bly from Chamalieres, Larzac and Chartres (Lambert 2002, Viret et al. 2013); these are 
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transforming, and confusing, our understanding of the Gaulish language(s). The fol- 
lowing example of Gaulish is a stone inscription from Alise-Sainte-Reine (first century 
ad): MARTIALIS DANNOTALI IEVRV VCVETE SOSIN CELICNON ETIC GOBEDBI 
DVGIIONTIIO VCVETIN IN ALISHA (Lambert 1994: 98-101, Eska 2003) ‘Martialis 
[son] of Dannotalos offered to [the god] Ucuetis this edifice, and to the smiths who hon- 
our (?) Ucuetis in Alisia’. Note here -e < *-ey in the dat. Ucuete and -n < *-m in the 
acc. sg. celicnon (the source of Goth, kelikn ‘tower’). Morphological points of interest 
include the genitive singular in -i in Dannotali, the dative (or instrumental?) plural in -bi 
in gobedbi, the third-person singular preterite in -u (cf. Lepontic) and the indeclinable 
relative particle io in dugiionti-io (Pronouns, p. 370). 

Apart from two curse -tablets from Roman Bath (Mullen 2007), British and its suc- 
cessors, Primitive Welsh, Primitive Cornish and Primitive Breton, are known only from 
proper names in inscriptions and Latin texts, which provide, nevertheless, detailed infor- 
mation about phonological developments (Jackson 1953, Sims-Williams 1990, 1991, 
2003). From c. ad 800 onwards we have manuscript glosses, memoranda and so on in Old 
Welsh, Old Cornish and Old Breton (Temes 2011). (In addition, some Welsh poetry may 
be as early as the sixth century, although transmitted in late manuscripts.) The orthog- 
raphy of Old Welsh, Old Cornish and Old Breton (OWCB) is based on a pronunciation 
of Latin found in Britain, the main feature of which was that medially Latin consonants 
underwent “lenition” (Consonantism, p. 364), so that Latin words like medicus, dec- 
imals were pronounced [meSigsh], [degipa:dsh]. Consequently, similar values were 
assigned to letters in writing OWCB; for example, [degped] (‘tenth’ < *dekametos (see 
Table 7.1 on p. 353)) would be written decmet. 

Primitive Irish (McCone 1994, 1996, 2005) is known from proper names in inscrip- 
tions and Latin texts, and its phonological development is clear in outline (McCone 1996, 
Sims-Williams 2003: 296-301). Most Irish inscriptions of the fifth, sixth and seventh 
centuries ad are in the Ogham alphabet (McManus 1991, Ziegler 1994, Sims-Williams 
2007b). This is well suited to carving on wood and stone, since only straight strokes 
are used, e.g. ///// = /k w /, ////= /k/. The phonology of the earliest Ogham inscriptions is 
archaic, for example, distinguishing between /k / and /k w / and (probably) between /g/ and 
/g w /, and showing the old case endings. The earliest extant manuscripts containing Old 
Irish (glosses and short texts) are eighth century, but it is probable that some texts extant 
in late manuscripts were written down towards the end of the sixth century. Most Old 
Irish material is written in an orthography based on the British pronunciation of Latin 
described above (for exceptions see Harvey 1989, Russell 1995: 224); hence, medial and 
final [b d g] are written p t c, and medial and final [fl 5 y p] are written b d g m. Medially 
and finally double consonants may be used to avoid ambiguity; e.g. [b] may be written bb, 
and [k] may be written cc. Palatalized consonants are indicated by flanking vowels, e.g. 
macc [mak], maicc [mak J ], beirid [b J er j a5 ] ],feraib [TerafP], In modern edited texts, diph- 
thongs are generally distinguished by placing the length mark on the i; e.g. ai and ui are 
diphthongs (the former also written aeloiloe), but ai and ui denote [a:] and [u:] followed 
by palatalized consonants (in the phonetic transcription j marks a palatalized consonant, 
as in OIr. nemed [n'cp'od], Introduction, p. 354). 

THE PHONOLOGY OF COMMON CELTIC 

The following features would generally be accepted as defining “Celtic”. For ease in 
consulting handbooks (e.g. Lewis & Pedersen 1961), I give IE sounds in their traditional 
reconstructions without prejudice to the actual phonetic reality of /b h /, etc. Also, not much 
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attention is paid to laryngeals, for which Celtic provides little independent evidence (cf. 
Hamp 1965, Joseph 1982, Ringe 1988, Lindeman 1988, 1989: 291-293, McCone 1994: 
71-73 = 2005: 35-38, Zair 2012). Almost nothing certain is known about the fate of the 
IE free accent in Celtic nor about the development of the very different initial and pen- 
ultimate stress accents of Gaelic and Brythonic respectively (cf. Koch 1987, Lindeman 
1992, Schrijver 1995: 16-22). There is already evidence for penultimate accentuation in 
Continental Celtic (de Bernardo Stempel 1995). 


Vocalism 

The IE short vowels /i e a o u/ remained in Common Celtic. A laryngeal regularly fell 
together with /a/, e.g. Gaul, atir (Larzac inscription), OIr. athair ‘father’ [ajjafl]. The IE 
long vowels /i: a: u:/ remained in Common Celtic, but IE /e:/ > /i:/, e.g. *regs (: Lat. rex ) > 
Gaul, -rix, OIr. ri, W rhi, and IE /o:/ > /a:/, e.g. *donom (: Lat. donum ) > OIr. dan ‘gift’, W 
dawn (with /au/ < h:l < /a:/) ‘gift’, except when IE /o:/ occurred in final syllables, where it 
became CC /u:/, e.g. *b h ero > *biru > OIr. -biur ‘I carry’ and Gaul, delgu ‘I hold’ (cf. Eska 
1995: 34). One gap in the resulting Common Celtic system of vowels (shown in Table 7.2). 

TABLE 7.2 THE COMMON CELTIC 
VOWEL SYSTEM 


i: i uu: 

e o 

a 
a: 


was filled by the development /ei / > /e: 2 / (cf. Gr. steikhein ‘to walk’: Early OIr. -tegot 
[t j e:yod] ‘they go’ = Later OIr. -tiagat [t j iaysd]), but /ei/ > /e: 2 / may not have been completed 
in Common Celtic. (See p. 362 for CIb. ei, and note dat. -ei in Lepontic as well.) The other 
gap was filled in Gaelic by /eu au ou/ > /o: 2 / (later alternating with /uo/) and in Brythonic 
by /eu ou/ > /o: 2 / (later >/u :/), but in Common Celtic /eu ou au/ remained diphthongs, as did 
/ai/ and /oi/. There was, however, an early tendency, seen already in Lepontic and Gaulish, 
for /eu/ > /ou/, e.g. *teuta ‘people’ > touta (> OIr. tuath [tua]?], MW tut [tu:d]). 

The semi-vowels /w/ and /y/ remained in Common Celtic, and indeed survive to this day 
in Welsh. Vocalic /m, n, r, 1/ developed as vowel + consonant (/am, em, an, en, ar, al/) or 
consonant + vowel (/ri, li/), depending on context (see de Bernardo Stempel 1 987, McCone 
1996: 67-79); there is often apparent divergence between /em en/ in Gaelic and /am, an J 
elsewhere (Subdivision, p. 358). The so-called long resonants, that is, vocalic /m, n, r, 1/ 
+ laryngeal /H/, developed mostly as /ma:, na:, ra:, la:/, e.g. IE *grH-no- > OIr. gran, W 
grawn ‘grain’ (: Lat. granum). The derivation of reflexes with short /a/ is debated: for exam- 
ple, does OIr. tarathar, W taradr ‘auger’ < *tara-tro-n come from *trH-, or is tara- due 
to vowel harmony in *tera-tro-n coming from *terH-? The latter is probable. (See Joseph 
1982, de Bernardo Stempel 1987: 43^15, Lindeman 1988, Schrijver 1995: 87.) 

Consonantism 

/m, n, r, 1, s/ remained basically unchanged (on 5 see Stifter 2012). Final -m became 
-n in most Celtic languages, but not in Lepontic, Celtiberian (p. 359-360) and some 
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Gaulish. As in other IE languages, /s/ had an allophone [z]; this is not differentiated 
from S in Continental Celtic writings, but in Insular Celtic [z] > [8], e.g. Gaul. TASGO-: 
OIr. Tadg (personal name). As in Armenian, IE /p/ was completely lost, apparently via 
/f/ (Eska 2013b); it had already been reduced to /h/ or /zero/ when classical writers 
borrowed the name of the Hercvnia silva in central Germany (: IE *perk w us ‘oak’, cf. 
Evans 1979: 531-532). Another commonly cited example is the classical name for Ire- 
land/the Irish, Ierne/Hierni, but it is doubtful whether this name is Celtic and cognate 
with Skr. pfvarf, Gr. nieipa ‘fat, rich’ (Schrijver 2014: 77-78). Among the plosives 
there is no trace of an original palatal series (Gaelic palatalization arose much later, 
p. 367), and the only trace of the aspirate/unaspirate distinction is between the labiovelars 
/g w / > CC Ibl and /g wh / > CC /g w / (Cowgill 1980, Sims-Williams 1981, 1995). For 
IE /g w / note OIr. beo, W byw ‘living’: Lat. uTuus\ OIr. imb ‘butter’: Lat. unguen; the 
only certain exception is before /y/, which delabialized /g w /, e.g. OIr. nigid [n'iy j 38 j ] 
‘washes’ < *nig w -ye-ti and W g'iau ‘sinews’: Ved. j(i)va- ‘bowstring’, A v.jiia- ‘bow- 
string, sinew’, Gr. |3i6<; ‘bow’. For IE /g wh / note OIr. gonaid ‘wounds’, W gwanaf" I 
wound’: Gr. (povoq ‘murder’, IMvco ‘I strike’, Hitt, kuenzi, Skr. hand ‘kills’. By the 
time of Old Irish, /g w / was delabialized, but there is probably an old symbol for /g w / in 
the Ogham alphabet (p. 361). In Brythonic, initial /g w -/ > /gw-/, probably directly and 
not via /w-/ in view of the early Welsh inscription GVANI, which predates the general 
change of IE /w-/ > W /gw-/ (Sims-Williams 1995). In Gaulish /g w -/ > /w-/ is possible, 
if the first-person singular verb uediiumi (? uediiu mi) in the Chamalieres inscription is 
cognate with W gweddi ‘prayer’, OIr. guide ‘prayer’ < ?*g w edya, Gr. noMco ‘I wish’, 
Ptv.jaidiia- etc. (Cowgill 1980). The fate of non-initial /g w / < /g wh / is debated. In Welsh 
it appears to have had the same result intervocalically as /g/, i.e. loss (e.g. MW de 
‘bums’ < IE *d h eg wh -). The words nv/‘snow’ and deifio ‘to burn’ (/= [v]) are probably 
wrongly cited in the handbooks as reflexes of IE intervocalic / g wh / ; nyf probably shows 
a special treatment of g"' h after Ini (cf. Lat. ninguis), while deifio is not from IE *d h eg wh - 
at all (Sims-Williams 1995). 

The above data are best explained by the following chronology (cf. Sims-Williams 
1981: 227), starting from the system 

TABLE 7.3 PRE-CELTIC STOPS 

(p) t k k» 

b d g g w 

b h d h g h g wh 


Common Celtic merged /g w / and Ibl as Ibl. Then de-aspiration, occurring throughout the 
system (/b h / > Ibl, /d h / > /d/, etc.), created a new /g w 2 / < /g wh /. The relative chronology 
of /p / > /zero/ is uncertain, but it was the gap in the system resulting from the loss of 
/p/, which made possible the allophonic alternation between “Q Celtic” with [k w ] and “P 
Celtic” with [p,] < [k w ] (Subdivision, p. 357): 

TABLE 7.4 COMMON CELTIC STOPS 

[p 2 ] t k [k»] 

b d g g w 2 
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The most interesting combinatory change among the consonants was the development 
of a new dental phoneme, written S3, ss and so on, in Gaulish (and apparently with special 
symbols in Lepontic, p. 359), from /d/ + l\l , /t/ + /t/, lxl+ Is/, etc. (Evans 1967: 410-420). 
Note also a general development of /xt/ < /pt, kt /. Structurally, a more important devel- 
opment was the widespread rise - as in Germanic - of intervocalic geminate consonants 
contrasting with single consonants in the same position (Kurylowicz 1960: 259-273, de 
Bernardo Stempel 1989). Such geminates are often shown in Gaulish, but are excluded 
by the scripts used for Lepontic and Celtiberian. In Insular Celtic geminates developed 
differently from single consonants, for example, OIr. maicc [mak j ] ‘of a son’ < Ogham 
MAQQI = *mak w k"T, but MW meib ‘sons’ < *mapl < *mak"7; the single /p/ is “lenited” 
(i.e. voiced), but the geminate /k w k w / is simplified. In addition to the phonemic distinction 
between /VCV/ and /VCCV/, there arose, probably already in Common Celtic, an allo- 
phonic distribution of [C] and [CC] in other environments (Harvey 1984). The evidence 
for this comes partly from the phonemicization of reflexes of these allophones in Insular 
Celtic (Consonantism, p. 365, 367), partly from the analogy of Romance, which may 
reflect Celtic substratum influence here (Martinet 1952; cf. Villar 1995). Ultimately, this 
was of great importance in external sandhi, giving rise to the Insular Celtic system of 
initial mutations (Russell 1995: 231-257, Sims-Williams 2007b: 43-58). For example, 
*esyo kattos ‘his cat’ > OIr. a chatt [a xat] (lenition), MW y gath [i ga:]a] (lenition), but 
*esyas kkatus ‘her battle’ > OIr. a cath [a kaja] (no mutation), MW y chat [i xa:d] (spi- 
rant mutation). A further set of mutations was produced after old nasals (Consonantism, 
p. 366), e.g. OIr. a catt [a gat] ‘their cat’, MW vyg cath [va pha:]a] ‘my cat’. These initial 
mutations, which began as sandhi phenomena, were grammaticalized in Insular Celtic, 
for example, as markers of relative clauses (O hUiginn 1986). 

THE PHONOLOGY OF EARLY BRYTHONIC 

The accent in later British fell on the penultimate syllable, which became the ultimate 
syllable after the loss of final syllables c. 500. A full range of vowels was therefore pre- 
served in the final syllables of OWCB words, whereas pre -tonic vowels tended to have 
been shortened, reduced or syncopated. (Much later, by c. 900, at least in Welsh (Sims- 
Williams 2003: 289), the accent shifted from the ultimate to its present position on the 
penult, except in the Vannetais dialect of Breton.) Schrijver 1995 and Temes 2011 are the 
most up-to-date handbooks. 

Vocalism 

IE vowel quantity was at first retained, but by the sixth century ad a new quantity sys- 
tem applied automatically even in stressed syllables; according to this, Primitive Welsh, 
Cornish and Breton vowels were short in [VCC] syllables and long in [V(C)] syllables 
(Sims-Williams 1990: 250-260). In stressed syllables, the Common Celtic vowels and 
diphthongs developed mainly as follows: 

/i/ > Pr. W /i(:)/ (written <y> in later W), Pr. Corn., Pr. Bret. /i(:)/ 

/e/ > Pr. W/Corn./Bret. /e(:)/ 

/a/ > Pr. W/Corn./Bret. /a(:)/ 

/o/ > Pr. W/Corn./Bret. /o(:)/ 

/u/ > Pr. W/Corn./Bret. /u(:)/, also Pr. Com./Bret. /o(:)/ (note: /u(:)/ is written <w> in 
Welsh, <ou> in Breton) 
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/i:/ (< IE /e:/ and /i :/) > Pr. W/Corn./Bret. /i(:)/ 

/e:/ (< CC /ei/) > Pr. W/Corn./Bret. /ui/ 

/a:/ (< IE /a:/ and /o:/) > British /o:/ > Pr. W /au/, Pr. Com./Bret. /©( : )/ 

/u:/ (< IE /u:/ and /-o:/) > British /u:/ > Pr. W/Corn./Bret. /i( : )/ 

/au/ > British /o:/> Pr. W /au/, Pr. Corn./Bret. /e(:)/ (see Schrijver 1995: 195) 

/ou/ (< CC /ou/ and /eu/) > British /o:/ > /u:/ > Pr. W/Corn./Bret. /»(:)/ 

/ai/ > British /e:/ > Pr. W/Corn./Bret. /oi/ 

/oi/ > British /u:/ > Pr. W/Corn./Bret. /u(:)/ 

The semi-vowel /w/ had become /gw/ in absolute anlaut by the time of the earliest OWCB 

(c. 800). Medial /y / sometimes developed to /8/. 


Consonantism 

/s/ tended to become /h/ or lyl or to disappear, [z] became /8/, and [x] was vocalized as /y/. 
(On /g w / see p. 363 above.) Most other consonants suffered the change known as “lenition” 
in positions where their weaker allophones occurred (cf. p. 364), and there was later a 
tendency for unlenited voiceless consonants to be spirantized (except in absolute anlaut): 


TABLE 7.5 BRYTHONIC CONSONANTAL DEVELOPMENTS 


[pp] 

> 

[Pi 

>m 

(Spirantization) 

[p] 

> 

[b] 

(Lenition) 


[tt] 

> 

[t] 

>1M 

(Spirantization) 

[t] 

> 

[d] 

(Lenition) 


[kk] 

> 

M 

> [x] 

(Spirantization) 

[k] 

> 

[g] 

(Lenition) 


[bb] 

> 

[b] 



[b] 

> 

IP] 

(Lenition) 

([PI later > [v]) 

[dd] 

> 

[d] 



[d] 

> 

[5] 

(Lenition) 


[gg] 

> 

[g] 



[g] 

> 

[y] 

(Lenition) 

([y] later lost, or > [y] or [w]) 

[mm] 

> 

[m] 



[m] 

> 

M 

(Lenition) 

(M later > [v]) 


These changes probably took place in three stages: (a) spirantization of /b, d, g, g w , m / 
(before c. ad 400?); (b) voicing of /p, t, k/ (fifth century?); and (c) spirantization of /p 2 , t,, 
k,/ (sixth century?) (see Sims-Williams 2007b: 14-16, 26, 43-58). Since they occurred in 
external sandhi, they led to “lenition” (e.g. /k ~ g/) and “spirantization” (e.g. /k ~ x/) as 
initial mutations (p. 364). The above simplifications of double consonants (except [mm], 
which followed the pattern of [nn]) were completed before the advent of the new quan- 
tity system (Vocalism, p. 364), hence, for example, W cred ‘belief’ < *kredd- (versus 
mdm(m ) ‘mother’ < *mamm-). 


THE PHONOLOGY OF GAELIC 

The accent in Gaelic fell on the initial syllable of stressed words. A full range of vowels 
was therefore preserved in this syllable, whereas post-tonic vowels tended to be short- 
ened, reduced or syncopated. For example, in Old Irish there were only two short vowels 
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in closed unstressed syllables, [a] and [u] (although the spelling system seems to disguise 
this). Because Brythonic stress developed very differently (p. 364), the evidence of the 
two branches is of complementary importance in reconstruction. The best handbook is 
McCone 1996; cf. Sims-Williams 2003: 296-301. 


Vocalism 

IE vowel quantity was retained in stressed syllables; in unstressed syllables long vowels 
were shortened in Primitive Irish, except in final syllables before final /h/ < Pp 8 x s/, and 
even in these syllables the vowel was eventually shortened, e.g. Celt. *teutas ‘tribes’ > 
Pr. Ir. *to]3ah > OIr. tuatha [tuajaa]. In stressed syllables, the Common Celtic vowels and 
diphthongs developed mainly as follows: 

/i / > OIr. /i/ (if not lowered to /e/ by following low vowel) 

/e/ > OIr. /e/ (if not raised to /i / by following high vowel) 

/a/ > OIr. /a/ 

/o/ > OIr. /o/ (if not raised to /u/ by following high vowel) 

/u/ > OIr. /u/ (if not lowered to /o/ by following low vowel) 

/i:/ (< IE /e:/ and/i:/) > OIr. /i:/ 

/e:/ (< CC /ei/) > OIr. /e:/ alternating with /ia/ 

/a:/ (< IE /a:/ and /o:/) > OIr. /a:/ 

/u:/ (< IE /u:/ and/-o:/) > OIr. /u:/ 

/au/ > OIr. /o:/ alternating with /ua/ 

/ou/ (< CC /ou/ and /eu/) > OIr. /o :/ alternating with /ua/ 

/ai/ and /oi/, though still distinct in most Ogham inscriptions, merged in Old Irish as a 
diphthong of uncertain value (/oi/?), written <ae>, <ai>, <oe>, <oi>. 

The semi-vowel /w/ became /f/ in absolute anlaut and /v/ after nasals, for example, 
*wiros > fer ‘man’, *banwos > banb [banv] ‘pig’ (: Gaul. Banuus, W banw ); in other 
positions /w/ was lost, as was /y/, e.g. *yowankos (: W ieuanc) > *(y)o(w)£gah > OIr. 
oac [osg] ‘young’. 


Consonantism 

/s/ tended to become /h/ or to disappear, [z] became /8/, but [x], written ch, remained 
in Old Irish. In the Ogham inscriptions /k w / and (probably) /g w / were still distinct 
from /k / and /g/, but they had been delabialized by the Old Irish period. Even before 
the Ogham inscriptions the combinations /nt, nk, nk w , ns/ had become / dd, gg, g w g w , 
ss/, with compensatory lengthening of a preceding /a/ or /e/ to /e:/, e.g. *sentus (: 
Bret, hent, OHG sind ‘road’) > */s£:ddus/ > OIr. set [s j £:d] ‘road’ (cf. Ogham per- 
sonal name SEDANI > OIr. Setn(a)i); *k w enk w e (: Lat. quinque) > /k w £:g w g w e/ > 
/ko:g j e/ > coic [ko:g j ] ‘five’ (the rounding was due to the labiovelars); *BrigantT (: Skr. 
brhatT f. ‘exalted one’) > /brig£:ddi:/ > /briycdi/ > Brigit [b J r J iy J ad j ] (personal name). 
These changes after nasals also took place in external sandhi, giving rise to the initial 
nasal mutation, e.g. gen. pi. *wiran trumman > *wira ddrumman > OIr. fer tromm 
[f J er drom] ‘of heavy men’. (This mutation was not usually shown in writing.) As in 
Brythonic, most consonants underwent “lenition” in positions where their weaker 
allophones occurred (cf. p. 364), but lenition of /t/ and IkJ took a different form in 
Gaelic: 
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TABLE 7.6 GAELIC CONSONANTAL DEVELOPMENTS 


[tt] 

> 

[t] 


[t] 

> 

M 

(Lenition) 

[kk] 

> 

[k] 


M 

> 

[x] 

(Lenition) 

[bb] 

> 

[b] 


[b] 

> 

m 

(Lenition) 

[dd] 

> 

[d] 


[d] 

> 

[5] 

(Lenition) 

[gg] 

> 

[g] 


[g] 

> 

[y] 

(Lenition) 

[mm] 

> 

[m] 


[m] 

> 

M 

(Lenition) 


Gaelic lenition probably took place in two stages: (a) spirantization of /b d g( w ) m/; (b) 
spirantization of /t k( w )/; they cannot be dated precisely, but (b) occurred later than the 
(fifth-century?) voicing of /p t k/ in British (see Sims-Williams 1990: 233, McCone 1994: 
74 = 2005: 42). In external sandhi, lenition resulted in an initial mutation (Consonan- 
tism, p. 364). Internally, lenited consonants were often lost with compensatory lengthen- 
ing, for example, Ogham SAGRAGNI (gen. sg.) > OIr. Sarain [sa:ra:n J ]; Pr. Ir. *e[m- (: W 
edn < *petnos) > OIr. en ‘bird’. Note that this gave rise to new long vowels in unstressed 
syllables. Phonemically, the most important other change was the growth of palatalized 
consonants before front vowels in certain environments (Greene 1973), e.g. *alyos (: Lat. 
alius, MW eil) > Pr. Ir. *al j iyah > *al j eyah > OIr. aile [al j e] ‘other’. Contrast *kaletos (: 
W caled ‘hard’) > Pr. Ir. *kale[)ah > OIr. calad [katad] ‘hard’, and note that palatalization 
had become phonemic at the point when [afeyah] was opposed to [kalejjah]. All conso- 
nants could be palatalized. On the spelling of palatalized consonants see Old Irish, p. 361 . 

The typical development of consonants in Insular Celtic can be exemplified by IE *t 
(and the older handbooks’ *th) as follows: IE *t (and *t h ) > Celtic /t/; Celtic /t/ allophoni- 
cally = [tt] and [t]; inBrythonic [tt] >/t/ (absolute anlaut) and/]}/ (elsewhere), but [t] >/d/; 
in Gaelic [tt] > /t/ and /t j /, but [t] > /[>/ and /JP/. Thumeysen ( 1 946: 97) maintained that Old 
Irish also had u-quality consonants (/t u / etc.), and some favour re-instating his three-way 
system (Anderson 2014, McCone 2015, versus Jaskula 2014). 

THE MORPHOLOGY OF COMMON CELTIC 

In nominal morphology the threefold distinctions in gender (masculine, feminine, neuter) 
and number (singular, plural, dual) survived from late Indo-European. The neuter was 
lost in Middle Irish, and only traces remain in Brythonic. The dual is always reinforced by 
the numeral da ‘two’ in Old Irish and is merely residual in Brythonic, where it is formally 
identical with the singular or plural, for example, W y gafl ‘the fork’ (< *sindos gablos), 
V geifl ‘the forks’, Yr Eifl (mountain- name) ‘the (two) forks’: the last two forms imply 
*sindi gabli (not du. **sindo gablo), but whereas the masculine plural has dropped leni- 
tion by analogy with non-lenition after the feminine plural article v < *sindas, the lenition 
remains in the dual. This is an illustration of “the eviction or replacement of a morph by a 
new morph only in the former’s primary or secondary function” (Kurylowicz 1964: 14). 

The number of cases is reduced to five in Old Irish, with a “dative” case subsuming the 
functions of the dative, ablative, locative and instrumental. (Some of these distinctions 
remain in Celtiberian and Gaulish.) Owing to syncretism, the endings of the “dative” may 
derive not only from the IE dative, but also from the IE ablative, locative or instrumental. 
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In Insular Celtic, inflected forms began to be “hypercharacterized” (Schmidt 1974), owing 
to the functions of the case endings being increasingly subsumed by fixed syntactic struc- 
tures such as preposition + noun, VSO word order (Syntax, p. 377), noun + dependent 
genitive (in prose texts) or noun + qualifying adjective. It is not surprising, then, that case 
endings were often allowed to remain ambiguous in Old Irish and disappeared altogether 
in Brythonic, apart from a few fossils, for example, the dative of perm ‘head’ (o-stem) in 
MW erbyn ‘against, to meet’ < *are pennu = OIr. ar chiunn (+ gen.) ‘id.’ < *are k w ennu. 
In Brythonic, nouns (and some adjectives) have only singular and plural. Some plurals are 
historical, for example, MW mab ‘son’, pi. meib (form used after numerals) < *mapos, 
-l (cf. OIr. macc, pi. maicc), but many are analogical, e.g. meib(i)on ‘sons’, with -(i)on 
from the old n-stem pi. *-ones. Ahistorical plurals were inevitable in Brythonic wherever 
there would have been no distinction of number; for example, *donyos, pi. *donyi ‘man’ 
(< *gdonyos: Gr. yjiovioc, Word Formation, p. 377) gave duine, pi. duini in Old Irish, 
but dyn, pi. **dyn — » dynion in Welsh; plural dyn survived only after numerals, where it 
came to be regarded as singular. Many Brythonic plurals were old collectives, which may 
account for the use of singular verbs with plural subjects in Welsh. 

The case system survived the loss of final syllables in Old Irish because the latter left 
traces in vowel affection, palatalization and following mutations. The paradigm of the 
masculine o-stem *wiros ‘man’ (: Lat. uir) illustrates this. Parallel endings are given 
here (and for the a-stems below) from Gaulish, Lepontic and Celtiberian (Evans 1967: 
420-426, Lejeune 1970: 467, 1985a, 137-138, 1985b, Tovar 1986: 91-92, Eska 1989: 
160-163, 1995, Prosdocimi 1989, Lambert 1994: 49-58, Villar 1995, Eska & Evans 
2010: 33, 37, 40); the quantities of vowels in these scripts are sometimes guesswork. 


TABLE 7.7 NOMINAL STEMS IN -O- 


Singular 

nom. 

fer 

l-r] 

< *wiros 

cf. Gaul, -os, Lep. -os, CIb. -os 

voc. 

fir L 

1-rJ] 

< *wire 

cf. Gaul. -e(?) 

acc. 

fer N 

l-r] 

< *wiron 

cf. Gaul, -om, -on, Lep. -om, CIb. -om 

gen. 

fid 

1-rJ] 

< *wirl 

cf. Gaul. -I, Lep. -f, -oiso, ?-«, CIb. -o (see Celtiberian on p. 360) 

dat. 

fiuP 

l-r] 

< *wiru 

cf. Gaul, -ui, -H; Lep. -Hi, CIb. -Hi, -ei (loc.), -us (abl.) 

Plural 

nom. 

fid 

l-r*] 

< *wirl 

cf. Gaul, -oi, -T, Lep. -oi, CIb. 1-oi 

voc. 

firu 

[-ru] 

< *wirus 


acc. 

firu 

[-ru] 

< *wiru(n)s 

cf. Gaul. -o(:)s, -us, CIb. -Hs(?) 

gen. 

fed 

l-r] 

< *wiron 

cf. Gaul, -on, CIb. -u(\)m 

dat. 

feraib 

[-rap] 

< *wirobis 

cf. Gaul, -obo, Lep. -oPos, CIb. -uPos 


L = +lenition, N = +nasal mutation. 


The dative singular -u(f) may derive from the IE dative *-oy, and from instrumental 
*-o or ablative *-5d. The old nominative/vocative plural *-os > *-us has been replaced 
by *-T (< pronominal *-oy) in the nominative but survives in its secondary function as 
vocative. The Old Irish palatalization in the dative plural points to *-bis, an instrumental 
ending; cf. Gaul, gobedbi (p. 361). The genitive plural fed points to *-om, not **-um < 
*-om; probably all long vowels before final nasal were shortened very early in Celtic, 
before the five long vowels were reduced to three (Cowgill 1975: 49, Jasanoff 1989: 139), 
although CIb. -um ( if= /u:m/ < /o:m/, which is uncertain) may arguably tell against this 
(cf. Sims-Williams 2007b: 7-14, Eska 2010a: 23, Eska & Evans 2010: 33). 

The Old Irish a-stcm declension is also quite well paralleled in Continental Celtic (see 
Table 7.8). The most controversial Old Irish ending has been the accusative singular with 
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palatalized -th, but probably the development was *-am > *-am (cf. above) > *-aen caus- 
ing palatalization (cf. Subdivision, p. 359). Late Gaulish borrowed -im from the /-stems. 
There is clear syncretism in the genitive singular, where both Irish and Late Gaulish have 
replaced *-as with *-(i)yas, the pronominal and ya-stem ending (cf. Lat .paterfamilias), 
which developed via Ogham -EAS to OIr. -e. The original *-as was retained in the irregu- 
lar paradigm of OIr. ben ‘woman’. This also preserved an old ablaut pattern, for example, 
nominative/vocative/accusative plural mna (= Gaul, mnas < *bnas) and genitive plural 
ban N (< *banom, cf. Gaul, bnanom, de Bernardo Stempel 1987: 83, Delamarre 2003: 73). 
The singular is given in Table 7.8. 

TABLE 7.8 NOMINAL STEMS IN -A- 

Singular 


nom. 

tuath L 

< *teuta 

cf. Gaul. - a , Lep. -a, CIb. -a 

voc. 

tuath L 

< *teuta 

cf. Gaul, -a 

acc. 

tuaith N 

< *teut 2 en 

cf. Gaul, -an, -im, Lep. -a(:)m, CIb. -a(:)m 

gen. 

tuaithe 

< *teut(i)yas 

cf. Gaul, -as, -ids, CIb. -as 

dat. 

tuaith L 

< *teutl 

cf. Gaul, -ai, -i, Lep. -ai, CIb. -ai 

Plural 

nom./voc. 

tuatha 

< *teutas 

cf. Gaul, -as (?), CIb. -as 

acc. 

tuatha 

< *teuta(n)s 

cf. Gaul, -as, CIb. -as 

gen. 

tuath n 

< * teuton 

cf. Gaul, -anom, CIb. 1-aiim 

dat. 

tuathaib 

< *teutabis 

cf. Gaul, -dbo, -abi 


nom. 

ben L 

< *bena 

< *bena 

<— *g w enh. 

acc. 

bein N 

< *benaen? 

< *benam 

< *g*enh,m 

gen. 

mna 

< *mnas 

< *bnas 

< *g w neh,s 

dat. 

mnaP 

< *mnai 

< *bnai 

< *g w neh 2 (e)i 


In Old Irish there is also be N ‘woman’, and it has been suggested that this derives < 
*ben < *g w en < *g w en < *g w enh 2 , whereas ben < *bena (or *bena) is analogical (Jasanoff 
1989). This depends on the doctrines that IE /-VRH/ > IE /-V:R/, and that CC /V:/ > CC 
/V/ before /-m/ and/-n/ (p. 368). 

The only other Old Irish paradigm which may preserve ablaut in the oblique cases is the 
n-stem cu (= Brythonic ki < *ku), in which oblique con- may partly continue *kwon- and 
may partly represent the weak grade *kun- with regular lowering of /u/ before following /o/ 
(see Joseph 1990, Stiiber 1998: 85-89, de Bernardo Stempel 1999: 27-28, and Table 7.9). 


TABLE 7.9 NOMINAL STEM IN -N- 


Singular 


nom. 

cu L 

< *ku 

< *k(u)wo 

cf. Skr. 

sva 

acc. 

coin N 

< *koncen ? < *kwonm 

cf. Skr. 

svanam 

gen. 

con 

< *kunos 

< *kunos 

cf Skr. 

sunas 

dat. 

Plural 

coin L 

(from acc.) (*kuney, -i) 

cf. Skr. 

suna (instr.) 

nom. 

coin 

< *kones < *kwones 

cf. Skr. 

svanas 

acc. 

cona 

< *kunas 

< *kunms 

cf. Skr. 

sunas 

gen. 

con N 

< *kunon < *kunom 

cf. Skr. 

sunam 

dat. 

conaib 

< *kunobis < *kwnb h is 

cf. Skr. 

svabhis (instr.) 





370 PATRICK SIMS-WILLIAMS 


Most other IE declensional classes, for example, /-stems, w-stems, various consonantal 
stems and even heteroclitic r-/n- stems (Stiiber 1998: 84, de Bernardo Stempel 1999: 
130-139), are represented in Insular Celtic, but much less completely on the Continent; 
for reasons of space they are omitted here. Adjectives belong to a more restricted range 
of declensions, mostly vocalic stems. Celtic retained the IE degrees of comparison - 
the comparative, mostly in -(i)u in Old Irish < *-y5s, e.g. siniu ‘older’ (: Lat. senior), 
also residually in Brythonic, e.g. W hyn ‘older’ < *senyds (de Bernardo Stempel 1989, 
Schrijver 2007), and the superlative, in -em, -am in Old Irish, -ham in Old Welsh, 
< *-isamo/a (Gaul. Marti Rigisamo, cf. Position within IE, p. 357). Celtic also added the 
equative, in -ithir, -idir in Old Irish, but -(h)et in Brythonic; its etymology is uncertain 
(cf. Watkins 1966: 37, McCone 1994: 125 =2005: 141-142). 

Pronouns 

Pronouns are not well attested in Continental Celtic, and in Insular Celtic they have 
evolved through analogical levelling, through interaction with verbal endings and through 
phonetic attrition, especially when unstressed (see Schrijver 1997, Katz 1998). 

The demonstratives mostly derive from IE *so + clitic pronouns, e.g. Gaul, sosin celic- 
non (ace.), sosio < ?*sosyod ‘this’ (p. 361; Syntax, p. 377), OIr. suide < *so-de-so(s) 
according to Schrijver (1997: 33), not < *sodyo-. Demonstratives recalling Lat. iste occur 
in Lep. isos (p. 359) and in CIb. isTe, sTena (n. pi.) (Eska 1989: 165, Schrijver 1997: 63). 
In Insular Celtic the definite article comes from *sindos, *sinda, *sen; cf. Gaul, indas 
mnas = OIr. inn a mna ‘the women’ (ace. pi.). Demonstrative *so or *son may mark 
the relative in the Old Irish third-person singular relative beires ‘who carries, which he 
carries’ < ?*beret-so-, while ( s)a N < *sen appears as antecedent in, for example, OIr. 
for(s)a N ‘on which’; but the usual relative marker was uninflected *yo (: Hitt, va ‘and’): 
for example, *esti-yo ‘who is’ > OIr. as(a), MW yssyd; *welesi-yo ‘*whom thou seest’ 
(Sims-Williams 1984: 153-154) > OIr. file ‘who is’; *beronti-yo ‘who carry, which they 
carry’ > OIr. berte\ cf. Gaul, dugiiontiio (p. 361). In compound verbs *yo was infixed, 
causing lenition, e.g. do-ceil ‘hides’, relative do-cheil < *di-yo-kelet(i) (McCone 1980, 
O hUiginn 1986). The interrogative stem *k w ey- (OIr. cla, OW pui ‘who?’) is rarely used 
to express the relative. The connective particles *k w e (: Lat. -que, CIb. -Cue) > OIr. -ch 
(‘and’) and *de (: Gr. 8e) > OIr. -d- may serve as relative markers in Old Irish, by a sec- 
ondary development (Vendryes 1911, Watkins 1963). 

In the personal pronouns the distinction between nominative and accusative in the first 
and second person seems to have been lost in Celtic, and in Insular Celtic “dative” pro- 
nouns occur, in greatly reduced form, only in combination with other words, for example, 
*tl < *toy in OIr. duit ‘to you’. First-person singular OIr. me and OBret. me suggest *me, 
whereas OW mi, if not influenced by second-person singular ti, may derive from *ml < 
*me (cf. mi in Gaulish, below). OIr. gen. mo L ‘my’, stressed mui ‘mine’, imply *mowe < 
*mewe, whereas MW yv N ‘my’ implies *men < *mene (cf. Av. mana, Slav, mene), so the 
Old Irish forms (and MW stressed meu ‘mine’) may be analogical to the second-person 
singular *tewe. The Old Irish second-person singular tu and OW ti together imply CC 
*tu. Its genitive forms, OIr. do L , stressed tui, tai, and MW dv L , stressed teu, imply Celt. 
*tewe (: Skr. tava). The first-person plural forms, OIr. sni, MW ni, Gaul, sni, suggest 
*snl(s) < *(s)ne(:)(s), and the second-person plural forms, OIr. si, MW chwi, suggest 
*swl(s) < *(s)w(e:)(s); the *s- may be due to the first-person plural verbal ending: *-mos 
ni(s) > *-mos snT(s) (cf. Katz 1998). The Old Irish genitive forms nathar, nar ‘ours’ 
(unstressed ar N , MW an), sethar, sar ‘yours’ (unstressed far N ), can been compared with 
Lat. noster, vester, Goth, unsara-, izwara- etc. (Schrijver 1995: 454). 
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The third-person pronouns are problematic. The Old Irish subject pronouns, singular e 
(m.), si (f.) (W hi), ed (n.), plural e (W wy), may derive from *es, *si, *eda, *ey. W e/ ‘he’ 
may come from accusative *emem, while an unreduplicated accusative *em lies behind 
the masculine infixed object pronouns OIr. -a N -, Bret. -en-. Old Irish accusative feminine 
-5 n -, -e N -, may imply *(s)iyam or *sam, and the neuter accusative singular -a L - implies 
*e < *ed. (In Gaulish id may occur, but OIr. beirthi ‘carries it’ implies *bereti-e(d), not 
**bereti-id.) The accusative plural - 5 - (also -s N -) comes from *sus < *so(n)s (cf. Gaul, sos 
‘them’). Most of the above forms cannot be traced directly back to IE. By contrast, the 
unstressed genitive forms OIr. a L (m. n.) (MW tT), a (f.) (MWy s ), a N (pi.) (W eu /?-) can 
be derived regularly from IE *esyo, *esyas, *eys5m; the /y/ survives as /8/ in the stem of 
the Middle Welsh stressed forms eidaw (m.), eidi (f.). 

Subject pronouns seem to occur after verbs in Gaulish ( uediiu mi Consonantism, 
p. 363, unless -mi is an added athematic ending as in Skr. bhdrdmi), and they perhaps 
lie behind OIr. 1 sg. -mm in, for example, benaimm ‘I strike’ < ?*binam-me. Some Old 
Irish “emphasizing pronouns’’ are personal pronouns, for example, laimir-sni ‘we dare’ 
and also (but see Griffith 2010) ni-bir-siu ‘you do not carry’ < *nls-beres-tu (cf. MW 
kereist ‘you loved’ with -t < *tl); most of them, however, are demonstratives in origin, 
for example, beirid-som ‘he carries’ (: Goth, sama ‘the same’, Gr. opoq), earlier -sa < 
?*se. Old Irish object pronouns are suffixed to simple verbs, for example, beirthi ‘carries 
it’ < *bereti-e (above), but infixed within compound verbs and after particles, for exam- 
ple, da-cheil ‘hides it’ < *di-e-kelet(i), ra-ngaid ‘has beseeched him’ < *pro-em-g wh od h e. 
They are often combined with the particle *de, for example, fordom-chdin ‘he teaches 
me’ < *wer-de-me-kanet(i). The Old Irish stress (here indicated by ') regularly follows 
infixes, and the latter cause sandhi mutations. Deuterotonic compound verbs without 
visible infixes or sandhi, for example, do-ceil ‘hides’, have been attributed either to a 
meaningless sandhi-inhibiting particle, for example, *di-(e)s-kelet(i) (Cowgill 1985; for 
other suggested particles such as *eti see Karl & Stifter 2007: 301-402), or to analogical 
creation by “infix deletion”, for example, *di-e-kelet(i) > *d(i)-e-xele(j}) — » *di-kele(j}) 
> do-ceil (McCone 1985 - the -c- is not lenited because it arose after the period of len- 
ition, p. 367, pace Russell 1995: 54). The sentence-initial position of such deuterotonic 
verbs in Old Irish (Syntax, p. 377) implies the former presence of infixes, standing in 
second position according to Wackemagel’s Law (Vendryes 1911; Chapter 2, p. 377). The 
deleted particles may have been mainly proleptic/redundant neuter pronouns (e L , d(e)-e L ), 
whose deletion would have avoided confusion with relative verbs of the type do-cheil 
(p. 370) (Sims-Williams 1984; for other theories see Karl & Stifter 2007: 301^102). For 
prototonic verbs (e.g. dichil < *di-kelet(i)) see Syntax, p. 377; such forms occur when 
a proclitic particle precedes, with or without an infix, e.g. ni-dichil ‘does not hide’, nim- 
dichil ‘does not hide me’. 

The Celtic verbal system 

This simplified semantic distinctions carried by inflections in Indo-European. For 
example, the IE aorist and perfect merged in a single “preterite” tense, and the sub- 
junctive and optative moods merged as a single “subjunctive” mood; instead, aspectual 
differences were expressed syntactically by the use of preverbs and particles, for exam- 
ple, OIr. ro, MW rv < *pro (Schmidt 1990). The inflectional system survives most fully 
in Old Irish. In Old Irish, verbs express the active voice with either active inflection or 
deponent inflection, and for the passive/impersonal voice a passive inflection obtained 
which was similar to but distinct from the deponent inflection; e.g. suidigidir ‘places’ 
differs from suidigthir ‘is placed’ only in lacking syncope. According to one theory, 
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both resulted from a late split in the IE mediopassive: syncope patterns would vary 
according to the syllable-count of the base, and this variation could have been exploited 
in order to differentiate deponent and passive (McCone 1986: 240; 2006: 137-138; see 
differently Jasanoff 1 997, Griffith 2009). Already in Gaulish, deponent verbs (i.e. verbs 
with active meaning and mediopassive inflection) are apparent, e.g. marcosior ‘I shall 
(or may I) ride’ (Lejeune 1985a: 138, Lambert 1994: 125, Delamarre 2003: 217-218). 
For a detailed lexicon of Celtic verbs see Schumacher 2004, supplemented for Middle 
Welsh by Rodway 2013. 


Personal endings 

The personal endings of the Celtic verb derive mainly from: 

(a) the primary endings of the IE present/aorist system (*-5/*-mi, *-si, *-ti, etc.), which 
probably fell together with the secondary endings (*-m, *-s, *-t, etc., Chapter 4, 
p. xx), partly through loss of *-i as in Italic (Cowgill 1975, 1985, Lambert 1994: 
63, Villar 1995, McCone 2006: 106, Sims-Williams 2007b: 29 and 34) and possibly 
through a still earlier expansion of the domain of primary endings; 

(b) IE imperative endings; 

(c) IE mediopassive endings in -r (as in Hittite, Italic, Tocharian: cf. Chapter 2, 
p. 99); 

(d) IE perfect endings (see Chapter 2, p. 98-99). 

In Insular Celtic there are also obscure “imperfect” endings in the imperfect indicative, 
conditional and past subjunctive. They are identical in both active and deponent verbs 
and are apparently of mediopassive origin (cf. Schrijver 1992, Ahlqvist 1993). 

Like other IE languages, Celtic came to prefer “thematic” inflection, with a thematic 
vowel alternating between e and o before the personal endings, to “athematic” inflection, 
without e/o but often with ablaut variation in the root in the present (full grade in the 
singular, weak grade in the plural). A survivor of the athematic type is the “copula” (the 
form of the verb ‘to be’ expressing equivalence rather than existence): third-person sin- 
gular OIr. is, MW vs < *es-ti, third-person plural OIr. it, MW ynt < *s-enti. Celtic tended 
to thematicize athematic verbs and to generalize a single grade of ablaut, usually the zero 
grade of the plural: thus IE *melg-ti, *mlg-enti Tnilk(s)’ — > CC *mlig-e-ti, *mlig-o-nti 
> OIr. mligid, mlegait (Watkins 1962: 141-142, McCone 1986: 228, 1991: 29). The 
*e/o spread wherever it helped to avoid awkward consonant clusters; hence, it was not 
inserted after roots with a final laryngeal, which gave Celtic /a/ and remained athematic, 
e.g. *skerH-ti, *skrH-enti > *skarati, *skaranti > OIr. scaraid, scarait ‘separate(s)’ (cf. 
Watkins 1962: 189). 

(a) Primary endings occur in Old Irish active verbs in the present indicative, present 
subjunctive and future indicative, and in such preterite indicatives as derive from 
the IE aorist (i.e. not those which derive from the IE perfect). In Insular Celtic this 
type of ending has two forms: (a) “absolute”, in simple verbs in absolute initial 
position without a preceding particle; and (b) “conjunct”, in all compound verbs and 
in simple verbs in non-initial position: e.g. (a) OIr. beirid ‘carries’, MW trenghit 
‘dies’; (b) ni-beir ‘does not carry’, nv threingk ‘does not die’. It is now agreed that 
“absolute” and “conjunct” endings have a single origin, and do not derive from the 
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IE primary and secondary endings respectively (see Sims- Williams 1984, Cowgill 
1985, McCone 1985, 2006; cf. Watkins 1963). Accepting an early apocope of *-l 
(p. 372, cf. sistat p. 360), the conjunct forms of beirid ‘carries’ can be derived from 
primary forms as follows: 


TABLE 7.10 A POSSIBLE ORIGIN OF OLD IRISH CONJUNCT ENDINGS 


1 sg. 

-biur [friur] 

< *biru 

< *b h ero 

2 sg. 

-bir [b>ir J ] 

< *biris 

< *b h eresi 

3 sg. 

-beir [bier*] 

< *beret 

< *b h ereti 

1 P L 

-beram [t^erap,] 

< *beromos 

< *b h eromosi 

2 pi. 

-beirid [b l er J 3b'] 

< *berete 

< *b h erete 

3 pi. 

-berat [b'ersd] 

< *beront 

< *b h eronti 


The Old Irish absolute forms are longer than the conjunct forms, and it appears 
that they originally included some additional element which protected final *-i from 
apocope. Since absolute verbs always occur at the head of clauses, the position of 
the additional element must be due to Wackemagel’s Law (cf. p. 371). Various par- 
ticles have been proposed (cf. p. 371), but one candidate (Sims-Williams 1984) is a 
redundant or proleptic neuter object pronoun *e(d): 


TABLE 7.11 A POSSIBLE ORIGIN OF OLD IRISH ABSOLUTE ENDINGS 


1 sg. 

biru [frim] 

< *biru-e 

2 sg. 

biri [b'ir J i] 

< *birisi-e 

3 sg. 

beirid [b i er i 95 i ] 

< *bereti- 

lpl. 

bermai [b'ermi] 

< *beromosi-e 

2 pi. 

beirthe [b i er J J> i e] 

< *berete-e 

3 pi. 

berait [biered*] 

< *beronti- 


In the third persons - or all persons, according to McCone (1994: 141, 2006: 
118, 120, 124), who posits 2 pi. *-tesi and analogical developments in the first- 
person singular - the additional element was dropped before the general apocope 
by a process of “suffix deletion” presumably contemporary with “infix deletion” in 
compound verbs (p. 371). 

(b) The imperative endings were mostly similar to the conjunct of the present indic- 
ative, except the second-person singular, e.g. OIr. gaib ‘get!’, Gaul, gabi, and the 
third-person singular, e.g. OIr. gaibed < *g h ab h yetow? (cf. Goth, -dau); the need to 
differentiate the indicative from the imperative in absolute initial position may have 
encouraged the generalization in that position of the absolute (rather than conjunct) 
indicative endings of simple verbs (Sims-Williams 1984: 171, Eska 1991). Simi- 
larly, the imperative of compound verbs was distinguished from the (deuterotonic) 
indicative by its protonic stress (cf. p. 371). 

(c) The passive! impersonal inflection has third-person singular and third-person plural 
endings only. (The absolute forms, with palatalized final consonant, are arguably 
based on the analogy of active absolute forms like third-person plural her ait!) In 
the present tense there are two inflections in the singular, with or without a dental 
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consonant (cf. Umbr. pres, subj .ferar vs Lat .feraturl), as seen in the following con- 
junct forms: 


TABLE 7.12 OLD IRISH PASSIVE FORMS (PRESENT TENSE) 


3 sg. 

-berar 

< *beror 

(absolute berair ) ‘is carried’ 


-marbthar 

< *marwa(:)tor 

(absolute marbthair ) ‘is killed’ 

3 pi. 

-bertar 

< *berontor 

(absolute bertair ) 


-marbtar 

< *marwa(:)ntor 

(absolute marbtair) 


Infixed pronouns indicate the first and second persons, e.g. nom-berar ‘I am 
carried’. 

The passive preterite paradigm, however, was built up on the basis of the IE verbal 
adjective (Sims-Williams 1984: 183), e.g. MW lias ‘he was slain’ < *slad-tos, OIr. 
nom-breth ‘I was carried’ < *nu-me-britos (< *b h rtos). 

The deponent conjunct endings may be illustrated with OIr. -fograigedar ‘sounds’. 
The irregular syncope outside the third-person singular and third-person plural of 
this five-syllable base arguably follows the pattern of four-syllable bases (p. 372). 
(Jasanoff (1997), however, hypothesizes 3 sg. *-tro, 3 pi. *-ntro.) 


TABLE 7.13 OLD IRISH DEPONENT FORMS (PRESENT TENSE) 


1 sg. 

-fograigiur 

< *wogaro-sagI-or 

(absolute fograigim ) 

2 sg. 

-fograigther 

< *wogaro-sagI-ter 

(absolute fograigther) 

3 sg. 

-fograigethar 

< *wogaro-sagI-tor 

(absolute fograigidir) 

Ipl. 

-fograigmer 

< *wogaro-sagT-mor 

(absolute fograigmir) 

2 pi. 

-fograigid 

< *wogaro-sagI-dwe (?) 

(absolute fograigthe) 

3 pi. 

-fograigetar 

< *wogaro-sagI-ntor 

(absolute fograigitir) 


The first-person singular and second-person plural absolute endings are borrowed 
from the athematic active, and the palatalized consonants in the absolute third-person 
singular, first-person plural and third-person plural are by analogy with passive abso- 
lute forms like gaibthir, gaibtir. 

(d) Some IE perfect endings survived in the OIr. “suffixless preterite’’, e.g. 1 sg. -gad 
‘I prayed’ < *g w ada, 3 sg. -gaid < *g w ade, 3 pi. -gadatar < *g w adontVr - the last a 
blend of the IE 7-ending (cf. Skr. vid-ur ) and the ;?t-ending of the present/aorist (cf. 
Lat. uide-r-unt). 


Present stem 

The present stem (used to fonn the present and imperfect indicative and the imperative) 
was formed with various suffixes, which merged to give the following Old Irish conjuga- 
tions according to the numeration of Thurneysen 1946: 

AI: *-a - (cf. Lat. -are) and *-a- < root-final *-H, e.g. scaraid (Personal Endings, p. 372); 
All: *-eye/eyo- (denominative and causative), also *-I- (partly < stative *-e-, e.g. -ruidi 
‘blushes’ cf. Lat. rubere ‘to be red, blush’); 
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AIII: miscellaneous verbs with hiatus, e.g. ba'id ‘dies’ < *ba-eti (McCone 1986: 228); 
BI: *-e/o- (e.g. beirid. Personal Endings, p. 372); 

BII: *-ye/yo- (3 sg. -gaib, not **-gaibi < *g h ab h yet(i), is arguably on the analogy of BI, 
although a *-i/yo- suffix or *ye > *i have been suggested, cf. Sims- Williams 1981 : 
211-216, Weiss 2012); 

Bill: *-e/o- with nasal infix, e.g. bongid ‘reaps’, passive pret. -bocht, cf. Lat. ta-n-go, 
tac-tus (cf. Joseph 1990, McCone 1991: 41^17); 

BIV: *-na-, e.g. crenaid, pi. crenait ‘buys’ < *k w ri-na-ti, *k w ri-na-nti (? ultimately from 
*k w ri-ne-h,-ti, *k w ri-n-h 2 -enti, with nasal infixed in the root *k w r(e)ih,-, seen with- 
out infix in the Old Irish subjunctive stem cria-) (McCone 1986: 225; cf. 1991: 
11-54); 

BV: supposedly *-n(e)u- (but see McCone 1986: 225-227, Campanile 1990, Hamp 
1991, McCone 1991: 13-15, 22-23). 

AI and All are the only productive formations in Old Irish, and broadly resemble 
the Brythonic regular verbs, e.g. MW carafe I love’ < *karami, kenif‘l sing’ < *kanlmi 
(unless the latter is < *kanu-mi). 


Subjunctive stem 

The subjunctive stem (used for the present and past subjunctive) is divided into two 
classes: (a) the unproductive 5-subjunctive, e.g. OIr. geiss, -ge ‘may pray’ < *g w ed-s- 
t(i), MW gwnech ‘may do’ < *(g)wrex < *wrek-s-et(i); and (b) the productive so-called 
d-subjunctive, e.g. beraid, -bera ‘may carry’, traditionally derived from *ber-a-t(i). It is 
semantically difficult to derive (a) from the IE 5-aorist indicative (with Watkins 1962), 
and derivation from the 5-aorist subjunctive is difficult unless *g w ed-s-t(i) replaced the 
expected *g w ed-s-et(i) under the influence of the 5-preterite inflection (McCone 1986: 
245-246), which is against the normal Celtic athematic — » thematic tendency (cf. Hamp 
1987, McCone 1991: 57, 73, 79-80, Schumacher 2004: 53). Subjunctives in -se-Ti may 
occur in Celtiberian (Eska 1989: 170). (b) The d-subjunctive, traditionally derived, with 
that of Latin, from an innovative Italo-Celtic optative suffix *-a- (cf. OLat. aduenat, subj. 
of aduenio), is now often analysed as *-a-se/o- or *-a-se/o-, with *a, *a originally from 
roots in *-H, *-RH (McCone 1986: 260, 1991: 85-113, Schumacher 2004: 50); other 
linguists retain the Italo-Celtic explanation (cf. Schmidt 1991: 17-19; Jasanoff 1994). 


Future stem 

There are three main types of Old Irish future stem (used for the indicative future and 
conditional): (a) the / : future, e.g. rannfa, -rannub ‘I shall divide’; (b) the reduplicated 
5-future, e.g. gigis, -gig ‘will pray’ < *g w i-g w ed-s-t(i); and (c) the reduplicated so-called 
d-future, e.g. cechnaid, -cechna ‘will sing’ < *ki-kana(:)-se-t(i), and its subtype, the e- 
future, e.g. celaid, -cela ‘will hide’ < ?*ki-kla(:)-se-t(i) (with *kikl- > *kexl- > cel-), (a), 
the / : future, is traditionally compared with the Latin future in -bo. This is phonetically 
controversial - normally /f i is < *sw - but because the / : future occurs only in Old Irish, 
where it is very productive, it is tempting to see it as a late, parallel innovation, based on 
a periphrasis involving the root *b h (e)w- ‘to be’ (but on /bw/ cf. McManus 1991: 122). 
However, McCone (1991: 176-182) suggested that it spread from a verb in which redu- 
plicated *si-sw- > *si-f-; Jasanoff (1994: 216) and Russell (1995: 20 and 49) rejected this, 
(b-c), the two reduplicated futures, run parallel to the corresponding subjunctives, and 
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their suffixes must be explained similarly. They are originally desideratives (semantically 
cf. English ‘he will pray’) and are comparable with Sanskrit desideratives, e.g. titrpsati 

< *ti-trp-se-ti (root *terp- ‘enjoy’) (Thumeysen 1946: 414-415; McCone 1986: 248- 
255). Unreduplicated futures in *-sye/o- have been identified in Continental Celtic, Indo- 
Iranian and elsewhere (e.g. Schmidt 1988: 241, Lambert 1994: 63, Schumacher 2004: 58). 

Active preterite stem 

The Old Irish active preterite stem (to which “deponent” as well as “active” inflections 
could be added) derived, for most verbs, from the IE aorist, but for others from the IE 
perfect. 

With few exceptions the aorists were originally athematic and sigmatic. According to 
Watkins (1962) third-person singular forms, e.g. *skerH-s-t, *ber-s-t developed regularly 
via *skarass(i), *bert(i) to -scar ‘separated’, -bert ‘carried’ (cf. OIr. tart < *trstu- ‘thirst’), 
and these third-person singular forms formed the basis for the whole paradigms, e.g. 
first-person singular *skarass-u > -scarus (the s-pret.), *bert-ti > -biurt (the t-pret.). Cf. 
MW 1 sg. kereis ‘loved’ < *karassti, 3 sg. ( *)caras < *karassit(i); other stem vowels 
before *ss were partly influenced by present stems (McCone 1986: 232, Rodway 2013: 
69-80, 128-165). A variant on Watkins’ s-aorist theory derives -bert < *birt < *blr(s)t(i) 

< *b h erst(i), with lengthened grade; this vocalism is supported by the *byrth underlying 
the MW compound kymyrth ‘took’ (McCone 1991: 67-69; Schumacher 2004: 61-64). 
It is phonologically more difficult to apply Watkins’ s-aorist theory to a Middle Welsh 
t-preterite such as MW 1 sg. keint ‘sang’ < *kantu, 3 sg. cant < *kant(i), and similarly 3 
sg. gwant ‘pierced’. Isaac (2001) explained the latter as an IE athematic imperfect, and 
Jasanoff (2012), accepting this, has explained MW kymyrth, OIr. -bert and other t-preter- 
ites as IE imperfects with lengthened grade *e. 

The third-person singular Brythonic termination of Middle Welsh absolute keressvt < 
*karass-iti, conjunct ( *)caras < *karass-it, is an innovation paralleled in Gaul, legasit < 
?*legast + *-iti versus more archaic prinas < *k w rinast (Sims-Williams 1984: 188; cf. Lam- 
bert 1994: 64, 68, Eska& Evans 2010: 41). For other Continental Celtic preterites seep. 359. 

Some of the Old Irish “suffixless preterites” employ reduplication and derive from 
old perfects (cf. Lat. tango, te-tigi), e.g. OIr. cechain ‘he sang’ < *ke-kan-e; a rare MW 
example is kigleu ‘he heard’. Others replaced *e in the root with Celtic *a (probably < 
*5), e.g. OIr. do-feid < *-wedet ‘leads’, do-faid < *-wade Ted’, MW godiwawd < *-wade 
‘overtook’. It is unclear whether this a-preterite is attested in Continental Celtic: a pos- 
sible precursor, perhaps with 16/ as in the IE perfect, has been seen in Gaul. AVVOT(E ) 
‘FECIT = ? /aw-wo(:)d(e)/ < *a(p)o-wod( h )-e Ted away, carried out, produced’ (Lambert 
1987, 1994: 122; cf. Hamp 1973). In general, the Celtic a-preterite resembles long-vowel 
preterites in other languages (cf. Germanic comparison in McCone 1986: 235-238, 1994: 
168 = 2005: 224) and like them is of obscure origin. Jasanoff (2012) derives them from 
IE imperfects. 

WORD FORMATION 

Despite much work on Celtic name formation (e.g. Uhlich 1993, Garcia Alonso 2013, 
Falileyev 2014), the vast subject of Celtic word formation has only recently been studied 
in detail, for example, Joseph 1987 on denominative verbs in *-sag- ‘seek’ (e.g .fograige- 
dar. fogur ‘sound’, Personal Endings, p. 374), Russell 1990 on velar suffixes (such as 
the productive *-ako-), Irslinger 2002 and Repansek 2013 on dental suffixes, de Bernardo 
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Stempel 1999 and Zimmer 2000 on Old Irish and Welsh noun formation, Stuber 1998 
on n-stems, and Schumacher 2000 on Welsh verbal nouns. Most IE types of composi- 
tion survive in Celtic, at least residually (cf. McCone 1994: 126-132 = 2005: 146-155), 
including tfvantfva-compounds such as OIr. gaisced ‘weapons’ < *gaiso-skeitom ‘spear- 
and-shield’ or TEVO-XTONION (gen. pi.) = DEIS ET HOMINIBUS in a bilingual Gaulish 
inscription at Vercelli (Delamarre 2003: 141-142, 176, cf. p. 368). Composition vowels 
are discussed by Sims-Williams (2013). Like other IE languages, Celtic developed the 
use of preverbs to modify verbal bases aspectually (The Celtic Verbal System, p. 371) 
or semantically, e.g. OIr. fo-reith ‘helps’ < *‘runs under’ < reithid ‘runs’ (: MW gwaret 
‘helps’; Lat. subcurro, succurro < curro). Large numbers may be strung together (e.g. 
OIr. intururas ‘incursion’ < *ind-to-are-uks-ret-), and they tend to appear in a particular 
hierarchy (McCone 1987: 94) which has parallels elsewhere, e.g. ro < *pro tends to occur 
close to the root, as did Ved. pra, Horn, pro (Sims-Williams 1984: 190). Despite similar- 
ities, the system of preverbs and prepositions in Celtic and Italic cannot be reduced to a 
unity (Watkins 1966: 36). As in other IE languages, e.g. OLat. ob uos sacro (— > obsecro 
uos), preverbs may be divided by tmesis (see Syntax below; McCone 1985: 267). 

SYNTAX 

The most important development in Celtic is in the position of the verb. Whereas the 
normal order is SOV in Celtiberian (e.g. p. 360) and in some Gaulish (e.g. Buscilla sosio 
legasit in Alixie Magalu ‘B. chose this in Alisia for Magalos’), Insular Celtic favours 
VSO (p. 368), with most apparent exceptions, such as SVO (Lewis 1989), being expli- 
cable either as nominativus pendens or as cleft sentences with [copula] + S + relative 
verb (Russell 1995: 292-300, Willis 1998: 97-101). The most important exception is the 
arguably archaic Old Irish construction known as Bergin’s Law (Binchy 1979), by which 
the verb, instead of appearing initially in absolute (Personal Endings, p. 372-374) 
or deuterotonic (p. 371) form (e.g. *Loiscis Lugaid trebthu ‘Lugaid burnt dwellings’, 
*Ad-rimi maicni nAilb ‘You reckon the sons of Alb’), appears finally/medially in con- 
junct or prototonic form, e.g. Lugaid loisc trebthu, Maicni nAilb dirmi. This construc- 
tion is attributed by some to preservation of the SOV/SVO syntax seen in Continental 
Celtic but by others to mere poetic licence (McCone 2006: 61-65, Eska 2007). The key 
to the development of VSO was identified by Vendryes (1911; cf. Watkins 1963, Eska 
1994, Russell 1995: 13, 303, McCone 2006) in the phenomenon that certain clitics were 
closely tied to verbs in Celtic and therefore drew either the verb or its first preverb to the 
head of the clause by Wackemagel’s Law (p. 371; Personal Endings, p. 372-373), e.g. 
verb + relative *yo in Gaul, dugiiontiio Ucuetin ‘who honour Ucuetis’ (p. 361), preverb 
imm + -a < *yo in archaic/poetic OIr. imma- lanna -lig ‘which lies about lands’ (with 
tmesis (Word Formation, p. 377), — » later/prosaic Hmma-lig lanna). This phenomenon 
(labelled “Vendryes’ Restriction” in Sims-Williams 1984: 167) must surely be linked 
with the fact that absolute and deuterotonic verbal forms, which are required in VSO 
order, seem originally to have included clitic elements subject to Wackemagel’s Law (see 
Pronouns, p. 370-371, Personal Endings, p. 372-374; for theories about the identity of 
these clitics cf. McCone 2006 and Karl & Stifter 2007: 30 1-402). 1 

FURTHER READING 

More detailed surveys are provided by MacAulay 1992 and Ball and Muller 2010, the 
former concentrating on the modem languages. The only comparative grammar is still 
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Lewis and Pedersen 1961. Being a revision of a work published in 1909-13, this does 
not reflect the vast amount of work on Celtic over the past hundred years. Its old fash- 
ioned IE notation has been followed to some extent in this chapter for the convenience 
of readers. There is no handbook of Continental Celtic as a whole, but Lambert 1994 
is a good introduction to Gaulish while Delamarre 2003 is an etymological dictionary. 
The classic grammar of Old Irish by Thumeysen (1946) still holds the field, but can be 
supplemented by McCone 1994 (Italian version 2005). Etymology is not covered in the 
Dictionary of the Irish Language (http://edil.qub.ac.uk/dictionary/search.php), and the 
Old Irish etymological dictionary by Vendryes (1959-) still lacks the letters E-L. The 
standard grammars for early Welsh, Cornish and Breton by Evans 1964, Lewis 1990 and 
Hemon 1975 can be supplemented by Temes 2011. Etymology is not the most up-to-date 
aspect of Geiriadur Prifysgol Cymru: A Dictionaiy of the Welsh Language (http://www. 
welsh-dictionary.ac.uk), which closely follows Pokorny 1959, but it is a convenient start- 
ing point, as are Pokorny 1959 and Matasovic 2009 for the Celtic languages as a whole. 

FURTHER READING 

In revising this chapter I am grateful to the editor and to J. F. Eska and P. Schrijver for 
helpful criticism. 
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PART 8 


GERMANIC 


Joshua Bousquette and Joseph Salmons 


We survey Germanic as a distinct branch of Indo-European, tracing developments that 
differentiate Germanic from IE up to attested Germanic varieties. We cover structural 
patterns shared across parts of the family and those that distinguish the sisters from one 
another. Beyond familiar examples (umlaut, case loss, verb second), we treat areas of 
recent progress, such as laryngeal phonetics and phonology, foot structure, definiteness 
and complementizers. We stress how these patterns cut across grammatical modules, con- 
necting phonology, morphology and syntax. That discussion is embedded in the available 
social and cultural context, e.g. language contact. 


INTRODUCTION 

Our earliest evidence for Germanic languages is from names and words recorded in clas- 
sical-language sources, including Tacitus, Caesar and Pliny. Most discussed are tribal, 
personal and place names, but they also include cultural vocabulary, some borrowed into 
Latin, e.g .framed ‘spear’, glesum ‘amber’ (cf. glass) and possibly cognates of soap ‘red 
hair dye’ (Green 1998: 185-188, Kluge 2011). 

Germanic languages are also attested in runic inscriptions written by speakers in the 
first centuries of the Common Era (Krause 1971, Antonsen 1975, Schulte 2006). These 
are not limited to Scandinavia (as with the Negau helmet, an early inscription, found 
in present-day Slovenia) and are identified with various Germanic languages, based on 
where they were found (problematic for inscriptions on portable objects like spearheads 
and brooches), orthography and structure. Page (2001) illustrates these difficulties for 
Frisian runic inscriptions, where arguments for identifying inscriptions as Frisian are ten- 
tative and limited to a few items; e.g. the evidence for Frisian provenance of the apparent 
personal name skanomodu assumes a connection to Gmc *skaun- ‘beautiful’, showing 
distinctively Frisian monophthongization. 

The first longer text preserved in a Germanic language is part of a Bible translation, 
attributed to Wulfila (Ulfilas) in the late 4th century, written in his Greek-based script. 
This is our major source for Gothic, in a 6th century Italian manuscript. Texts in the Latin 
alphabet begin after 500 ce, summarized here, with rough dates for composition and 
surviving manuscripts: 

(1) Early longer texts in selected Germanic languages 

Gothic late 4th c., manuscript from the 6th c. 

Old English mid-late 7th c., manuscript from the 8th c. 

Old High German mid 8th c., manuscript from the late 8th c. 

Old Saxon ca. 830, major manuscript from the mid 11th c. 

Old Norse mid lithe. 

Old Frisian late 11th c., manuscript from the late 13th c. 



388 JOSHUA BOUSQUETTE AND JOSEPH SALMONS 


The family is traditionally divided into three branches: 

• East Germanic: Gothic and fragments from other languages. The last known survivor 
was “Crimean Gothic”, recorded in 1562 (Steams 1978). 

• North Germanic: Runic and then Old Norse, represented by the modern Nordic 
languages. 

• West Germanic, including English, German, Dutch, Yiddish, Frisian, etc. 

These branches show their characteristics in the earliest attestations; see Robinson (1992: 
eh. 10). He eschews sub-branching within this tree, yielding the picture on the left; 
another view is that on the right (Salmons 2012: 84), a three-way division where North 
and West Germanic are closer to each other than to East Germanic. 


Proto-Germanic 


Proto-Germanic 




West Germanic North Germanic East Germanic North Germanic West Germanic East Germanic 

FIGURE 8.1 TWO MODELS OF GERMANIC SUBGROUPING 

Sociohistorical evidence supports the model on the right, with the relatively early 
departure of East Germanic from southern Scandinavia (Heather 2010), leaving a dialect 
continuum across North and West Germanic. 

One gauge for subgrouping is shared innovations that are unlikely to be independent 
innovations, but these prove problematic. In sound change, glide hardening - the devel- 
opment of stops from glides ( Verscharfung , Holtzmann’s Law), shared between Gothic 
and North Germanic - has long seemed exotic. In both languages, reconstructed geminate 
glides (*jj, *ww) appear as geminate stops, so that OHG triuwa (cf. troth) corresponds 
to ONor. trvggva and Goth. t?iggwa. Yet, while similar reflexes do appear, the outcomes 
do not always match up: OHG zweiio ‘of two’ corresponds to coronals in Goth, twad- 
dje and velars in Norse tveggia. And Faroese (but not its cousin Icelandic) underwent a 
later sharpening ( skerping ), where apparently geminate -jj- and -ww- hardened eventually 
into fortis geminates, palatal and velar respectively (data and analysis following Arnason 
2011: 31-33, who uses Old Icelandic as a starting point): 

(2) Faroese skerping (adapted from Arnason 2011: 31) 

Old Icelandic Intermediate stage Modem Faroese 

eyjar [oyjar] [d 3 :] oyggjar [o(i)t/:ai] ‘islands’ 

roa [row.wa]? [g v :] rogva [,iek v :a] ‘to row’ 

Worse, hardening is hardly foreign to West Germanic, though in different environments. 
Schirmunski (2010: 429) shows southern Swabian hardening of labiovelar glides to b, cf. 
dibar ‘your (pi.)’, MHG iuwer. In Low German, palatal glides can harden to a spirant, e.g. 
Soest [sya] eier ‘eggs’ < OS eiero < PGmc *ajj- (Hall 2014). Modem North Germanic, 
moreover, seems to show similar tendencies. Riad (2014: 21, 29) discusses the rise of 
“damped” or “buzzing” realizations of [i:] and [y:] as [i: z ] and [y: z ] and of a [[3] off-glide 
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after long high back vowels: [fi: z n] for fin ‘fine’ and [bu: p k] for buk ‘book’. Here, we have 
glide hardening without motivation from syllable structure, since these are additional 
coda material rather than new onsets. High vowels and glides harden into stops around 
the world; e.g. earlier Inscriptional Burmese -iy and -uw correspond to -it and -uk in Maru 
(Mortensen 2012). 

To take another example, rhotacism, the development of /r/ sounds from earlier *z, is 
shared by North and West Germanic: Gmc *deuza ‘animal’, Goth, dius, OHG tior and 
ONor. dvr. Here, too, the change is far from unique, as in Lat. fids ~ florem plus later, 
independent occurrences of rhotacism in Romance, Germanic and other IE languages 
(Catford 2001), undercutting its value as a diagnostic. 

In morphology, innovations are often built from inherited material in ways that could 
be independent, like class iv weak verbs (-nan) in Goth, gadaupnan ‘die’ and Norse 
vakna ‘awaken’, absent in West Germanic. Conversely, the innovative -st 2 sg. verbal 
inflection is characteristic of West Germanic only, attested first in Old High German in 
high-frequency verbs in main clauses, and later spreading to other tenses, moods and 
syntactic positions (Somers 2011). Independent developments in single branches may 
further cloud relationships. Gothic exhibits a synthetic middle passive, e.g. (ik) haitada ‘I 
am called’, where -da may be reflexive, consistent with reconstructed IE (p. 162). This is 
already in competition with reflexive verbal constructions using sik, sis, or seina ‘self’, a 
process that occurred in Greek (Ferraresi 2005: 106-109); synthetic passives are absent 
in later Germanic. (The Scandinavian middle is a later development, a reanalysis of the 
reflexive pronoun sik as verbal -sk; Faarlund 2004: 126-127, also Emonds & Faarlund 
2014.) 

If a unified Germanic once existed, it was perhaps ca. 500 bce. Early Runic shows 
surprising uniformity over time and space, possibly due to conservative writing traditions 
and a limited early corpus. The evidence shows differentiation across branches from the 
beginning; some West Germanic varieties show nasal loss before coda fricatives (/f, 0, 
x/) with compensatory lengthening of the preceding vowel, while others do not: Eng. 
five, mouth vs. Germ, funfi, Mund, from Gmc *femfa, *mun0a; a number of fonns show 
variable reflexes, cf. MHG sunt ~ slid. 

While a tree-like split into three distinct branches is well supported (Nielsen 2000, 
Heather 20 1 0), intra-Germanic contact was nevertheless pervasive. “North Sea Germanic” 
(Ingvaeonic) areas still form dialect continua, crossing the Dutch-German linguistic bor- 
der. And rich patchwork distributions of features occur across mainland Scandinavia. 
This has blurred the lines of a Stammbaum with chronic wave-like diffusion. 

More dramatically, English was initially formed by a mixture of Germanic varieties 
during the invasion of the British Isles, shaped by the shift of Celtic (“British”) speakers 
to English, then the North Germanic invasions during the Danelaw and eventually the 
Normans. What we now call English came into existence thanks to sociohistorical set- 
tings involving high numbers of bilingual and bidialectal speakers and second language 
and second dialect speakers, dialect contact (koineization or “new dialect formation”; 
Kerswill 2002) plus language contact (Trudgill 2010). This has led to the proposal that it 
has become North Germanic (Faarlund 2012) or a “fourth branch of Germanic” (Forster 
et al. 2006). These claims have garnered press attention, but the mainstream view remains 
that English is West Germanic with complex effects of historical contact (McWhorter 
2002, Trudgill 2010). 

Some characteristics of Germanic are often tied to the sociohistorical scenarios just 
sketched. Consider two examples. Substrate effects are likely in lexical material from 
pre -IE-speaking populations, and many terms reflect domains where vocabulary survivals 
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from earlier languages seem particularly plausible (flora, fauna, cultural vocabulary); 
Polome (1989) and Hamp (1990) argue for substrate status when words show aberrant 
phonological and/or morphological patterns vis-a-vis inherited vocabulary. Hamp lays 
out, for instance, the difficulties of the ‘apple’ word, showing reflexes of the extremely 
rare *b phoneme (also Salmons 2004, 2015). Let us turn to the more secure lexical con- 
nections with attested languages, Finnic and other IE groups. 

Germanic loans in Baltic Finnic show great antiquity and have been explored for what 
they show about Proto-Germanic (Kylstra 1991 — , Koivulehto 2001, both in some respects 
controversial). Finnish forms like kuningas ‘king’ (Germ. Konig) and ringas ‘ring’ retain 
nominal stem vowels plus endings lost in Germanic. Some words suggest pre -Germanic 
forms, like Finn, kana ‘hen’ with an unshifted *k, rather than Gmc *h. 

Borrowings from Germanic into Romance similarly allow us to date borrowings and 
identify the nature of contact. For example, early borrowings with word-initial labial 
glides from Germanic into Romance predate glides hardening: Old Frankish *wrakjo 
‘exile’ (OHG reccheo) becomes Old French garqun ‘servant’; Gmc *werre ‘war’ (Goth. 
warjan ‘defend’, cf. Eng. warrior, warden ) becomes Old French guerrer (cf. Eng. gue- 
rilla, guard). These cognates in English (post 1066) attest to borrowing from Germanic 
into Romance, then back into Germanic. 

Gennanic-Celtic contact predated the Germanic breakup, as all branches share some 
of the same cognates with Celtic. Borrowed from Celtic are some legal terms, including 
ancestors of words like oath (Lehmann 1986). Goth, andbahts ‘service, office’ (Germ. 
Amt) is distinguishable as a borrowing by the /a/ for IE syllabic *m (IE *-mb h i), which 
has anaptyctic /u/ in Germanic (OHG umbi ‘around’, but Gr. dp(pi). Another likely early 
borrowing is OIr. dim ‘fort, walled hillock’ (W dinas, ‘city’): OHG ziin ‘fence’ (Germ. 
Zaun ‘fence’), OEng./OS/ONor. tiin ‘village’ (Dutch tuin ‘garden, yard’). Zaun reflects 
the second sound shift, so predates that. While some accept such terms as Celtic loans 
(Orel 2003, Kroonen 2013), others are skeptical, e.g. Kluge 2011, where Zaun is of 
“uncertain origin” but presumably connected to Celtic. A third is more complex, Goth. 
eisarn- ‘iron’, OEng. isarn/iscern, OHG isarn, MDutch iser ‘iron’ (OIr. iarn). (The iron 
age came relatively late to Northern Europe, perhaps 400 bce, and some connection 
to Celtic is likely; Kluge (2011) suggests that Celtic and Germanic borrowed the term 
from a third language, while Lehmann (1986) notes possible multiple borrowings from 
Celtic.) 

Contact often also correlates with morphological reductions, e.g. case loss (O’Neil 
1978). A standard narrative about Germanic is framed around such “simplifications”. As 
argued below, that view must be balanced against increasing complexity, including in 
the segmental inventory (larger vowel systems) and new inflection. Simplification is a 
frequent result of contact, but hardly inevitable (see below). 

Germanic’s position within IE has been more controversial (see Polome 1972). Some 
(e.g. Ringe 2006: 5-6) assign Germanic to a “central” group including Balto-Slavic, 
Indo-Iranian, Armenian and Greek. Vocabulary is shared in complex ways with north- 
western neighbors (Meillet 1967), perhaps reflecting later contacts rather than earlier 
genetic unity. Contact took place with pre -IE languages, other IE branches and non-IE 
languages, especially Finnic (Roberge 2010, Mailhammer & Vennemann 2015), muddy- 
ing the waters between tree- and wave-like patterns. Like other IE languages, Germanic 
was likely forged from contact between Indo-Europeans and indigenous populations. Jut- 
land (Denmark), southernmost Norway and Sweden, and the Baltic and North Sea coasts 
are likely the earliest area where Germanic was spoken, with ongoing contact with other 
(post-)IE groups and non-IE groups. 
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PHONOLOGY 

Consonants 

Germanic stops and fricatives differ systematically from IE, other segments less so. The 
IE laryngeals, on mainstream views, were lost early and have little relevance to Germanic 
(but see Muller 2007). Traces of laryngeal phonology are visible in the ablaut system 
(Ringe 2006: 80): 

(3) PIE root-ablaut alternations pre-PGmc root-ablaut alternations 


a ~ 0 

> 

a ~ 0 

e~0~o 

> 

e ~ 0 ~ o 

Iqe-hj-hjO 

> 

e ~ 0 ~ o 

h,e ~ h, ~ h 2 o 

> 

a ~ 0 ~ o 

h 3 e ~ h 3 ~ h 3 o 

> 

o ~ 0 ~ 0 

eh, ~ h, ~ oh. 

> 

e ~ a ~ o 

eh 2 ~ h, ~ oh 2 

> 

a ~ a ~ 5 

eh 3 ~ h 3 ~ oh 3 

> 

5 ~ a ~ o 


Similarly, whatever the status of IE palatal versus velar stops, both surface in Gennanic 

as velars (Sihler 1995: 151-168): 

(4) IE English 

*yugom- yoke ‘yoke’ 

*genh,- kin ‘beget’ 

*gneh 3 - know ‘know’ 

IE syllabic resonants - liquids and nasals - resolve into *u plus resonant (Kluge 2011): 

(5) pre -Germanic Gennanic 

*wrm- *wunna- ‘worm’ 

*wlk w o- *wulfa- ‘wolf’ 

*ndher- *under- ‘under’ 

*smH- *sumera ‘summer’ 

This leaves this pre -Gennanic system: 


TABLE 8.1 PRE-GERMANIC CONSONANTS 



Labial 

Coronal 

Palatal 

Velar 

Labiovelar 

stops 






voiceless 

p 

t 


k 

k w 

voiced 

b 

d 


g 

g w 

murmured 

b h 

d h 


g h 

g wh 

fricative 


s 




sonorants 






liquid 


1, r 




nasal 

m 

n 




glide 

w 


j 
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Traditional IE obstruents differ from, yet correspond systematically to, Germanic: 


TABLE 8.2 IE AND GERMANIC STOPS 



IE 

Germanic 

voiceless 



labial 

*p 

*f 

coronal 

*t 

*e 

velar 

*k 

*x 

labiovelar 

*k w 

*x w > *h w 

voiced 



labial 

*b 

*p 

coronal 

*d 

*t 

velar 

*g 

*k 

labiovelar 

*g w 

* k w 

voiced aspirate 



labial 

*b h 

*b/*P 

coronal 

*d h 

*d / *5 

velar 

*g h 

*g/*Y 

labiovelar 

*ghw 

* g w / * y w 


In these changes (“Grimm’s Law’’, Germanic Consonant Shift, First Sound Shift), one 
change, *p > *f, reflects a change in manner of articulation, from stop to fricative, while 
others involve laryngeal phonology (meaning “states of the glottis”, i.e. phenomena like 
voicing and aspiration, not IE laryngeals). 

While the shift represents perhaps the most discussed sound change, discussion is often 
unclear about the precise laryngeal characterization of IE and Germanic stops and fricatives, 
especially with regard to the division of labor between phonology and phonetics. Crosslin- 
guistically, laryngeal contrasts can be captured with three features, [voice], [spread glottis] 
and [constricted glottis], plus laryngeally unmarked or unspecified consonants (Iverson & 
Salmons 1995, Avery & Idsardi 2001). In many languages, these contrasts are enhanced by 
additional phonetic cues, particularly for phonation (Henton et al. 1992). The traditional IE 
obstruent system would have had unmarked stops (*p, *t, *k, etc.), voiced stops (*b, *d, 
*g, etc.) and stops marked by both [voice] and [spread glottis] (*b h , *d h , *g h , etc.). Earlier 
voiceless stops become voiceless fricatives (“spirantization”), perhaps triggered by aspi- 
ration, as an enhancement. (A shift to a [spread glottis] system may have preceded these 
changes.) IE voiced stops devoice, a process found in many languages, perhaps motivated 
by the challenge of maintaining voicing - which requires considerable air flow across the 
glottis, while air flow is blocked in stops. The third are traditional “voiced aspirates”; mod- 
em descriptions of Indie treat them as “murmured” or “breathy”, with a slightly open glottis 
with high air flow; these deaspirate in Germanic. Controversy surrounds whether these 
become voiced stops or fricatives. For instance, a form like *b h end h - becomes Gmc *bind-, 
with voiced stops initially and after a nasal, but postvocalically we often find fricatives, e.g. 
ONor. ord ‘word’ from *wrd h o-. We assume underspecification for [continuant], similar to 
contextual stop-fricative alternations in Spanish. 

These changes - spirantization, devoicing, deaspiration - appear to be connected by 
encroachment of new sounds into the space of old ones or the filling of gaps created by 
earlier changes, “push” or “pull” chains respectively. On the fonner, if *d h deaspirated first, 
*d h er- would have risked merger with *der-, triggering devoicing of *der- to *ter-. On the 
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latter, if *pel- become *fel-, the lack of voiceless stops might allow devoicing to fill this 
gap. This pull is sometimes motivated in terns of voiceless stops as “unmarked”, lacking 
laryngeal specification. (Wrinkles include labiovelars; see Salmons & Smith 2005.) 

Germanic contrasts [spread glottis] or “aspirated” *p, *t, *k, etc. and unmarked *b, 
*d, *g, etc., rather than “voiceless” vs. “voiced”, making Germanic phonologically more 
different from IE than traditional views, with implications for later obstruent changes. 1 
(Dutch and Yiddish are [voice] languages, in both languages likely due to contact with 
Romance and Slavic, respectively.) 

A set of Germanic forms shows irregular voicing in fricatives, which once constituted 
a challenge to claims about the regularity of sound change: 

(6) Vemer’s Law 

Skr. bhratar- Gmc *bro0er ‘brother’ 

Skr. matar- Gmc *moder ‘mother’ 

Vemer (1875) famously established that this voicing was actually regular, conditioned by 
position of the IE “mobile” accent (§2.3), so that (expected) voiceless fricatives appeared 
where Sanskrit has a preceding accent and voiced when the accent did not precede. Ger- 
manic provides evidence for the position of the IE accent in verbal paradigms, reflected 
in a few modem alternations like Eng. was ~ were (/r/ from *z). Unmarked stops (Germ., 
Eng., Dan. <b, d, g>) are, as noted, treated as unspecified. They often lack phonetic voic- 
ing in initial and final position (systematically in Icelandic), but in a voice-friendly envi- 
ronment like between vowels, they are phonetically voiced. A similar kind of “passive 
voicing” (Iverson & Salmons 2003, among others) may have been at work in Verner’s 
Law: IE accent was likely realized with stiff vocal folds, so that a preceding accent would 
inhibit fricative voicing. 

Another “exception” is equally important, but never played the pivotal role that Verner 
patterns did concerning regularity: IE voiceless stops did not shift after *s: *sper- retains 
its stop in modem Eng. spear. Germ. Speer. The modern absence of aspiration - or 
phonological specification for [spread glottis] - supports connecting spirantization to 
aspiration. 

These developments illustrate continuity and change within Germanic. Complex 
consonantal “chain shifts” like Grimm’s Law are uncommon in the world’s languages, 
but occur again within later Germanic. High German is defined traditionally by a set of 
changes known often as the Second Consonant Shift (“High German Consonant Shift”). 
Like in Grimm, stops become spirants, though variably by environment and place of 
articulation, and in certain varieties and context showing a partial shift to affricate rather 
than fricative. Intervocalic stops are prone to shift but initial or post-consonantally less 
so, and coronals preferably over velars, as in English (unshifted) and German (shifted): 

(7) The Second Consonant Shift 


English 

German 

path 

Pfad 

help 

helfen 

town 

Zaun 

eat 

essen 

cool 

kiihl 

milk 

melken 

book 

Such 
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Modern parallels include affrication of initial /t/ in some varieties of Danish and affrica- 
tion/spirantization in Liverpool English (e.g. Honeybone 2002). 

The Second Sound Shift has a broader chain-like character. The lenis (in traditional 
terms “voiced”) *d becomes /t/ except in the north: Eng. drink is cognate with Germ. 
trinken. Farther south, *g becomes fkJ, and *b becomes /p/: Old Bavarian perec ‘moun- 
tain’, cf. Berg. Possibly related is the “stopping” of interdental *0 across most of Ger- 
manic. Excepting variation based on dialect and the environment, the basic progressions 
of the sound shifts are illustrated in (8). 


( 8 ) 


Indo-European 

Germanic 

Old High German 

b h d h g h - 



b d g — ~ * 

b d g — > 

b d g 

p t k — > 

p t k — * 

p t k 

— » 

f \> x>h — __ 

pf ts kh 


' — * 

f d h 

s > 

z > 

s 

r 



on of c 


sp st sk 

op al aA 

* 


Other widespread patterns include final laryngeal neutralization, “final devoicing” 
(Iverson & Salmons 2011). Some patterns of variability, like a wide range of reali- 
zations of rhotics across languages and dialects, continue ancient variation (Howell 
1991). 

Overall, attested changes often parallel those posited for prehistory and early history, 
with occasional twists on claims about innovation: while English is often tagged as an 
ill-behaved member of the family, its obstruent system remains surprisingly close to West 
Germanic. 


Vowels 

Early Germanic maintains a vowel system close to that of IE after the emergence of dis- 
tinct long vowels, with a five-vowel system with a length contrast. Prehistoric low-back 
vowels merged, in different directions by length: 


(9) (latef Indo-European > 


*g h ans 

*orb h - 

*b h rater 

*plo(w)- 


Germanic 

*gans 

*arb- 

*bro0er 

*fl5- 


‘ goose’ 

‘heir, inheritance’ 
‘brother’ 

‘flow’ 


Such changes have parallels in IE and are reenacted with the contemporary North Amer- 
ican merger of the low-back vowels /a/ and hi. 

A further early change is the emergence of a new long vowel, e 2 . In West Germanic, 
old *e (“*e 1 ”, occasionally written *te) had lowered, and the new vowel, attested in Old 
High German as ia, appears in cognates of here and as the stem vowel in class VII strong 
verbs of the let type, etc. 
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(10) The Proto-Germanic vowel system 


Short vowels 

Lons vowels 

*widuw- ‘widow’ 

*wTd- ‘wide, broad’ 

*fimf ‘five’ 

*slm- ‘string’ 

*et- ‘eat’ 

*ded- ‘deed’ 

*nas- ‘nose’ 

*bro0er ‘brother’ 

*wull- ‘wool’ 

*mus ‘mouse’ 



i: 

u: 

e: 

o: 


Earlier *ey monophthongizes to *1. Aside from that, Germanic also included three 
“diphthongs”; that the *a in these participates in the low-back merger may suggest that 
these may have been vowel-glide sequences rather than single units. 


(11) Proto-Germanic “diphthongs” 

aj 

ew 

aw 

*ajh-/ajg- ‘have’ 

*skajd- ‘separate’ 

*dewp- ‘deep’ 

*kews- ‘choose’ 

*rawd- ‘red’ 

*hawzij- ‘hear’ 


From there, things become steadily more complicated, driven by patterns of vowel to 
vowel assimilation. Umlaut can be understood only in the context of height harmony and 
consonantal conditioning. In Gennanic, vowels assimilated partially to the height of fol- 
lowing vowels, e.g. /e, o/ > /i, u/ before high vowels, as well as before coda nasals (with 
parallels in some modern dialects), and Gothic shows a general raising of Id, though Id 
still appears before /h, h w , r /. A parallel lowering before Id is also illustrated below: 

( 12 ) 


*geban 

> 

OHG geban ~ gibis ‘give ~ you give’, but Goth, giban 

*helpan 

> 

OHG helfan ~ hilfis ‘help ~ you help’, but Goth, hilpan 

*ed- 

> 

ONor. eta, OHG ejj an, but Goth, itan ‘eat’ 

*bendan 

> 

OHG bindan ‘tie’ 

*wiraz 

> 

OHG wer ‘man’ 

*juka 

> 

OHG joh ‘yoke’ 


Within Continental West Gennanic “primary” umlaut of Id to Id before i/j shows rais- 
ing but with additional fronting. That process was blocked dialectally by coda /h, r, 1/. 
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It preceded the general fronting of all back vowels since these are different structural 
patterns. Moreover, failure of umlaut is attested in southern German dialects (particularly 
with geminate velars) and in western Netherlandic (blocking environments or a) (Buccini 
1995, Iverson & Salmons 1996, Howell & Salmons 1997). In other daughters, chronolog- 
ical differentiation is not demonstrable, e.g. Old English. Conditioning also differs across 
the family: North Germanic umlauts heavy stems (gestr ‘guest’ < *gastiz) but not light 
ones ( stadr ‘place’ < *sta6ir) of the relevant morphological classes (Iverson & Salmons 
2012). See §3.2 on the morphologization of umlaut. 

Above we discussed chain-like consonant shifts, e.g. with fortis or voiceless stops 
repeatedly becoming affricates and/or fricatives. Similar changes are pervasive in the 
vowel system. “Chain shifts” have been extensively described for North and West Ger- 
manic (Sievers 1876, Kuspert 1988, Wiesinger 1970), and related shifts are underway 
across North America (Labov 1994, among many others). Consistent trends emerge 
across these independent changes: raising of long or tense vowels, lowering of short or 
lax vowels and fronting of back vowels. 


Prosody 

IE “accent” was marked by pitch and was mobile in that it could fall on different syllables 
within the paradigm, like in modem Russian (§2.1). The Germanic “Accent Shift” fixes 
the accent on the initial syllable, obviously after Verner’s Law. Germanic stress correlates 
broadly with vowel duration and intensity. Kiparsky & Halle (1977) and Halle (1997) 
provide phonological accounts of the change, while Salmons (1992) argues that language 
contact played a role, alongside such language-internal motivations. 

The effects of this shift are far-reaching, including ties to innovations in verbal and pro- 
nominal agreement. For example, Gothic first and second person dual subject pronouns 
wit and jut seem to derive from the stems of plural weis and jus and an enclitic variant of 
‘two’, twai/twa/twos. Clitics were arguably present in earliest Old Norse (Faarlund 2004: 
35, though see Biberauer & Roberts 2008: 105) but disappeared with the development of 
the Scandinavian lexical accent, ca. 800 (Oftedal 1952, Kristoffersen 2011). This accent 
counteracted the clause-based prosody of other Germanic languages; Scandinavian lexi- 
cal items determined prosody, independent of clausal position. 

West Germanic reduction of final unstressed syllables introduces paradigmatic ambi- 
guity, since Germanic typically marked information on suffixes. Consider the three weak 
verb classes well attested in West Germanic and distinguished by final vowels (§3.1, §4). 


TABLE 8.3 OLD HIGH GERMAN WEAK VERBS 


Verb class 

Old High German 

Modem German 

i 

fellen ‘fell’ 

fallen ‘fell’ 

ii 

diondn ‘serve’ 

dienen ‘serve’ 

iii 

haben ‘have’ 

haben ‘have’ 


As unstressed vowels reduced to schwa, these distinctions were lost. Such increased 
ambiguity within inflectional paradigms, often presumed to be driven by reduction and 
subsequent loss of morphological marking, correlates with an increase in periphrastic 
forms, e.g. higher frequency of pronominal subjects (Axel & WeiB 2011) and develop- 
ment of obligatory definite articles in Old High German (§5.5). 

Other processes create new affixes and inflectional categories. Typically unstressed 
elements like pronouns cliticize onto a preceding word, creating preferred foot structures, 
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and pronominal enclitics can be reinterpreted as inflectional affixes or pronouns (De 
Vogelaer 2010). The OHG 1 pi. verbal -emesl-emes, as in habemes, alternates with a short 
form, haben, both meaning ‘we have’. The longer inflection is likely from an enclitic uuis 
‘we’, with assimilation of the coda nasal Ini or /m/ to the bilabial glide in the onset of 
the pronoun, yielding /m/. Axel (2007: 305) and Axel & WeiB (2011) show that the long 
ending regularly appears without a co-occurring subject pronoun, arguing that the long 
ending is in fact a reanalyzed pronominal form; subject pronouns are obligatory with 
the “short”’ ending (see also Early Modem Nuremberg -ma (1 pi.); Fertig 2000: 47). 
Rhotacized forms regularly appear in all Germanic languages other than Gothic, so the 
innovative inflection with OHG s reflects a prehistoric process. 

Similar are innovative 1 pi. pronouns, including Flemish, Zeelandic and Brabantic 1 pi. 
me, derived from enclitic wij/we and the preceding nasal Ini of the verbal inflection, which 
can appear not only phonetically conditioned but also in topic position (De Vogelaer 
2010: 15); and Bavarian 1 pi. mir/ma from wir (FuB 2004, 2005), which may similarly 
appear in topic position. Analogous pronominal innovations in North Germanic similarly 
derived Old Norwegian 1 pi. mit/mer from wit/wer, prevalent between 1200-1400 ce; and 
Sw. 2 pi. ni from i during the 17th century (Haugen 1976: 302, 375). Innovative dir also 
appears in numerous dialects of German (Weise 1907: 20 Iff), from reinterpretation of 2 
pi. inflectional -t as part of pronominal ihr. 

A final prosodically-driven change is complementizer agreement (C-agr), from the 
reanalysis of pronominal elements as inflectional affixes. C-agr is possible on a number 
of different lexical categories heading a subordinate clause (Reis 1985, Kathol 2001). 
C-agr is attested across multiple non-standard West Germanic varieties as early as the 
mid 13th century (Goeman 1997). An oft-cited example is from Bavarian (Bayer 1984: 
233), where the complementizer wenn ‘if’ inflects for person and number, here 2 sg. 

(14) Wennst kummst 
if-2sG. pro come-2sG. 

‘If you come’ 

Similar to innovative inflectional affixes and pronouns, the encliticization of unstressed 
pronouns to a stressed lexeme is argued to be a historical prerequisite for complementizer 
agreement (WeiB 2005, forthcoming). Together with innovations in the verbal and pro- 
nominal paradigm, C-agr illustrates prosodically based complexification and is relatively 
stable in non-standard varieties and language islands (Bousquette 2014). 

In another area of prosody, syllable and foot structure, the classic insight into Germanic 
syllable structure is “Prokosch’s Law” (the “Weight Law”), that stressed syllables must 
contain two moras. 2 Murray & Vennemann (1983) show a conspiratorial pattern of sound 
changes in all three branches of Germanic consistent with one syllabification of medial 
consonants: (1) West Germanic gemination, (2) North Germanic vowel lengthening and 
(3) Gothic glide strengthening. 

These effects all fall out, they argue, from the conflict between Prokosch’s Law (which 
they call the “Stressed Syllable Law”) and the Syllable Contact Law, namely (1983: 520, 
cf. Vennemann 1988, S = syllable boundary): 

(15) Syllable Contact Law. The preference for a syllable structure A$B, where A and B 
are marginal segments and a and b are the Consonantal Strength values of A and B 
respectively, increases with the value of a minus b. 

For example, Icelandic and Faroese show slightly different patterns of syllabification in 
CVCCV- words, according to consonantal “strength”. Icelandic allows more word-internal 
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onset clusters, p, t,k,s+ j, v r, so visSna ‘wither’ but vd$kva ‘to water’; Faroese only toler- 
ates p, t,k + r, /, except tl, so sjogSvur ‘sea’ but suSkri ‘sugar’ (1983: 522-523). 

Similarly, Dresher & Lahiri (1991) propose that a wide range of early Germanic sound 
patterns can be captured in terms of foot structure. They define the “Germanic foot” 
as “maximally binary, left-headed, where the head must dominate at least two moras” 
(1991: 251). The head could include one or two syllables, and the non-head element may 
contain only one mora. That is, a foot required a head of two moras, which could come 
from one heavy syllable or from two light ones. In their analysis of Old English High 
Vowel Deletion (simplifying), high vowels delete only in the weak branch of a foot, but 
those in the head are maintained: CV:C or CVCCV, in OEng. go$d < *go$du ‘good’ or 
word < *wordu ‘word’, for example) versus CVCV, in OEng. lofu ‘praises’. 

In prosody, Germanic and its daughter languages historically also show patterns where 
prosodically weak, phonetically reduced elements affix to adjacent syllables (as in the 
cliticization examples above). They thereafter could be incorporated into a “prosodic 
word” (pword), allowing processes otherwise confined to word boundaries, as noted in 
Somers Wicka’s (2009: 8) treatment of the Otfrid’s OHG Evangelienbuch. Hanna (2009) 
revisits Jespersen’s Cycle, also in the Evangelienbuch, as well as in the MHG Nibelun- 
genlied, arguing that “secondary stress [as] the outcome of an interaction between pho- 
nology and syntax ... is sensitive to rhythmic context, to syllable structure, and to vowel 
height” (2009: 201). Characteristic of Germanic even from early attestations is the pho- 
nology’s sensitivity not only to foot structure but also to syntactic environment. 

Smith (2004) and Smith & Ussishkin (2014) frame foot and other prosodic structures 
in terms of templates, allowing us to capture conspiracy-like effects here, just as Murray 
and Vennemann do for the syllable level. 

The traditional view of Germanic prosody is that the accent shift triggered weakening 
of final syllables leading to the dramatic reductions, “simplifications”, in inflectional mor- 
phology, discussed just below. We have also illustrated how reduction can have the oppo- 
site effect, increased complexity via cliticization and grammaticalization. More broadly, 
a simple “phonology first” view of such changes has been questioned within Germanic 
and beyond recently by Enger (2013) and Menz (2010). Enger (2013: 18) concludes that 
sound change alone cannot account for reductions in the history of Norwegian, arguing 
for a driving role for an autonomous morphology. 


MORPHOLOGY 
Inflectional morphology 

Germanic strong verbs exploit ablaut for tense. Traditionally divided into 7 classes, the 
first 5 are inherited from IE and involve a present tense e-grade, which alternates with an 
a-grade in the preterit singular, and zero-grade in the preterit plural and past participle. 


TABLE 8.4 GERMANIC STRONG VERBS, CLASSES I-V 


I 


Template 


CejC 


Present e-grade 

Preterit singular 
tf-grade 

Preterit plural 
0-grade 

Past participle 
0-grade 

Meaning 

dreiban 

draib 

dribum 

dribans 

‘drive’ 

reidan 

raid 

ridum 

ridans 

‘ride’ 

leugan 

laug 

lugum 

lugans 

Tie’ 

sleupan 

slaup 

slupum 

slupans 

‘slip’ 


II 


CewC 
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III 

CeLC 

helpan 

halp 

hulpum 

hulpans 

‘help’ 


CeNC 

drinkan 

drank 

drunkum 

drunkans 

‘drink’ 

IV 

CeL 

stelan 

stal 

stelum 

stulans 

‘steal’ 


CeN 

k w eman 

k vv am 

k w emum 

k w umans 

‘come’ 

V 

CeC 

geban 

gab 

gebum 

gebans 

‘give’ 



wegan 

wag 

wegum 

wegans 

‘move’ 


This so-called e-group is built around the CV templates shown in Table 8.4 (second 
column). In the absence of a preterit plural root vowel, glides in class I and II develop 
respectively into a short /i/ and short /u/; in class III the liquids and nasals epenthesize 
/u/ (§1). The long /e/ in the zero-grade of class V and in the preterit plural of class IV are 
innovations on analogy to other strong classes (see below). 

Class VI and VII are often characterized by an /a/ rather than /e/ root vowel, and utilize 
two different strategies for differentiating between the principal parts (van Coetsem 1994: 
eh. 8). Class VI verbs show /o:/ in the preterit singular and plural, and the same, short a-grade 
fonn for the participle as in the present. Class VII verbs show the same syncretism between 
the present and participle forms, but mark the preterit singular and plural by partial reduplica- 
tion. Except for s+stop clusters, Gothic reduplicates only the initial consonant(s) and a fixed 
short /e/, as in *flokan ‘bewail’ ~ <faiflok> containing reduplicated [fe-], ‘they bewailed’, 
with fixed vowel and simplified onset cluster. With .s+stop clusters, we find reduplication 
of st- and sk-, as in gastaldan ‘to receive’ ~ gastaistald (Nehemiah 5:16) and afskaidan ‘to 
part ways’ ~ afskaiskaidun (Luke 9:33). Class VII verbs are not attested as a productive 
tense-marking strategy in North or West Germanic (Bammesberger 1986: 64, Ringe 2006: 
248-250). 3 Still, while the e-grade and a-grade strong verbs mark the principal parts dif- 
ferently, a-grade class VII verbs look strikingly like class I — III verbs of the e-grade in their 
consonants, and class VI verbs are built similarly to class IV and V (van Coetsem 1994: 127). 

Returning to the origin of /el in the preterit class IV and V, the structural similar- 
ity to class IV verbs suggests analogous use of a long vowel to distinguish tense. The 
reconstructed zero-grade from geban ‘give’ would be *gbum in the preterit plural, not a 
licit consonant cluster in Germanic; a lengthened root vowel on analogy to the class VI 
verbs repairs an otherwise unpronounceable consonant sequence. Beyond repair strate- 
gies, analogy must also account for the presence of /el in IV verbs like stelan ~ stelum 
‘steal ~ stole. pl’ and k w eman ~ k"emum ‘come ~ came.PL’, the expected forms of which 
would have been stelan ~ *stulum and k"'eman ~ *k w umum, since zero-grade liquids and 
nasals develop epenthetic /u/ in Germanic, preserved in the preterit presents. This is also 
the case with class III verbs helpan ~ hulpum and drinkan ~ drunkum. Analogy does not 
occur since class III verbs are structurally similar to class VII, which does not use vowel 
length alternations to distinguish principal parts. 

In addition to the 7 strong classes, weak verbs - see §4 - formed their preterits with a 
/d/ or /t/ suffix, as in Modem Eng. -ed or Modern Germ. -te. This “dental preterit” was 
organized into four classes, which often correspond to particular functional distinctions: 
(i) causatives/denominatives, (ii) iteratives, (iii) duratives and (iv) inchoatives. 

The trend is a steady expansion of weak verbs at the expense of the strong (cf. Lieb- 
ennan et al. 2007, Carroll et al. 2012). The process has left many residues and much 
variation at any given stage. Historically strong class III OEng. meltan ‘melt’ has become 
weak (e.g. melted) but preserves a frozen adjectival form, molten, derived from the 
strong paradigm. Ongoing change is visible in Modem Germ, backen ‘bake’, which has 
an archaic strong preterit buk alongside the more common, innovative backte in the 1/3 
sg., but the strong past participle gebacken. Movement in the other direction is rarer, 
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e.g. American English dive, historically weak, has developed a strong preterit dove, likely 
on analogy to verbs such as drive ~ drove. 

A last category is the so-called preterit presents, verbs with a preterit form but present 
meaning. This category includes what have become our modem modal verbs, e.g. OHG 
mugan ‘may’, skulan ‘shall’, wellen ‘want’; others like muozan ‘must’, kunnan ‘can’, 
durfan ‘be allowed to’ (ONor .purfa ‘require, need’, Modern Dutch durven ‘dare’). Early 
on, these arguably acted more like full verbs. These maintain archaisms from the strong 
paradigms, including the syncretism of the first and third person singular. Additionally, 
since the preterit presents already have a strong preterit fonn in the present, the past is 
formed by adding the weak dental suffix. 


Nominal system 

The IE system of case, number and gender is complex relative to most attested languages, 
including Germanic. IE had 8 cases: nominative, accusative, dative, genitive, ablative, 
locative, instrumental and vocative. Germanic is reconstructed as having six, based on 
attestations in the earliest texts. The early daughters retain four robustly: Nominative, 
Accusative, Dative and Genitive, and two more in limited fashion: Vocative is attested 
only in Gothic, and Instrumental is attested in West Germanic. The receding of the Voc- 
ative and Instrumental are early examples of case loss. Vocative was absorbed into other 
classes; even in Gothic it was already identical in form to the Nominative in the plural, a 
pattern inherited from PIE. Vocative differed in form from the nominative singular only 
when the nominative singular ended in -s, which includes only Gothic masculine a-stems 
(IE short o-stems) and their subclasses. In that event, the vocative was identical in form 
to the accusative singular (yvulfs ‘wolf’ nom.sg.; ww/f ‘wolf’ acc./voc.sg. An exceptional 
case is the M-stem sunus ‘son’ nom.sg.; sunu ‘son’ acc./voc.sg. 

The instrumental was attested in Old High Gennan but restricted to a subset of nominal 
classes even early, specifically the masculine and neuter a-stems, the masculine «-stems 
and the masculine and feminine /-stems. Some of the nouns belonging to these classes 
are particularly well suited to instrumental use, as in swert ‘sword’ (neuter a-stem), scaft 
‘spear’ (feminine /-stem) and witu ‘wood’ (masculine «-stem, OEng. wudu), as in (16) from 
the Hildebrandslied, 1. 40. Such an example is provided below, in (17), from the Hildeb- 
randslied 1. 37, as Hadubrant expresses his distrust of his opponent (his father, Hildebrand). 

(16) wili mih dinu speru werpan 

want.2sG.PRET.OPT. me.ACC.SG. your.iNSTR. spear.iNSTR.SG. throw 

‘You would want to kill (lit. ‘throw’) me with your spear’ 

Hadubrant’s utterance three lines earlier shows an instrumental - doubly marked morpho- 
logically and by a preposition, mit ‘with’ - as he suggests how the untrusted Hildebrand 
should hand over an armband as a gift. 

(17) mil geru seal man geba infahan 

with spear(point).iNSTR.SG. should one gifts receive 

‘One should receive gifts with the point of the spear.’ 

Eventually the dative incorporates the function of the instrumental, most often also 
marked with a preposition. This is likely accelerated by the propensity for heavy a-stems 
in particular to appear without inflectional morphology, due to both their syllable structure 
and analogy to uninflected nominative and accusative forms in the paradigm (Braune & 
Eggers 1987: 182-183). 
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Number is similar to IE, with singular, plural and dual. The dual was restricted in the 
earliest attestations. While dual pronouns exist in Old Norse, Old English, Old Saxon and 
Gothic, only Gothic retains a verbal paradigm specific to the dual; in other languages, 
dual pronouns inflect as plural (Howe 1996). 

Gender largely echoes what is reconstructed for (late) IE. Germanic retains three 
genders - masculine, feminine and neuter - but daughters tend to lose this distinction 
in the plural. While many early languages maintained gender-specific inflections in the 
nominal paradigm (e.g. for a group of men), variation increases when the plural involves 
nouns of more than one gender. For example, Luke 8:1-4 (Codex Argenteus) describes a 
gathering of both men and women who listen to Jesus preach. Following a dative plural 
(which is ambiguous for all genders in the pronoun and strong adjectives), the relative 
pronoun for the gathering of men and women is masculine plural paiei : gaqumanaim pan 
hiumam managaim jah paim paiei us baurgim gaiddjedun du imma, qap pairh gajukon: 
(‘And when many people were gathered together, and were come to him out of every city, 
he spake by a parable:’, Luke 8:4, www.wulfila.be). Masculine is sometimes the default 
for indefinites (cf. Codex Argenteus, Luke 8:13), but also appears as a masculine plural 
relative pronoun paiei, even anaphorically to a feminine singular noun so manage i ‘the 
multitude’, alja so managei, paiei ni kunnun witop, fraqipanai sind ‘but this multitude 
who do not know the law are cursed’ (Codex Argenteus, John 7:49). Over time, the trend 
is to retain gender distinctions in the singular but to collapse gender into a unified plural 
in the nominal and pronominal paradigms (except Modem Icelandic; Einarsson 1945). 

The nominal system remains based on inherited classes, organized primarily around 
thematic vowels and secondarily around gender. Key Germanic noun classes include the 
a-stems (masculine and neuter, IE o-stems), o-stems (feminine, IE d-stems), /-stems (mas- 
culine and feminine), M-stems (masculine and feminine) and subclasses of these. Consonant 
stems include the 7-stems (a small class, familial relations), 77 -stems (the basis for new 
weak adjective endings) and 7?rf-stems (based on participles). Additionally, there are smaller 
classes like the -izl-az stems (originally comprising names for baby animals; this class over 
time comes to play a major role in German plural formation, along with /-stems). 

Masculine a-stems, illustrated with ‘day’ in Table 8.5, are one of the largest and most 
stable noun classes. Still, changes are visible even early in the textual evidence of the 
collapse of some parts of the paradigm, due to a combination of sound change and (sub- 
sequent) morphological changes. 


TABLE 8.5 .4-STEM MASCULINE NOUNS (adapted from Prokosch 1939: 241-242; Braune & 
Eggers 1987: 182) 



Germanic 

Gothic 

Old Norse 

Old High German 

Old Saxon 

Old English 

singular 

nom. 

*dagaz 

dags 

dagr 

tag 

dag 

dceg 

acc. 

*daga(n) 

dag 

dag 

tag 

dag 

dceg 

gen. 

*dagas(a) 

dagis 

dags 

tages (-as) 

dages, -as 

dceges 

dat. 

*dagai 

daga 

degi 

tage (-a) 

dage, -a 

dcege 

instr. 

*dago 

— 

— 

tagu, -0 

dagu, -0 

— 

voc. 

*dag 

dag 

— 

— 

— 

— 

plural 

nom. 

*dagozez 

dagos 

dagar 

taga (-a) 

dagos, -as, -a 

dagas 

acc. 

*daganz 

dagans 

daga 

taga (-a) 

dagos, -as, -a 

dagas 

gen. 

*dago(n) 

dage 

daga 

tago 

dago 

daga 

dat. 

*dagamaz 

dagam 

dggum 

tagum, -om; -un, -on 

dagon, -un 

dagum 

instr. 

*dagamiz? 

— 

— 

— 

— 

— 

voc. 

?*dagozez 

— 

— 

— 

— 

— 
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Gothic maintains a comparatively unambiguous paradigm (except the vocative), while 
later daughters exhibit changes that introduce ambiguity or reduce unifonnity across the 
paradigm. For example, West Germanic languages collapse the nominative and accusa- 
tive singular, either through the extension of the accusative to the nominative, or due to 
the loss of the nominal inflection -s (in Gothic) or -r (in Old Norse). A similar collapse 
is visible in the nominative and accusative plural, though Old English and Old Saxon 
extend the nominative into the accusative, while Old High German extends the accusa- 
tive into the nominative. In terms of sound change, Old Norse shows /-umlaut in dative 
singular, and M-umlaut in the dative plural, resulting in three different vowels across the 
paradigm. Old English, on the other hand, experienced Anglo-Frisian fronting of /a/ to 
/re/ (“brightening”), visible in the singular but reversed in the plural, by height harmony 
with the suffixal /a/; the dative plural change was the same process. This split introduces 
an alternation correlated with the singular/plural distinction. 

The increasing tendency to mark singular/plural within the existing nominal class was 
widespread across Germanic. Similarly, nouns realign from one noun class to another or 
adopt new inflections based on formal similarities to other classes. Consider the Old High 
German comparison below, between masculine /-stem gast ‘guest’, neuter -izt-az stem 
lamb ‘lamb’ and neuter a-stem wort ‘word’. 


TABLE 8.6 EXAMPLES OF OTHER NOUN CLASSES (adapted from Sonderegger 2003; Prokosch 
1939: 245; Braune & Eggers 1987: 182-198) 

gast (/-stem) lamb (iz/az- stem) wort (a-stem) 



oldest OHG 

late OHG 

OHG 

oldest OHG 

late OHG 

singular 

nom. 

gast 

gast 

lamb 

wort 

wort 

acc. 

gast 

gast 

lamb 

wort 

wort 

gen. 

gastes 

gastes 

lambes 

wortes 

wortes 

dat. 

gaste 

gaste 

lambe 

worte 

worte 

instr. 

gastu/gastiu 

— 

lambu, -o 

wortu 

— 

plural 

nom. 

gasti 

geste 

lembir 

wort 

wort 

acc. 

gasti 

geste 

lembir 

wort 

wort 

gen. 

gasteo, -io 

gesto 

lembiro 

worto 

worto 

dat. 

gastim 

gesten 

lembirum, -un, -on 

wortum 

worten 


Through sound changes, especially the rise of umlaut in plurals, the clearest difference 
in the paradigm is between singular and plural, marked by a vowel alternation originally 
conditioned by suffixal -/ (gast, oldest OHG); this remains as a morphologically condi- 
tioned vowel alternation after the conditioning high vowel has reduced to a schwa and 
disappeared. The same alternation arises for lamb. The neuter a-stems, like wort, have 
more paradigmatic ambiguity, with uninflected forms for nominative and accusative, in 
both the singular and plural. At the same time, the inflectional endings in the singular 
are identical to those of the masculine /-stems and of the -izl-az stems. In late Old High 
German and especially in Middle High German, some former a-stems mark plurality with 
umlaut, on analogy to unrelated noun classes, in order to repair a deficient paradigm. Cru- 
cially, this involves the spread of a portmanteau affix from one class as a plural marker in 
another. Some of these include MHG rat ~ reder ‘wheels’, blad ~ bleter ‘leaves, pages’ 
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and hits ~ hiuser ‘houses’. This is indicative of the tendency toward marking of number 
to expand at the expense of noun classes ( Pluralprofilierung ; Luiten 2011, Luiten et al. 
2013, and many others), and reflects the weakening of the inherited noun class system. 

One additional noun class deserves mention, »-stems, from which the so-called weak 
adjectival endings were derived. Not inherited from IE, weak adjectival endings were 
employed in Germanic to mark definiteness (Behaghel 1923: 197); the use of inherited, 
strong adjectival endings marked indefinites. In short, definiteness marking in Germanic 
arose with an innovative system of complementary adjectival paradigms before the 
development of definite or indefinite articles (Harbert 2007: 130ff). For pronouns, the 
fullest available historical treatment is Howe 1996. 

WORD FORMATION 

Trips (2014: 385), focusing on Germanic and noting some exceptions, writes this about 
diachronic derivational morphology: 

Much has been written about derivation, and more generally about word formation, 
from a theoretical and synchronic perspective describing and analyzing instances of 
derivation in many different contemporary languages. Much less attention has been 
paid to historical aspects of derivation (or word formation), especially to the rise of 
affixes and systematic descriptions and analyses of affixal inventories available in 
diachronic stages of languages. 

We briefly survey some key developments in nominal and verbal derivation diachronically. 

Reflecting syntactic patterns, compounding is restricted by the relative ordering of 
constituents. Modem West Germanic dialects and languages exhibit both left-headed and 
right-headed structures, meaning that various grammatical elements either precede or 
follow their complements. For example, prepositions precede the nouns they modify; 
in English, verbs precede their complement - often a noun - but the opposite is true 
for German and Dutch. Reflecting the underlying typology of the respective systems as 
either left- or right-headed, English and German may have more or less free combina- 
tions of adjectives, nouns and verbs in the formation of nominal compounds. However, a 
preposition must uniformly precede its complement, even in word formation, since post- 
positional prepositions are a mere subset of prepositions as a whole (e.g. underworld, 
German Unterseeboot ‘submarine’; lit. ‘undersea boat’ (Konig & Gast 2012: 260ffi). 
Additionally, verb-object compounds - even in Modern English - are no longer produc- 
tive. Compounds such as turncoat, cutthroat and spendthrift sound archaic. Object-verb 
constructions like barnburner and fishfry are acceptable for most speakers, and the strat- 
egy is productive for new compounds. 

Germanic compounding is more rigid, being exclusively right-headed, reflecting the 
syntax of the early Gennanic period (Harbert 2007: 30ffi). Runic and Gothic compounds 
of two nouns reflect older “stem-based” compounds, in which the first element is not a 
stand-alone noun, marked only with a root and thematic noun class marker; case is assigned 
only to the rightmost element in the compound (Lass 1994: 194). Possible remnants of 
these earlier stages include such examples as Schafskopf (a card game, literally ‘sheep’s 
head’), which shows an archaic word-internal genitive inflection on the first element of 
the compound, a paradigmatic Fugenelement ‘linking element’. A non-paradigmatic -s is 
obligatory in some noun-noun compounds where not historically present, such as when the 
first element is feminine, e.g. Schonheitsideal ‘ideal of beauty’ (cf. die Schonheit). Duden 
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(2006: 721-723) suggests all linking elements are at least homonyms of original (genitive) 
inflectional endings; linking elements generally no longer serve an inflectional purpose but 
rather are primarily phonologically conditioned and characterized by considerable regional 
variation. Evidence of earlier stages is also reflected when the first element is a weak noun 
(from the earlier M-stem class): Wangenbein ‘cheekbone’, cf. die Wange ; Zungenspitze ‘tip 
of the tongue’, cf. die Zunge; nach Herzenslied ‘to your heart’s content’, cf. das Herz. 

In the verb system, word formation through the affixation of prefixes and suffixes 
affected both the syntax and morphology of Germanic. Kuroda’s (2007) study of verbal 
prefixes in Otfrid’s Evangelienbuch shows the development of productive verbal prefixes 
by Old High German, which were built on basic verb forms: though the basic verb form 
may only have taken a prepositional phrase as complement, verbal prefixes bi-,fir-, int-, 
ir-, missi-, ver- and zir- supplement the valence of the basic form to allow for accusative 
objects; this is also possible even when the basic verb is intransitive (2007: 31). Per- 
haps more striking are instances where verbal prefixes modify the meaning of the verb, 
and “[d]er Bedeutungsunterschied, der zwischen dem Basisverb und dem Kompositum 
besteht, ist jedoch so drastisch, dass dazwischen ein Ableitungsverhaltnis kaum ange- 
nommen werden kann” (the difference in meaning which exists between the base verb 
and the compound is, however, so drastic that a derivational relationship between them 
can hardly be assumed; 2007: 41). This trend toward greater verbal prefixation is attested 
in Old High German and increases in frequency toward the modern period. 

A well-known Germanic derivational pattern is the already-noted rise of weak verbs, 
marking the past tense by a dental suffix (Hill 2004, 2010, Ringe 2006). Four classes, 
based at least originally on the semantic properties or grammatical origin of the weak 
verb, are often labeled i—iv (Prokosch 1939: 154-159). Class i is “causatives” formed by 
adding a -jan suffix to an existing lexical item. They are often derived from the preterit 
singular (a-grade) forms of strong verbs, or from existing nouns, i.e. Gothic mats ‘meat, 
food’ > matjan ‘to eat’. This yields a variety of minimal pairs that have persisted into 
the modem languages, e.g. Modem Germ, trinken ‘drink’ > trdnken ‘water [a horse]’ 
(Modem Eng. drink ~ drench), sitzen/setzen, liegen/legen, essen/atzen, etc. Class i verbs 
were also built from adjectives, as in Goth, daups ‘dead’ yielding daupjan ‘kill’; or OHG 
frewide ‘happiness’ yielding frewen (from *frawjan) ‘make happy’. 

Class ii verbs are often denominatives, often iterative. An example is OHG salbon, 
‘anoint, apply salve’. This class is also exemplified by such OHG verbs as dionon ‘serve’ 
and samanon ‘gather’. Class Hi is typically durative, e.g. OHG haben ‘have’, leben ‘live’ 
and wonen ‘reside’, all denoting events occurring over stretches of time. Still, some vari- 
ability is possible between class ii and Hi verbs, especially when the verb is ambiguously 
iterative or durative, as in OHG hazzon/hazzen ‘hate’, klagdn/klagen ‘complain’, faston! 
fasten ‘fast (i.e. to abstain from eating)’ and wTsdn/wTsen ‘search for something; visit’ 
(Braune & Eggers 1987: 299). On the one hand, that some verbs jump between class ii 
and Hi implies that these classes were semantically transparent to some degree still in Old 
High German; on the other hand, confusion regarding class assignment may also suggest 
that distinctions between classes were disappearing. This was surely further complicated 
by the reduction of the unstressed vowels that distinguished verb classes (see above). 

The fourth class has a *-nan suffix and inchoative meaning (Prokosch 1939: 156-157). 
It was a small class, not productive in West Germanic. Goth, gadaupnan ‘di e' ,fraquist- 
nan ‘perish’ and Old Norse vakna ‘awaken’ convey a change of state, while Goth, fullnan 
‘become full’ and usmernan ‘be proclaimed’ express the beginning of a process. Already 
disappearing as a morphologically marked verbal class from many of the daughter lan- 
guages in the earliest texts - including notably West Germanic - the syntactic function of 
this class is later subsumed by the periphrastic passive. 
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The weak preterit’s origin is debated, possibly deriving from an IE past participle suffix 
*-to- or a grammaticalized form of IE *d h eh 1 -/d h oh 1 - ‘put’. One view reconstructs its origin in 
reduplicated plural fonns attested in Gothic, which look like a grammaticalized form of an IE 
strong preterit; the lack of reduplicating fonns in later-attested North and West Gennanic - 
excepting OEng. dyde > Modem Eng. did - results from haplology (Ringe 2006). Arguing 
that the North and West Gennanic non-reduplicated suffixes are more indicative of the pro- 
to-fonn, Hill (2004, 2010) reconstructs the origins of the dental preterit using an IE imper- 
fect, which “secondarily adopted the optative inflections of the conesponding reduplicated 
perfect”; the differences between North and West Gennanic, on the one hand, and Gothic, on 
the other, result from divergent leveling of the Gennanic paradigm (2010: 448^45 1). 

SYNTAX 

We now turn to interactions across syntax, morphology and phonology (interfaces) with 
attention to grammaticalization and ongoing changes: word order and verb second, com- 
plementizers, definiteness and changes in tense-mood-aspect. 

Word order 

Syntactic reconstruction is one of the most contested areas of linguistic prehistory (Ferra- 
resi & Goldbach 2008 and now Walkden 2014). Still, the predominant view is that Ger- 
manic maintained inherited IE verb-final word order, and German and Dutch maintain 
some verb-final constructions throughout history (e.g. subordinate clauses, impersonal 
imperatives). However, from the earliest attestations in Runic and Gothic, fronting/raising 
of finite verbs occurs, pragmatically and syntactically conditioned. Arguably under way 
before attested texts (Wackemagel 1892), changes in the position of the finite verb are 
interwoven with related developments in phonology and morphology. Going along with 
these broad and gradual changes are numerous decidedly Gennanic innovations in finite 
verb position: the inherited IE clause-final position of the verb gradually yields to com- 
peting word orders, first in conditioned environments, and later obligatorily. Variable, 
pragmatically conditioned word order is gradually replaced by increasingly restrictive 
word orders. Stockwell (1977) proposes a unified model, including the earliest, verb-final 
word order inherited from IE, through the V2 constraint (verb second; the obligatory 
appearance of the finite verb as the second syntactic constituent) observable in German 
and Dutch, to the SVO word order of English and Scandinavian (in most contexts). 
Stockwell’s account largely represents the mainline thinking regarding the impetus and 
descriptive typology for the change in position of the finite verb in Gennanic. 

Inherited SOV is found in Runic, though verb fronting was possible. Variation occurs 
even within inscriptions, even though the corpus is tiny: only 34 early inscriptions con- 
tain a verb and another element that would allow us to determine word order patterns 
(Antonsen 1975: 24). The Ro Runestone shows both the inherited clause-final position of 
the verb, along with the innovative earlier verb placement (Antonsen 2002). 

(18) ek hrazaz/hrapaz satido [s]tain[a] . . . swabaharjaz sairawidaz 

I Hrazaz/Hrajjaz set stone. acc.def. . . Swabaharjaz with wide wounds . . . 

. . . stainawarijaz fahido 
. . . Stainawarijaz painted 

‘I Hrazaz/Hrajjaz set the stone . . . Swabaharjaz with-wide -wounds . . . Satainawar- 
ijaz painted.’ 4 
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Clauses in which the finite verb appears earlier than the expected clause-final position 
arguably show a degree of emphasis and suggest a degree of flexibility in verb position 
based on pragmatics. The verb can also appear earlier in the clause based on syntactic 
context, e.g. in the presence of negation or a wh-e lement, or in commands (Thiersch 
1978, Eythorsson 1995, 1996). Eythorsson (1995: 26, 249-257) notes that fronting of 
the finite verb of this specific type (“V-to-C” movement - movement of the verb from a 
lower syntactic position to a higher verbal/clausal position in the clause) occurs in these 
contexts in all old Germanic languages. “CP operants” - e.g. w/z-elements and negation, 
which activate the highest syntactic phrase not otherwise present in the earliest-attested 
Germanic languages - provide a syntactic environment that triggers a non-clause -final 
word order, showing that innovative positions of the finite verb in Germanic may be con- 
ditioned grammatically, as well as pragmatically. For all old Germanic languages except 
Gothic and Old English, raising of the finite verb is also obligatory when nominal com- 
plements are topicalized (Eythorsson 1995: 3, 22-28), providing supporting evidence 
that obligatory fronting later supplanted the optional fronting of the finite verb. Despite 
positive evidence of variable word order in Runic, however, the small corpus prevents 
broader generalizations about the frequency of orders in early Germanic. Still, variation 
suggests that change was under way from at least the earliest written evidence (200-500 
ce). Given that IE is reconstructed as having the finite verb in a clause-final position, it 
is likely that the older SOV order appears alongside a newer V2 (verb second) or SVO 
(subject, verb, object) order attested in subsequent varieties. 

Rise of V2 

The V2 constraint is present in main clauses in German and Dutch, and attested also for 
other historical periods of other Germanic languages, including Old English and Old 
Norse (Faarlund 2004). That V2 developed from an earlier SOV (verb-final) order in 
Germanic is generally accepted, though the process is much more nuanced. In fact, this 
change remains incomplete in some modem daughter languages, with German and Dutch 
employing SOV in subordinate clauses. 


V2 over time 

Despite evidence of verb fronting, the position of the finite verb is not fixed in early 
Germanic. West Norse (Old Icelandic, Old Norwegian) and East Norse (Old Swedish, 
Old Danish, Old Gutnish) exhibit V2 in both main and subordinate clauses, though 
“[deviations from this [in the Edda] are shown to be due to the poetic character of the 
text (although it is possible that certain patterns reflect a more archaic stage at which 
verb-second was not obligatory)” (Eythorsson (1995: 190-191). In West Germanic, V2 
is by no means obligatory in all contexts, with Old High German texts also exhibiting 
verb- initial and V3 word orders. In the Isidor, Axel identifies a number of instances where 
the verb raises to a medial position, even above a complementizer in subordinate clauses 
(2007: 79-81). Robinson (1994: 356) notes, also in the Isidor, that the position of the 
finite verb is also often not consistent with the Latin Vorlage. Multiple verb-raising strat- 
egies were indigenous to Old High German, with V2 the most common: “the translator 
of the Old High German Isidor imposed an apparently native verb-second word order 
. . . despite the example of the Latin original, and also of apparent exceptions to this 
principle where the verb appears later in the clause” (Robinson 1994: 356). In declarative 
and main clauses, the Old High German translation agrees with the Latin Vorlage on 135 
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occasions, though the Old High German position of the verb differs from the Latin text 
in 255 clauses, including 205 V2 clauses imposed by the Old High German translator 
(Lippert 1974). One account for this is variation in the landing site of the finite verb, that 
is, where the verb appears in the clause, if not in clause-final position. Axel (2007) argues 
that V3 may result from the raising of the verb to a medial position higher than the clause- 
final position of IE but lower than the ultimate V2 position attested in Modem German 
(see Weerman 1989 for a similar argument for Old English). Commonly argued for Old 
English is that a poverty of OV structures relative to VO structures in the input resulted 
in an unraveling of the inherited, underlying O V word order (Stockwell 1977 and others). 
A corresponding lack of SVOV structures in Old English concurrently destabilized the 
transition from OV to V2 predicted by Stockwell’s (1977) typology, resulting ultimately 
in an (S)VO order (FuB & Trips 2002). Differences in the position of the finite verb in Old 
Norse, Old High German and Old English provide evidence that the position of the finite 
verb was highly variable, both across the earliest attested Germanic languages and within 
the grammars of the individual varieties. The subsequent adoption of V2 structures was 
a parallel development, occurring in West Germanic centuries after the split from North 
Germanic. 5 

Another account for word order variation relates to the structure of the IE clause and 
subsequent changes in Germanic. As Kiparsky (1995: 140) notes, Germanic maintained 
the structure of the IE complementizer phrase (CP), which had both a topic and a focus 
position above C, the highest syntactic position to which a finite verb could raise. Even 
assuming a fixed position for the finite verb, at C, variation in the position of any number 
of constituents would then result in V 1 , V2 or V3 word order, depending on whether there 
were zero, one or two elements preceding the finite verb. Remnants of this earlier clausal 
structure are illustrated by Eythorsson (1995: 101-102), who provides one example from 
the Old Norse Volundarfo’ida and three from Gothic where wh- elements appear to the 
left of C, as well as additional rare instances in Old Norse and Gothic of an indefinite 
negative quantifier appearing to the left of a negated verb (1995: 145-149). In view of 
the rarity of these examples, the poetic nature of the Old Norse texts, and the possible 
influence of the Greek Vorlage on Gothic, Eythorsson notes - and we concur - that these 
exceptional clausal structures are likely archaisms reflecting earlier stages that allowed 
for greater flexibility of word order. The later collapse of these two clausal positions 
through their generalization as a single topic position reduced the number of possible 
word orders, eliminating the possibility of V3 constructions and increasing the propensity 
toward VI or V2 structures characteristic of German and Dutch, respectively in imper- 
ative and conditional clauses and in V2 main clauses. This collapse of topic and focus 
positions was a prerequisite for obligatory V2, primarily in Continental West Germanic. 

Clausal syntax 

As noted, Runic and Gothic exhibit syntactically conditioned verb raising in the presence 
of a CP-operant - a w/i-element, negative particle - or in imperative constructions. Verb 
raising is possible but not obligatory elsewhere pragmatically - an asymmetry suggest- 
ing ongoing change toward the development of subordinating constructions in Germanic 
not present in (P)IE. As noted in Matasovic (p. 166), (P)IE did not have subordinating 
conjunctions but rather relied on subordinating strategies using participles. 6 However, all 
branches of Germanic use neuter demonstratives as complementizers (Goth, pat, OHG 
dass/dazs, OEng. pcet, ONor. ad, at, att) (Harbert 2007: 417). Likely derived from IE 
uninflected deictic particles, Germanic complementizers independently develop toward 
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innovative subordinating structures. As in the example below (Otfrid’s Evangelienbuch ), 
the neuter demonstrative thasz may have been parsed as the direct object of the first 
clause but later reanalyzed as the head of the subordinate clause. 

(19) Otfrids Evangelienbuch: Ev. 2, 2, 8 

[ CP Joh gizalta in sar thasz] [ CP thiu salida untar in uuasj. 

[ CP Joh gizalta in sar] [ CP thasz thiu salida untar in uuas] . 

‘And he told them immediately that good fortune was among them’ 

The potential ambiguity is yet greater, given that both V2 and verb-final constructions 
are possible in main clauses in (early) Old High German; the maintenance of verb-final 
constructions in subordinate clauses in Modem German and Modem Dutch is an archaic 
remnant of an earlier stage in the language. 

Related to this class of neuter demonstratives as complementizers are relative pronouns, 
which similarly build subordinating structures not present in IE. The differences between 
bare relatives and relative particles in Goth, ei. Old Norse es, er and OEng. pe, pe, de attest 
to an innovative rather than inherited characteristic. Moreover, these differences across the 
branches hint at a parallel post -Ausgliederung development or, more likely, at earlier dia- 
lectal variation within common Gennanic. Subsequent developments in all the Germanic 
languages transitioned from the use of relativizing particles to use of demonstrative pro- 
nouns as relative pronouns (Lehmann 2005-2007). Demonstrative and relative pronouns 
in Gennanic derive predominantly from IE deictic elements, t, h and s, a marked difference 
from Romance and other languages that derive relative pronouns from wh- elements (IE 
*k w o-). 7 Gothic shows a possible intennediate stage, where relativizing pronouns co-occur 
as clitics on demonstrative pronouns. In the example below ei serves as a stand-alone 
relativizing particle; as a relativizing enclitic particle attached to a demonstrative pronoun, 
as in patei; and also as a relativizing enclitic particle attached to a demonstrative pronoun 
marked for case, number and gender, i.e. serving as a relative pronoun. 

(20) Codex Argenteus, John 12:9 

fanp pan manageins filu Iudaie patei Iesus jainar ist, jah qemun, ni in Iesuis ainis, 
found then multitude many Jews. gen.pl. rel Jesus there is and came.3PL. not for 
Jesus only 

ak ei jah Lazaru selveina, panei urraisida us daupaim. 

but rel also Lazarus see.PRET.0PT.3pL. rel.M.ACC.SG. raised from death 

‘Then a great multitude of Jews found that Jesus was there, and they came, not only 
for Jesus, but also that they might see Lazarus, whom he raised from the dead.’ 
(Our translation) 

Marking for case, number and gender on subordinating elements was apparently typo- 
logically rare in IE daughters and crosslinguistically (p. 166-167, this volume), but here 
it appears to have arisen independently many times across the family. 

Definites 

Reconstructions of Germanic do not posit definite articles, but definiteness - with case, 
number and gender - was encoded with suffixes based on noun class. However, by the 
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earliest attestations, demonstratives derived from deictic elements p, h and s appear. 
Though p occurs across the family, variation in the pronominal paradigm once again 
points to either dialectal differences in Germanic or parallel developments across 
branches. One oft-mentioned similarity between Old Norse and Gothic is that, despite a 
majority of pronouns and demonstratives with p, archaic Gothic sa (nom.sg.m.dem.) and 
so (nom.sg.f.dem.) correspond to Old Norse sa (nom.sg.m.dem.) and sit (nom.sg.f.dem.). 
Second, pronouns and demonstratives with h are attested mostly in North Sea Germanic. 
The use of demonstratives, however, is similar across the family, and early texts employ 
demonstratives not only in emphatic or contrastive contexts but also anaphorically, as in 
the OHG gospel harmony Tatiair. 

(21) Tatian, chapter 1, sentence 1 

In anagitme uuas uuort inti thaz uuort 

in beginning . def was word.DEF.SG.M.A-STEM and that (same) word.DEF.SG.M.A-STEM 
uuas mit gote inti got selbo uuas thaz uuort. 

was with God and God self was that (same) word.DEF.SG.M.A-STEM. 

‘In the beginning was the word, and that same word was with God, and God (him) 
self was that word. ’ 

The first use of uuort ‘word’ employs the older definiteness marking, with an unpreceded 
noun characteristic of the Latin Vorlage. However, in the second and third uses, it is 
preceded by the demonstrative thaz, where the Latin Vorlage lacks a demonstrative: In 
principio erat verbum et verbum erat apud deum et deus erat verbum. Old High German 
seems to allow unpreceded bare nouns, on the one hand, but also employs preceded nouns 
in anaphoric contexts; a conservative translation of thaz in this case would be ‘that same’. 
This provides evidence of a gradual transition from the inherited IE system, moving 
toward the innovative precursors of definite articles. 

In terms of diachrony, definite articles are not reconstructed for Gennanic, or even 
attested in the early daughter languages. Rather, demonstrative pronouns are reanalyzed as 
definite articles, through language -internal processes of grammaticalization. Formalizing 
the process, van Gelderen (2007, 2009, 2011) categorizes grammaticalization of demon- 
strative pronouns as the loss of the feature [+emphasis], with the corresponding retention of 
interpretable phi-features (or [i-phi]) marking for person and number agreement. 8 Arguing 
that “the real sources of change are internal principles that bias the learner toward cer- 
tain structures’’ (van Gelderen 2011: 4), van Gelderen motivates this reanalysis through 
the Head Preference Principle (HPP) (2007: 283; 2011: 13-14). The HPP holds that spec- 
ifiers (and phrases) such as demonstratives are less preferable than syntactic heads such as 
definite articles, and that the natural process of language change involves reanalyzing the 
former as the latter. Such a change in the grammar is consistent with Kurylowicz’s (1965: 
52) definition of grammaticalization as “the increase of the range of a morpheme advancing 
from a lexical to a grammatical or from a less grammatical to a more grammatical status”. 

Indefinite articles are similarly unattested in the earliest Germanic languages. Indefi- 
nite nouns are marked by strong adjectival inflection preceding the noun. Indefinite arti- 
cles developed from the numeral ‘one’, with a corresponding reanalysis from number to 
indefinite marker. Much like the development of definite articles, this was a process of 
grammaticalization, advancing to a more grammatical function. Because the numbers 
one through four inflected in Gothic, 9 the numeral ‘one’ already inflected for case, num- 
ber and gender. Grammaticalization, then, involved reanalysis of a quantifier as an article 
(van Gelderen 2011: 202). 10 
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Tense, aspect, mood 

The Germanic systems of tense, aspect and mood (TAM) reflect a rather drastic depar- 
ture from IE, with subsequent development of indigenous innovations. Relative to IE, 
Gennanic underwent leveling followed by the rise of new tenses and aspects. The devel- 
opment of aspectual auxiliaries as part of a cyclical pattern - even as a small portion 
of ongoing change - is observable over a period of centuries in a variety of languages, 
including the earliest attestations of Old English and Polish through to the modem lan- 
guages (van Gelderen 2011: 273-276). 

Reconstructions of IE often posit a four-way system of aspect and tense, differenti- 
ating perfect and imperfect, and past and present, respectively; the present tense likely 
may have expressed future tense 11 (p. 162, this volume). Germanic collapses these two 
systems, using the perfect of strong verbs to mark past, while the present tense forms con- 
tinued to mark both present and future tense. Perfective aspect may be expressed through 
use of verbal prefixes, most commonly ga- (derived from IE *kom, ‘together, with, col- 
lective, intensive’). Inherently perfective verbs often carry this prefix, as in gadaupnan 
‘die’ or gatairan ‘destroy’ (Modem Germ, zerreissen ‘to rip, to tear (apart)’), both only 
attested with the perfective prefix ga-. Still, that the perfective prefix was productive is 
apparent in common verbs such as Goth, waurkjan ‘to prepare, bring about’, saihwan 
‘to see’, hausjan ‘to hear’ and gaggan ‘to go’, which have perfective variants gawaurk- 
jan, gasaihwan, gahausjan and gagaggan. This perfective prefix may also serve in some 
cases as a kind of aspectual perfect marker, as in the opening line of the 9th century 
Hildebrandslied (22). 

(22) Ik gihorta dat seggen 

I heard. Isg.perf. that said 
‘I have heard it told . . . ’ 

The narrator addresses the listener/reader directly, using a first person pronoun. The 
perfective gihorta establishes the narrator’s authority, having heard the tale prior to the 
present narrative; indeed, the story may have been well known, with analogues across 
Gennanic languages, as well as in Persian, Russian and Old Irish (Baesecke 1940). With- 
out a dedicated pluperfect, however, ordering of events in the past tense is often accom- 
plished using participles. 

(23) Codex Argenteus, Mark 5:39 

jah innatgaggands qap du im: Iva auhjopjah gretip pata barn 

and enter.PRES.PTCP. said.3sG. pro to them why fuss.2PL. and cry.2pL. pro that child 

ni gadaupnoda ak slepip 

not died.3.SG.PRET.PERF but sleeps 

‘And when he had come in, he said to them: why do you fuss and weep? The girl 
is not dead, but sleeps.’ (Our translation) 

While a literal translation of jah innatgaggands qap might be ‘and entering (he) said’ 
or ‘and as he entered, he said’, we draw from context - and from frequent attestation of 
present participles elsewhere in the Gothic Bible - that these events are sequential rather 
than simultaneous; coordinated structures employ jah ‘and’. 

A similar example shows the use of a present participle to order events at the begin- 
ning of the verse, with the act of exiting ( usgaggands ) preceding subsequent events. 
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Additionally, the perfective prefix ga- is used to convey something that had already hap- 
pened prior to the moment of speaking, in gasalv. 

(24) Codex Argenteus, Luke 1 :22 

usgaggands pan ni mahta du im rodjan jah fropun pammei siun 

exit.PRES.PTCP. then not might pro to them speak and perceived.2PL. that.REL vision 

gasah in alh; 

3sg.seen.perf. in temple 

‘When he came out, he could not speak to them, and they perceived that he had 
seen a vision in the temple. ’ (Our translation) 

The second half of the verse exemplifies an emerging periphrastic construction, com- 
bining a past form of wisan ‘be’ and participial bandwjands ‘signaling’. 

(25) Codex Argenteus, Luke 1 :22 

jah silba was bandwjands im jah <ga>was dumbs. 

and same was signaling them. dat.pl. and was. 1 so. speechless. 

‘ . . . and the same one [Zacharias] was signaling to them, and [he] was speechless.’ 

The English translation may be most colloquial with a present progressive, but the Gothic 
participle is not unambiguously verbal: the inflectional -s could also be a nominative suffix 
(masculine nrf-stem), that is, a copular construction between the grammatical subject and an 
inflected nominal or adjectival fonn meaning, here, ‘one who signals; a signaling one’. This 
interpretation of otherwise ambiguous Gothic data is consistent with the later development 
of the periphrastic perfect, in which verbs meaning ‘have’ and ‘be’ become auxiliaries and 
co-occur with participles. Crucially, these participles frequently appear with inflectional 
suffixes, supporting the argument that they were originally nominal (or adjectival) rather 
than verbal. Perfective prefixes ga-, gi- and ge- (IE *kom-) regularly co-occur with nomi- 
nal suffixes but are notably absent from reflexes of double modal constructions and more 
grammatical uses of OHG lazzan ‘let, allow’, heizzan ‘command’ and werdan ‘become’. 
The reanalysis of these as past participles corresponds with the loss of inflectional affixes. 

Germanic also reduced other parts of the TAM system. IE is reconstructed with - in 
addition to the indicative - an optative and subjunctive that collapse into a single “opta- 
tive” in Germanic (Matasovic 2012). Marked in Old High German by verbal affixes - e 
in the present and F in the preterit - the optative begins to collapse with unstressed syl- 
lable reduction. In the present indicative, the distinction between present indicative and 
present optative is lost, first in the first person plural, e.g. faremes ‘we go’. Later, it is 
lost throughout the paradigm as full vowels reduced and centralized, and unstressed long 
syllables shortened, i.e. OHG faru ‘I go.lsG.PRES.iND.’ and OHG fare ‘I go.lsG.PRES.OPT.’ 
both collapse to MHG fare. The distinction is maintained only in verb classes where 
other sound changes caused vowel mutations, e.g. MHG nime ‘take. Isg. pres. ind.’ but 
neme ‘take. Isg.pres. opt.’ The preterit optative is identical to the preterit indicative for 
weak verbs in Old High German, and the OHG -i suffix for strong verbs causes umlaut 
before reducing; umlaut demarcating the optative (German Subjunctive I) is thereafter 
morphologically conditioned. Ultimately, as in other parts of the grammar, morphological 
marking of aspect yielded to periphrastic constructions. 

The other component of the innovative Germanic periphrastic perfect is the develop- 
ment of auxiliaries from the verbs ‘have’ and ‘be’. OHG haben (or eigan ) was originally 
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a lexical verb, e.g. ‘to own’. To ‘have’ something that is in a current (perfective) state 
implies that it was previously not in that state; thus, the current state is interpreted as 
a present tense, albeit one derived from an earlier completed action. For example, the 
oft-cited OHG sentence phigboum habeta sum giflanzotan (Tatian) is arguably to be 
interpreted as ‘someone had planted a fig tree’ (Lockwood 1968: 115). Some skepticism 
would be warranted, though, since the inflected participle looks more like an adjectival 
form than a perfective verbal form. Still, while somewhat ambiguous in its contemporary 
reading, such an argument is a plausible example of change under way. 

CONCLUSION 

Across the phenomena that separate Germanic from IE and separate the daughters from 
one another, we see surprising cohesion and perseverance in the kinds of change and 
patterns found today: 

• vocalic and obstruent chain shifts, 

• case loss, 

• strong verbs becoming weak, 

• independent development of typologically rare subordinating strategies. 

From IE to the first textual evidence, Germanic languages include phonetic/phonological 
reductions and various drastic reorganizations: 

• shift away from nominal classes toward a coherent singular/plural distinction, 

• expansion of periphrastic verbal forms to create marking for tense, aspect and mood, 

• development of non-inherited inflectional affixes and inflectional categories that 
increase the complexity of the inflectional system. 

Questions remain about the sociolinguistic contexts in which these changes took place. 
Extended contact would be consistent with the sorts of morphological reduction found: 
increasingly fixed word order and collapse of inherited TAM marking. Some innovations 
may reflect prehistoric variation: 

• the development of three innovative strategies for marking preterit (classes VI and 
VII, weak classes), 

• differences in the weak preterit plural ending, separating Gothic from other languages, 

• wide variation in the pronominal inventories, 

• a tripartite division in subordinating conjunctions and relative particles (Gothic ei, 
ONor. es, er and OEng. pe, pe, de), 

• divergent chronologies of nasal loss and compensatory lengthening, cf. mouth ~ Mund. 

Still, the Germanic family portrait shows profound familial similarities, not only in inher- 
ited forms and structures, but also in parallel developments among sometimes distant 
daughters; and shared developments and borrowings during extended, repeated periods 
of contact. 

Progress today is rapid thanks to developments in general, Germanic and IE linguis- 
tics, including better understanding of crosslinguistic patterns of language change (e.g. 
how common glide hardening turns out to be), new models of how change works (e.g. in 
prosody) and newer issues (the rise of complementizers). 
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Bammesberger, Alfred 1986, Der Aufbau des germanischen Verbalsy stems, Win- 
ter. Heidelberg; 1990, Die Morphologie des urgermanischen Nomens, Winter, 
Heidelberg 

Detailed treatments of derivation and inflection of early Germanic nouns and verbs. 

Jacobs, Neil 2005, Yiddish: A Linguistic Introduction, Cambridge University Press, 
Cambridge 

Yiddish is underdiscussed in the context of early Germanic but provides unique per- 
spectives in terms of language contact and dialectal relationships. 

Lehmann, Winfred P. 1967, A Reader in Nineteenth-Century Historical Indo- 
European Linguistics, Indiana University Press, Bloomington 

Edited collection of seminal texts in IE linguistics, with commentary on historical 
context within the field. 

Nielsen, Hans Frede 2000, The Early Runic Language of Scandinavia, Winter, 
Heidelberg 

Delivers the best survey of the Ausgliederung and internal structure of the Germanic 
family, contextualized in external evidence. 

Prokosch, Eduard 1939, A Comparative Germanic Grammar (William Dwight Whit- 
ney Linguistic Series), Linguistic Society of America-University of Pennsylvania, 
Philadelphia 

Classic older work on Germanic in the IE context. 

Ringe, Don 2006, From Proto-Indo-European to Proto-Germanic, Oxford University 
Press, Oxford 

The clearest statement of Germanic sound and word form patterns and developments 
from an IEist perspective, with a few pages on derivational morphology and men- 
tion of syntax and lexicon. 

Robinson, Orrin W. 1 992, Old English and Its Closest Relatives: A Swvey of the Ear- 
liest Germanic Languages, Stanford University Press, Stanford 

The most reliable and accessible introduction to the family. 


NOTES 

1 Many glottalic proposals posit implosives rather than voiced stops, and implosion is 
a form of “hypervo icing”, phonetically enhancing a [voice] contrast. 

2 Dresher and Lahiri (1991: 263) suggest that this restriction may in fact have been 
limited to word-initial feet. If so, similar patterns might be shared across dialects. 

3 Possible remnants of reduplication in West Germanic are controversial, such as 
OEng. heht from Gmc *haihait, or Modem Eng. do ~ did, possibly related to *ton 
(Braune & Eggers 1987: §381-382). 

4 While fahido is related to the term for ‘to paint’, it is also used broadly as ‘to fashion 
runes’, which may also include carving. 
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5 The development of V2 is parallel but not independent: verb fronting strategies show 
that change was under way when the branches diverged. 

6 Gothic makes extensive use of participles, though this relates more directly to the 
tense/aspect system than to subordinating structures (§5.6). 

7 Some modem languages use w/?-elements as relative pronouns in restricted contexts, 
most notably Eng. who/whom and which, in competition with null complementizers 
and that. Germ, was and Dutch wat serve as relative pronouns for indefinite or inan- 
imate nouns; Germ, wo occurs dialectally as a relative pronoun for animate nouns, 
and Dutch wie or welke may also be used, in formal or archaic contexts. 

8 Since many demonstrative pronouns already mark number and gender, this is a clear 
case of grammaticalization, cf. Gothic demonstratives sah (m. sg.), patuh (n. sg.), soh 
(f. sg.) and paih (m. pl.; other genders not attested); OEng. se (m. sg pcet (n. sg.), 
seo (f. sg.), pa (pl. all genders), etc. 

9 Some other IE daughters - including Slavic languages - also decline numbers. 

10 The nature of this reanalysis “depends on one’s assumptions about what numerals 
are and where indefinite articles are in the DP” (van Gelderen 2011: 202); we are 
concerned here only with the broader picture. 

1 1 Alternatively, the IE subjunctive may fulfill this function. 
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PART 9 


ARMENIAN 

Birgit Anette Olsen 


INTRODUCTION 

In modem times Armenian is found in two varieties, Eastern Armenian, the official lan- 
guage of the Armenian Republic, also spoken in Nagorno Karabagh and in north-western 
Iran, and Western Armenian, originally mostly found in what is now the north-eastern 
part of Turkey, but now mainly scattered around the Armenian diaspora with considerable 
bilingual populations in Lebanon, Syria, Israel, France, Canada and the USA. 

The first records of Armenian go back to the Old Persian Behistun (Bisutun) inscrip- 
tion by Darius the Great (about 520 BC), where we hear of the province Annina- and the 
people Arminiya-. Somewhat later, Appivlr] and Appsvtot are mentioned by Herodotus, 
while the indigenous literary tradition only goes back to the 5th century AD, the “golden 
age”, with the standardized Classical Armenian, also known as grabar, the written lan- 
guage. This will be the subject of the following pages. The creation of a written language 
was a side effect of the introduction of Christianity, which became a state religion as 
early as 301. According to tradition, the monk and missionary Mesrop Mastoe c invented 
the Armenian alphabet, consisting of 36 letters (cf. Table 9.1), with the purpose of trans- 
lating the Holy Scripture into the vernacular, and around 410 the translation of the Bible 
was completed, the first attempts based on a Syriac original, while the final version was 
founded on the Greek. During the following centuries Greek influence continues with 
translations of theological and philosophical literature. Important original works from 
the golden age include Eznik’s Elc alandocf, “Refutation of Sects”, Koriwn’s “Life of 
Mastoe c ” and the historical accounts by Agathangelos and P c awstos Buzandae c i. The later 
historian Movses XorenacT, probably from around the 9th century, is important for his 
descriptions of pre-Christian traditions as famously illustrated by the poem of the birth of 
Vahagn (Iranian VaraBayna-, Vedic Vrtrahan-). Apart from early stone inscriptions, the 
actual attestation is of a much later date than the 5th century, the earliest manuscript being 
the Moscow Gospel from 887. As a literary and learned language, the grabar survived 
well into the 19th century, and for religious purposes it is still in use. 

Middle Armenian, which already shows dialectal differentiation, is mainly transmitted 
in the western dialect of the Cilician kingdom (1080-1375; cf. the meticulous treatment 
by Karst (1901)). From around the 17th century, it gradually developed into the numer- 
ous modem dialects and the two standardized literary languages of Eastern and Western 
Armenian. 

During the first millennium BC, the Armenians settled in the region around Lake Van 
in eastern Turkey, which was then dominated by the non-Indo-European Urartians. This 
is approximately the region known by the Hittites as havasa-, which has often been com- 
pared with Ann. hay ‘an Armenian’, Havastan ‘Armenia’. As a separate Indo-European 
language, Armenian is not as obviously connected to any of the other branches as Indian is 
to Iranian, or Baltic to Slavic. According to Herodotus, the Armenians are identified with 
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TABLE 9.1 THE ARMENIAN ALPHABET 




Transliteration 

Value 

Name 



Transliteration 

Value 

Name 

u 

U1 

a 

1 

ayb 

& 

d 

c 

100 

ce 

p 

F 

b 

2 

ben 

U 

it 

m 

200 

men 

q 

q 

g 

3 

gim 

3 

1 

y 

300 

yi 

q 

q 

d 

4 

da 

L 

u 

n 

400 

nu 

b 

t 

e 

5 

ee c 

C 

2 

s 

500 

sa 

a 

q 

z 

6 

za 

n 

n 

0 

600 

0 

t 

t 

e 

7 

e 

Q 

2 

C c 

700 

e c a 

c 

c 

3 

8 

et c 

41 

n 

p 

800 

pe 

19- 

p 


9 

t c o 

2 

2 

J 

900 

Je 

cf 

d 

z 

10 

ze 

ft 

n 

r 

1000 

fa 

h 


i 

20 

ini 

u 

u 

s 

2000 

se 

E 

L 

1 

30 

liwn 

q 

4 

V 

3000 

vew 

lu 

lu 

X 

40 

xe 

s 

in 

t 

4000 

tiwn 

D 

b 

c 

50 

ca 

r 

P 

r 

5000 

re 

q 

\ 

k 

60 

ken 

3 

9 

C c 

6000 

C c O 

2 

h 

h 

70 

ho 

h 

i 

w 

7000 

hiwn 

2 

4 

j 

80 

ja 

d> 

4 1 

p c 

8000 

p c iwr 

q 

q 

1 

90 

lat 

p 

p 

k c 

9000 

k c e 


the Phrygians, and while this is hardly an exact description, the language does share a num- 
ber of linguistic features with Phrygian, Albanian and especially Greek, which has induced 
some scholars, such as Klingenschmitt (1994: 244f.), Matzinger (1998: 118 and later) and 
Holst (2009), to connect these languages under the common heading of “Balkanindoger- 
manisch”. Clackson, on the other hand, in a monograph dedicated to this particular question 
(1994), fails to recognize a significant relation even between Greek and Armenian. 

The recognition of Annenian as an independent branch of the Indo-European family 
came only in the late 19th century. Until then it had been considered an Iranian language, 
and, indeed, the Iranian elements of vocabulary, word fonnation and composition, not to 
mention loan translations and syntactic features, are so abundant that this early misconcep- 
tion is quite understandable. The largest part consists of Parthian loans, introduced during 
the Arsacid dynasty (247 BC-224 AD) and exhibiting particular north-western dialectal 
features such as *8 > Ann. r, e.g. aroyr ‘brass’ < *rau5a vs. South-West Iran, y, cf. MParth. 
Iwdv s. ManMPers. rwy ‘copper, brass’; preservation of -rd-, e.g. vard ‘rose’ as opposed to 
MPers. gul\ and *Sr > ( r)h , e.g. pah ‘watch, ward’ < *paSra- vs. s in ManMPers. p 's. Occa- 
sionally a single word or stem is found in several dialectal variants. A striking example is 
IE *wergom > Iran. *warj, borrowed from Middle Parthian as warz ‘study’; from Middle 
Persian as vard- in vardapet ‘master’; from a third Iranian dialect, possibly by Parthian 
mediation, as varj ‘reward, hire’; and finally (with analogical o-grade in the root) preserved 
in the indigenous noun gore ‘work’. A systematic survey of phonetic distinctions between 
the Middle Parthian and the later Middle Persian loans is found in Bolognesi’s classical 
study (1960), followed by the useful updating by Schmitt (1983). 
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MAP 9.1 ARMENIAN IN ANCIENT AND MEDIEVAL TIMES 

Source: Adapted from: Walker, C. J. 1984, Armenia. The Survival of a Nation , Croom Helm, London- 
Sydney: 22 

Other important sources of borrowings into Armenian are Greek and Syriac, both 
intimately connected with the introduction of Christianity, but apart from such easily 
recognizable elements, the Armenian vocabulary abounds in more or less obscure lexical 
items. As observed by Solta (1990: 13), Acaryan’s monumental etymological dictionary 
(1926-35) contains 10,722 head words, of which 4016 are recognized loanwords, while 
5572 are registered as being of unknown origin, and only 713 are considered inherited. 
These bare numbers would give some idea of the challenges connected with Armenian 
etymology. 

Note on the transcription 

Aspiration of a consonant is indicated by a raised c , e.g. f = [t h ]; c,j and c c stand for the 
affricates [t s ], [d z ] and [t sh ], c,f, and c c for the “shibilants” [T], [d 2 ] and [t 5h ] ; y equals [i], r 
is a geminate r, and / perhaps a retroflex lateral (cf. Schmitt 1981: 29). E derives from a 
former /-diphthong, *ey or *oy, and must originally have been a long vowel; however, in 
the classical language, where quantitative distinctions had been eliminated, the pronunci- 
ation was probably a closed e as opposed to the more open e from the old short vowel *e. 
Finally, the digraph ow is pronounced and transcribed as u. 

PHONOLOGY 

The phonology of Armenian is characterized by a series of fundamental changes, and 
since the inherited vocabulary is relatively limited the etymological interpretation of a 
given word or morpheme often depends on the precise formulation of a specific condi- 
tioned sound law. Thus, many details are still awaiting their final clarification. 

Vowels 

A characteristic feature of Armenian vocalism is the loss of the quantitative oppositions 
before the earliest attestation of the language. While long a, T and u simply merged with 
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their short counterparts, and short e and 5 maintained an open quality, the long vowels e 
and o underwent narrowing, finally merging with ill and ulu respectively. 

We thus arrive at the following unconditioned developments in post-laryngealistic 
terms: 

IE aaeoile uuo 

\J/ V 

a e o i u 

Examples: 

*a acem ‘I carry, bring, conduct’, Lat. ago , Gr. ayco 

*a mayr ‘mother’, Lat. mater. Dor. pdxr|p 

*e berem ‘I carry’, Lat .fern, Gr. cpepco 

*e mit ‘mind’, Gr. prjdoq 

*6 ost ‘branch’, Gr. oCpq, Goth, asts 

*5 tur ‘gift’, Gr. 8copov 

*i egit ‘(s)he found’, Ved. avidat 

*1 c c in ‘vulture’, Gr. iieuvoq 

*u nu ‘daughter-in-law’, Lat. minis, Gr. woq 

*ti mukn ‘mouse’, Lat. mus, Gr. puq 

Further raising of e to i and o to u occurred before the nasals n and m, cf. e.g. hin ‘old’ < 
*senos (Lith. senas, Lat. senex, senior) and cunr ‘knee’ < *gonu (Gr. yovu). 

Other vocalic changes include umlaut and epenthesis. As already observed by Meillet 
(1903: 501), we sometimes find -e- for expected -i- when the following syllable includes 
u, e.g. lezu ‘tongue’ beside Ved .jihvd-, but apparently the rule, or at least the tendency, 
may be extended to cover all sequences of high vowels in consecutive syllables exhibit- 
ing a sort of “dissimilatory umlaut”, beside i-u > e-u also i-i > e-i, u-i > o-i and u-u > o-u, 
e.g. korust ‘destruction’ beside Lith. giirti ‘fall to pieces’ (Olsen 1999: 801-804). The 
rule also applies to secondary i and u from older *e and *o, e.g. ter ‘master’ vs. teruliwn 
‘domination’ rather than tirufiwn (*e > i in unstressed position). 

In certain cases the vowels i and u as well as the semivowel y are observed to cause 
epenthesis, e.g. ayl ‘other, different’, Lat. alius, or artawsr ‘tear’ < *draku, Gr. Soucpu. 
The conditioning of //v-epenthesis clearly differs from that of u-epenthesis, and while the 
exact formulation of the rules may still be debated, the preceding vowel always seems 
to be -a- or -o-. Apparently, other decisive factors have been syllable length and stress, 
which are further connected with the loss of final syllables, e.g. artawsr vs. pi. artasut'. 

The status of other suggested conditioned changes affecting the vowels, such as e > a in 
tasn ‘ten’ vs. Lat. decern, or o > a in ateam ‘hate’ vs. Lat. odium, remains more uncertain. 

Diphthongs 

After the separation of Anatolian, the protolanguage or “core Indo-European” possessed 
six short diphthongs, *ay, *ey and *oy, *aw, *ew and *ow. As is also the case in, e.g., 
Italic, the diphthongs whose first part was an a-vowel remained intact in Annenian as ay 
and aw, while the remaining /-diphthongs merged in e, and the M-diphthongs in oy, pre- 
sumably via an intermediate stage *ow. 
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IE 

ay 

ey oy 

aw 

ew ow 



V 


V 

Arm. 

ay 

e 

aw 

oy 


Examples: 

*ay aye ‘goat’, Gr. 

*ey aor. e-dez ‘(s)he piled up’ < *e-d h eyg h -et, Ved. pres, dehmi ‘smear’ 

*oy dez ‘heap’, Gr. toI^o q ‘wall’ 

*aw awt c ‘passing the night’, Gr. avki q ‘lodging for the night’ 

*ew joyl ‘molten mass’ < *g h ewtlo-, Ved. hotra- ‘libation, sacrifice’ 

*ow p c ovt c ‘care; haste’, Gr. anou5f| ‘haste, zeal, effort’ 

A similar development is found in older Iranian loanwords where /-diphthongs are 
reflected as e, M-diphthongs as oy, e.g. k c en ‘hatred, animosity’, Av. kaena- ‘punishment, 
revenge’; goyn ‘colour’, Av. gaona-. 

Apocope, syncope and vowel weakening 

The early fixation of the stress on the penultimate in Proto-Armenian had severe con- 
sequences for the further phonetic development of vowels and diphthongs: the vowels 
of final syllables underwent apocope so that the synchronic accent regularly came to 
fall on the final syllable, e.g. *eb h eret ‘(s)he carried’ > eber. Moreover, the unstressed 
high vowels / and u, whatever their origin, were weakened to a mostly unwritten 
prop vowel, a, a phenomenon generally described as “syncope”, e.g. nom. *senos > 
hin ‘old’ vs. gen. *senosyo > hnov, nom. /ace. *dorom > tur ‘gift’ vs. gen. *dorosyo 
> troy, or *penk' v e > hing ‘five’ with apocope beside hnge-tasan ‘fifteen’ with 
syncope. 

Similarly, the old diphthongs (*ey/oy >) e and (*ew/ow >) oy in unstressed position 
were continued as / and u, e.g. ser ‘love’, gen. siroy,p c oyt c ‘care; haste’, gen ,p c ut c ov, and 
the sequence ea (< e/i + a after the loss of an intervocalic consonant) as e, e.g. ptep. bereal 
‘having carried’, gen. berelov. 

Vowel contractions 

After the elimination of intervocalic *y, *s and partly *w and the lenition of intervo- 
calic stops, the subsequent hiatic sequences were generally followed by vocalic con- 
tractions (cf. Schmitt 1981: 73fi). By contraction of homorganic vowels the result is the 
corresponding short vowel, while e is deleted in contact with a following or preceding 
a or o\ 


e - e 

> e 

*treyes > ereld ‘three’ 

o-o 

> 0 

*b h oso-g w o- > bok ‘barefoot’ 

a - a 

> a 

*p3,trb h i > harb ‘father’ (instr. sg.) 

e - a 

> a 

*wesr- (*gear- >) gar-un ‘spring’ 

a - e 

> a 

*pa 2 teres > hark c ‘fathers’ (nom. pi.) 

e - o 

> 0 

*swesores > IVorlV ‘sisters’ (nom. pi.) 
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Syllabic sonants 

In general, the syllabic sonants developed a preceding supporting vowel a, i.e. *r, *1, *n, 
*m > ar (jar), al (Jal), an, am: 

*r bard ‘pile’, Ved. bhrti- ‘carrying, support’ 

*1 malj ‘gall’, Av. msrszana- ‘belly’ 

*n an- ‘un-’, Ved. a-, Gr. a-, Lat. in-, Goth, un- 

*m t'amb ‘ham’, Lith. timpa ‘sinew’, ONor. pomb ‘paunch; bowstring’ 

The same development is found in prevocalic position, i.e. mainly before an original 
prevocalic laryngeal, as, e.g., am ‘year’ < *smhah,-, cf. Ved. sama- ‘season’. 


Consonants 

While the most striking feature of the consonant system seen in a historical perspective is 
certainly the Armenian sound shift, other recurrent features such as palatalizations, len- 
itions, metatheses and the development of consonant clusters have resulted in profound 
transformations of the inherited pattern. 


Stops 

By the Armenian sound shift, tenues are, in traditional terms, continued as voiceless aspi- 
rates, mediae as tenues, and voiced aspirates as mediae. However, the development of 
*p usually goes a step further to h or zero (cf. also Klingenschmitt 1982: 165-168 for 
a discussion of initial p c - in cases like p c lanim ‘fall’, Lith. piilti). This view has been 
challenged by adherents of the so-called glottalic theory, according to whom the Indo- 
European “mediae” were in reality glottalized stops, faithfully continued as such into 
modem Armenian dialects, e.g. Gamkrelidze 1985. As Armenian belongs to the “satam” 
group of the Indo-European languages, the velars and labiovelars merge, at least before 
non-palatal vowels, while the palatals are subject to further palatalization, *k > s, *g > c 
and *g h > j. Thus we arrive at the following table of unconditioned developments: 


IE 


Armenian 

Examples 

*p 

> 

h/0 

*pod-m > otn ‘foot’, Gr. ace. no5a; *pedom > het ‘footstep’ 

*t 

> 

t c 

fur ‘sword’, Gr. xopoq ‘piercing’ 

fcjj-lw) 

> 

k c 

terem ‘scratch’, Gr. Ksipco ‘cut’; tan ‘than’, Lat. quam 

*k 

> 

s 

sirt ‘heart’, Lith. sirdis 

*b 

> 

P 

stipem ‘press’, Gr. aietpco ‘step on’ 

*d 

> 

t 

tur ‘gift’, Gr. Scopov 

*g(w) 

> 

k 

think ‘crane’, Lat. grus, Gr. yepavoq; *g w ow- > kov ‘cow’, 
Gr. Pouq 

*g 

> 

c 

*gonu > cunr ‘knee’, Skr . janu, Gr. yovt) 

* b h 

> 

b 

berem ‘carry’, Skr. bharati, Gr. (pepco 

*d h 

> 

d 

durlf ‘door’, Gr. &upa, Lat .fores 

*g(w)h 

> 

g 

geljlf ‘glands’, Lith. gelezuones; *g wh - in goneav ‘above all’, 
Gr. £u&£vf)q ‘abundantly’, Lith. gana ‘enough’ 

*g h 

> 

j 

jmein ‘winter’, OCS zima, Skr. heman ‘in winter’ 
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After u, including old w-diphthongs, the contrast between (labio)velars and palatals is 
neutralized in favor of the palatal, thus *lewkos > lovs ‘light’, Ved. -rocas-, Av. raocah-; 
*d h ugh,ter > dustr ‘daughter’, Av. dugadar-, Lith. dukte. 

A fourth, sporadically attested series of voiceless aspirates *p h , *t h and *k h , whatever 
its origin, seems to have yielded Arm. p c , t c , x, e.g. tup' ‘bush’, Gr. xxxpr| ‘a plant used for 
stuffing’, ort c ‘calf’, Skr. prthuka-, (fax ‘twig’, Skr. sakha ‘branch’, xaxanlf ‘laughter’, 
Gr. Ka/d^co ‘laugh’. 

In intervocalic position, at least some of the original tenues and voiced aspirates are 
subject to lenition. Here the development of the labials *p, *b h > -w- is secured by clear 
evidence, cf. *(h,)epi > ew ‘and’, Skr. dpi ‘also’, *-V-b h oros > -(a)wor, lit. ‘-bearer, -car- 
rier’ (adjective and agent noun suffix), and the same goes for the palatal *g h > z, cf. 
*d h eyg h os (or *d h oyg h os) > dez ‘heap’, Gr. xet^oq ‘wall’ , Av. pairi-daeza- ‘enclosure’. The 
regular continuation of intervocalic *-t- is *-w- before back vowels, as in *gen3,to > cnaw 
‘(s)he was born’, Gr. sysvexo, vs. *-y- before front vowels, as in *p3,ter > hayr ‘father’, 
Gr. 3iaxf|p. However, this only seems to be the case in the position immediately after the 
synchronically stressed syllable. Thus, the *-t- (dissimilated from *-tw-) of *k w etores 
c c ork c ‘four’, Ved. catvarah, is completely lost after the palatalization of *k w before -e-. 

The expected reflex of *-d h - remains a matter of controversy: according to some, the 
outcome is -z- (e.g. *h,lewd h - > eluzanem, aor. eluzi ‘extract, produce’, Ved. rodhati 
‘grows’, Gr. fut. eksuaopai ‘will come’), while others assume a development to -i simi- 
lar to that of *-§- in loanwords of Parthian origin (Jasanoff (1979: 145) points to the verb 
ayrem ‘burn’: Gr. a’ftko, which may, however, be an Iranian loanword). Apparently the 
velars and labiovelars were resistant to lenition, cf. aor. *e-lik w et > elik 0 ‘(s)he left’, Gr. 
ekuxe (cf. an alternative scenario in Kortlandt 1980a). 

As in other satam languages, original velars and labiovelars are prone to secondary 
palatalization before front vowels and *y in Armenian, though the exact conditioning 
is somewhat doubtful. According to an old idea by Holger Pedersen, later elaborated by 
Pisani (1950), Armenian and Albanian agree on a diverse treatment of simple velars and 
labiovelars: the labiovelar tenuis and voiced aspirate *k w and *g wh and the velar media *g 
would undergo palatalization to <5 c ,/and c respectively, while the labiovelar media *g w 
(somewhat like the situation in Greek) and the plain velar tenuis and voiced aspirate *k 
and *g h would remain unaffected: 

Velars and labiovelars before front vowels and y: 

*k > k c *k w > c c 

*g > c *g w > k 

*g h > g *g wh > J 

Cf. *k w > c £ in *k w etores > cork" vs. *k > k' in k c erem ‘scratch’, Gr. Keipco; *g wh > / in 
*g wh ermo- > jerm ‘warm’, Gr. Ssppoq vs. *g h > g- in getjk c ‘glands’, Lith. gelezuones; 
and *g > c in cmem (< *cim-) ‘squeeze’, OCS zbmg vs. *g w ena 2 > kin ‘woman’, Gr. yuvf), 
OIr. ben. The attempt to consider palatalization the regular development with all velars 
and labiovelars alike (Kortlandt 1975, Martirosyan 2010: 711) is bought at the cost of 
excessive analogy. 

Sibilants 

Word-initially the sibilant *s is continued as either h- or zero without any clear condi- 
tioning, e.g. *senos > hin ‘old’, Lith. senas vs. *septm > ewfn ‘seven’, Lat. septem, and 



428 BIRGIT ANETTE OLSEN 


deleted intervocalically, e.g. *swesor > *k h eur > ldovr ‘sister’. Two remaining problems 
are the word-final development, which may be either zero or -k c as in the plural marker, 
or most likely both (cf. the discussion of the nom. pi. ending) and the possibility of a 
special RUKI development that would connect Armenian with other satam languages 
such as Indo-Iranian and Balto-Slavic. At least we find both r and rs from *-rs-, perhaps 
as an indication of early dialectal variation, in the verb fammim vs. farsamim ‘wither, 
dry out’, Gr. repoopai (cf. also Martirosyan (2010: 709), who hypothesizes an original 
accent distinction between the two variants), but it is an object of debate whether the 
high vowels *i and *u had a similar effect on a following *s. If this is the case, as already 
suggested by Pedersen (1905: 228) and later argued by Winter (1975: 115ff.) and Olsen 
(1989a: 5-15), the result is -r, probably with an intermediate stage *-z, as, e.g., found in 
the aorist imperatives *d h eh r s > dir ‘put’ and *deh 3 -s > tur ‘give’ from original injunc- 
tives, and /-stem verbal nouns like dir ‘position’ and Hr ‘fullness’, which could represent 
the old //-stems *d h eh,tis (Av. -Haiti-, Goth, deds, Lith. detis, OCS -detb) and *pleh,tis 
(Ved. pratl-, Lat. com-pletio) respectively. 


Liquids 

The usual outcome of Indo-European *r is r, with the variant r occurring before nasals 
and as a reflex of *sr, cf. berem ‘carry’, Gr. (pspco vs. bem ‘burden’, Gr. (pepvrj ‘dowry’, 
and aru ‘stream’ < *sruti- (Gr. puou; ‘stream’, Ved. sruti- ‘road’). As is also the case in 
Greek and Anatolian, Armenian has a synchronic phonotactic constraint against initial r-, 
but it is a controversial issue whether this state of affairs should be projected back into 
the protolanguage. If this is the case we have to reconstruct the protoform of Ved. rajas-, 
Goth, riqiz ‘darkness’, Gr. epe(3oq ‘the dark of the underworld’, Arm. erek ‘evening’ with 
an initial laryngeal, *h,reg w os, and we would get a laryngeal-based “prothetic vowel” 
*h,- > e- in Greek and Armenian. If, on the other hand, initial *r- is accepted, we may 
reconstruct *reg w os, which would automatically yield the same result (cf. also the section 
“Laryngeals”). The development of prothetic vowels where no laryngeal is involved is 
also found before *r- in metathesized sequences (cf. “Consonant Clusters”) and in many 
Iranian loanwords with original initial r-, e.g. aroyr ‘brass’ from a North-West Iranian 
*rau5a-, originally ‘the red (material)’, arag, erag ‘quick’ vs. MParth. rag ‘quick’, Av. 
rayu. 

The distribution between plain l and velarized l is partly predictable: the general rule 
stipulates / initially (e.g. lizem Tick’, Gr. T-si/co) and intervocalically (e.g. gelum ‘roll, 
turn’, Lat. uoluo) vs. / before consonants (e.g. eln ‘deer, hind’, Lith. elnis ‘deer’), but the 
original distribution is often blurred by analogy. In final position both variants are found, 
with a tendency toward -/ in the oldest manuscripts (Meillet 1936: 47), thus e.g. gavl 
beside gavl ‘wolf’. 

Nasals 

The nasals *m and *n are among the most stable phonemes in Armenian as well as in 
most other Indo-European languages. Nevertheless, the development in final position 
is open for discussion. The nasal is apparently lost in the ace. sg. of o- and a-stems as 
opposed to a merger of *-m and *-n into final -n in cases like *g h yom > jiwn ‘snow’, Gr. 
Xtcbv, or *dom > tun ‘house’. Similarly, a final vocalic *-m is continued as *-n, e.g. ace. 
*pod-m > otn ‘foot’, Gr. noSa. The original distribution can hardly be determined without 
a certain amount of analogy: Kortlandt (1984: 97) assumes that “original postvocalic *-n 
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had apparently become a feature of the preceding vowel and subsequently been lost”; 
i.e. we have to posit an intermediate stage of nasalized vowels, and jiwrt and tun must 
be analogical. Alternatively, one might suggest that nasalization only occurred with the 
open vowels *o and *a, in which case jiwn and tun would be regular while, on the other 
hand, the endingless accusatives of animate i- and w-stems would have to be analogical. 


Semivowels 

The fate of the semivowels *y and *w is still open for interpretation in several respects. 
While intervocalic *-y- is lost at an early stage, e.g. *treyes > ereld ‘three’, the develop- 
ment in word-initial position remains unclear due to the scarcity of diagnostic examples. 
Most likely, the regular development is /- as in jur ‘water’, which should probably be 
compared with Lith. jures ‘sea’. The initial /- of leard ‘liver’ and luc ‘yoke’ should not 
necessarily be considered counterevidence insofar as leard may constitute a contamina- 
tion between *yek w rt (Ved. yak;t etc.) and the stem of either OEng. lifer etc. < *liparo- 
‘fat’ or Hitt, lissi- ‘liver’, while luc, rather than being a direct continuation of *(H)yugom 
(Ved. vugam, Gr. ^uyov, Lat. iugum ), may have been influenced by the verb lucanem 
‘dissolve’. There are two reliable examples of initial j-,ju ‘egg’, SCr .jaje, and jez ‘you’ 
(acc./dat./loc. pi.), Ved. yuyam, but in both cases the protofonn may have had two sub- 
sequent syllables in *yV- as a disturbing factor. According to Martirosyan (2010: 706), 
who derives the interrogative/indefinite stem *o- from the relative stem *yo-, the regular 
continuation is zero. However, o- is preferably connected with the more appropriate stem 
*k w o-, as *k w - and *p- seem to merge before *-o-, cf. also *k w olso- > oln ‘spine, back- 
bone’, Lat. collus, collum. 

In initial position *w- is reflected as g-, no doubt via an intermediate stage *gw- as 
also suggested by the early Georgian loanword ywino ‘wine’, Ann. gini < *woynio- (Gr. 
olvoq, Lat. uinum), but the intervocalic development is more complicated (cf. the treat- 
ments by Eichner (1978), Olsen (1986) and Matzinger (1992)) with the reflexes -g-, -w- 
(-V-) and zero. Possibly -g- is the regular continuation before back vowels apart from 
-u-; -w/v- before front vowels and and zero between homorganic vowels: cf. e.g. 
oivganem/ 'aroganem ‘sprinkle’ from the root *srew-, Ved. sravati ‘streams, flows’; hoviw 
‘shepherd’ < *h 2 owi-pah 2 -; gen./dat./loc. aler < *-ewer- from alewr ‘flour’, Gr. c&eiap < 
*h 2 leh,wr. An important detail has been added by Schindler (apud Matzinger 1992), who 
stipulated a regular development *-wi- > -wu- whereby the otherwise puzzling transfer 
of original /-stems like haw ‘bird’, Lat. auis, to the u-declension has found an elegant 
solution. 

Laryngeals 

The fate of the IE laryngeals is one of the most hotly debated issues of Armenian phonol- 
ogy, and while certain details seem relatively clear, there is hardly a single phonological 
conditioning where it has been possible to achieve full consensus. The following contexts 
are particularly noteworthy: 

Initial position before consonants ( *HC-)\ As is also the case of Greek and Phrygian, 
Armenian is characterized by so-called prothetic vowels deriving from laryngeals, with 
prominent examples like astl ‘star’, Gr. doifip, Hitt. h(a)sterzv s. Goth, stairno, Lat. Stella 
< *h 2 ster/l-; and avr ‘man’, Gr. dvf|p, Phryg. avap vs. Ved. nor-, Umbr. ace. pi. nerf < 
*h 2 ner-. There is, however, no general agreement as to whether the prothetic vowel is 
always a-, irrespective of the quality of the laryngeal (thus, e.g., Klingenschmitt 1982: 
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105 and Olsen 1985), or Armenian shares the feature of “triple representation” with 
Greek as argued by, e.g., Winter (1965), Kortlandt (1987) and Beekes (1988), so that 
*h,C- > *eC-, *h 2 C- > *aC-, *h 3 C- > *oC-. Since the adherents of “triple representation” 
at the same time favor a conditioned development of *o > a in such cases as anicanem 
‘mock’ < *h,neyd- (Gr. oveiSli/o, Goth, ga-naitjan ), the evaluation depends on the cor- 
rect interpretation of examples with *h r . This is further connected with the status of 
*r- in Indo-European: if *r- did occur in initial position in the protolanguage, erek ‘eve- 
ning’, Gr. epePoq may go back to *reg w os, and examples such as atamn ‘tooth’, Aeol. pi. 
eSovxeq, and anun ‘name’ with the possibly cognate Laconian names 'EvopaKpaxiSou;, 
’EvugaviiaSaq may be reconstructed with an initial *h,-; if not, the correct reconstruction 
of erek is *h 1 reg w os, which would then favor the idea of “triple representation”. This is, 
however, not what we find in other contexts. Prothetic vowels do not occur before w-, cf. 
gelmn ‘fleece, wool’ < *h,wel3 1 mn-, Hitt, hulana-, and gov ‘(s)he is’ < *h,wos-, Gr. ixeaa 
vuKia ‘spent the night’. 

Interconsonantal position (*CHC): When a laryngeal is “vocalized”, or more cor- 
rectly develops a prop vowel interconsonantally, this vowel is always a, so at least in 
this position a “triple representation” matching the situation in Greek is excluded: thus 
cnaw ‘engendered’ < *gen3[to, Gr. syevsxo with *-3 r ; hayr ‘father’ < *pa,ter, Gr. ;xaxf|p 
with *-s 2 -; arawr ‘plow’ < *h,ar3 3 trom, Gr. dpoxpov with *-a 3 -. However, as is also the 
case in, e.g., Iranian, interconsonantal laryngeals are sometimes lost, as in armn ‘root’ < 
*h 2 rm3,no-, Gr. oppevoq or dustr ‘daughter’ < *d h uga 2 ter, Gr. fluyaxr|p. The conditioning 
of this diversity is not quite clear (cf. the discussion in Olsen 1999: 768f.), but at least 
vocalization seems to be regular in initial and final syllables. 

Position between vocalic sonant and stop/s (*RHC): Sequences of sonant + laryngeal 
in the position between consonants are reflected as either aRa or aRaw with a prop vowel 
on both sides of the sonant. Thus, canac c em ‘know’ (for *canae c em) < *gnh,-ske/o-, Gr. 
yiyvcooKco, as opposed to the corresponding derivative canawt c ‘known; knowing’ < 
*gnh 3 to/i-, where the development to aRaw may be restricted to peak position as orig- 
inally suggested by Winter (1965: 110), cf. also amac c em ‘am ashamed’ beside amawf 
‘shame’ and alac c em ‘pray’ beside alawfld ‘prayer’. 

Initial position before vowels (*HV-): Apparently an initial consonantal laryngeal 
is sometimes reflected as h- in Armenian, while in other cases it simply seems to be 
lost. An attempt to account for this bewildering state of affairs has been made by Kort- 
landt (1980b) (followed by Beekes (1988: 76), Martirosyan (2010: 712f.) and Kloek- 
horst (2008: 75)), who further seeks to corroborate the theory with Anatolian evidence. 
According to Kortlandt’s analysis, the laryngeals *h 2 and *h, would be preserved in the 
protolanguage before an original e- vowel, while they would be lost before *o, i.e. *h 2 e- > 
ha-, *h 3 e- > ho- vs. *h 2 o and *h,o > o. This distinction would still be reflected in Arme- 
nian, where an initial h- (if not from *s- or *p-) is assumed to indicate an e-grade in the 
protoform. However attractive the idea of such a clear-cut regularity may be, the actual 
material is difficult to reconcile with the suggested distribution, as when, e.g., the word 
for ‘plow’, arawr, on account of the missing initial h- has to be analyzed as a zero-grade 
formation *h,rh 3 trom as opposed to the evidence for full grade in the other languages, or 
when the verb acem ‘bring, lead, conduct’ is considered a match of Lat. gerd < *h,gese/o- 
rather than, as traditionally assumed, identical with Ved. ajati, Gr. cxyco, Lat. ago. 

Position after i/u before a consonant (i/uHC): Traditionally a laryngeal following the 
high vowels i and u is assumed to be lost with a compensatory lengthening that would 
leave no trace in Armenian, e.g./// ‘sinew’ < *gwiHslah r , cf. Lith. gvsla, LaX.filum, or 
c c ut ‘roof’ < *skuHto-, cf. Gr. aieuxoq. However, as also seems to be the case in Greek 
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and Tocharian (cf. Francis 1970: 276-84, Normier 1977: 182 and 1980: 273), it has been 
suggested (Olsen 1992, 1999: 770-73; cf. also Olsen 2009 on the conditioning in Greek) 
that *h 2 and *h 3 caused “laryngeal breaking”, i.e. *ih 2/3 > *ya/ea, *uh 2/3 > *wa/ea, thus, 
e.g., erkar ‘long’ < *dwaro- < *duh 2 ro- = Ved. dura-, Lat. durus, Dor. 5apov. 


Consonant clusters 

A comparison between the syllable structure of the Indo-European protolanguage and 
that of Armenian makes it clear that very few original consonant clusters have survived 
in a historically transparent form. The treatment of several clusters remains unsettled, and 
at this place only a selection of the more well-established rules will be presented. A rich 
collection of potential examples may be found in Dzaukjan 1982. Cf. also Godel 1975: 
78-85. 


Clusters of two stops 

The evidence for such clusters is rather scarce, but at least the word-internal development 
*-pt- > -wf- is secured by ewfn ‘seven’ < *septm, analogically followed by uf ‘eight’ < 
*opto (for *ok w to), while the regular outcome of a palatal stop (after u ) + t is illustrated 
by dustr ‘daughter’ < *d h ug3,ter. Most likely, word-initial *pt- is continued as f-, cf. felt 
‘elm’: Gr. mekea, fer ‘side’ < *pter-, Gr. niepov ‘wing’. 


Clusters of resonant + stop 

Voiceless stops are generally assumed to be voiced in the position after resonants, e.g. 
bard (r'-st.) ‘pile’ < *b h rti-, Ved. bhrti- ‘burden’; hing ‘five’, Ved. panca, Gr. nevre. 
However, the cluster *nt apparently has a double reflex, nd as in anderk' ‘entrails’, Gr. 
evxepa, dr-and ‘doorposts’, Lat. antae, vs. n in, e.g., the 3 pi. ending -(V)n < *-(V)nti, 
or decadic numbers in -sun, e.g. eresun ‘30’, Gr. ipidKovia. The conditioning is differ- 
ently interpreted: Olsen (1989b: 227) stipulates voicing after a stressed vowel, while 
Clackson (1994: 56) assumes -nd- in internal position vs. -n word-finally after the loss 
of end-syllables. Somewhat similarly, the intervocalic clusters *mp and *mb h appear to 
have two outcomes, mb and m, e.g. *mp > mb in famb ‘(pack) saddle, ham’, ONor .pgmb 
‘paunch; bowstring’, Lith. timpa ‘sinew’ vs. *mp > m in amokc c em ‘soften; mix; cook, 
boil together’, Skr. sam-pacati (cf. Dumezil 1938 and Olsen 2005 on a possible regular 
distribution). 


Clusters of resonant + s/s + resonant 

For resonant + s/s + resonant we observe the following rules: 


*Ns > s 
*ls > l 
*rs > r or rs 
*sN > N 

*sr > f 


e.g. *memso- > mis ‘flesh, meat’, Ved. mariisd-, Goth, mimz 
e.g. al ‘salt’, Gr. akq 

e.g. Caramim/farsamim ‘wither dry out’, Gr. xepoopai (cf. “Sibilants”) 
e.g. gin ‘prize’, Ved. vasna-, Lat. uenum; nu ‘daughter-in-law’ < *snu- 
sos, Gr. vuoq 

e.g. k°er < *swesros, gen. of k‘oyr < *swesor ‘daughter’; aru ‘stream’ 
< *sruto-, Ved. sruta- 
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Clusters of stop + resonant 

In initial position a voiceless stop is lost at least before a liquid with further development 
of a prothetic vowel before what would otherwise be initial /■-, thus *pleh[tos > li ‘full’, 
Lat. -pletus; *k w lutos > lu ‘heard’, Gr. kLuto:;, Ved. sruta-, Lat. -clutus\ *treyes > ereb 
‘three’, Ved. trdyah, Gr. ipeiq, Lat. tres; *preysg w us > erecf (u-st.) ‘elder’, Cret. jipsioyrx;. 
Word-intemally, we generally find a lenited version of the stop: 


*-tr- > -wr(-), 


*-kr- or *-kr- > -wr(-) 


*-tl- > -wl(-)/-wl(-) 


*-pn- > -wn(-) 


e.g. arawr ‘plow’, Gr. apoxpov, Lat. aratrum; hawr < 
*p3,tros, gen. of hayr ‘father’ 

e.g. mawrut ‘beard’, Ved. smasru- ‘beard’, Lith. smab-as 
‘chin’ 

e.g. ul (for *uwl; o-st.) ‘kid’ < *putlos, Osc. puclo- ‘son’; 
agent noun suffix -awl, e.g. cnawl ‘parent’ < *-VtlV- (cf. 
“Nominal Word Formation”) 

e.g. town (/-st.) < *dapni- ‘feast’, Lat. daps ‘sacrificial meal’, 
ONor. tafn ‘victim’ 


Metathesis 

Metathesis regularly occurs in clusters of media or voiced aspirate + liquid, triggering the 
development of a prothetic vowel word-initially, cf. the following examples: 


*dr 

> 

rt 

*swidro- > k'irtn ‘sweat’, Latv. sviedri 

*gw r - 

> 

Mrk- 

*g w rah,wn- > erkan ‘grindstone’, Ved. gravan- 

*b h r 

> 

rb 

*kub h ro- > surb ‘pure, holy’, Skr. subhra- 

*gh r 

> 

rj 

*meg h ri > merj ‘near’, Gr. ps^pt 

*g(w)hj._ 

> 

Vrg- 

argand ‘womb’, Slav, gripdb ‘breast’ 


In case of the structure * Vr - r there is regular dissimilation to (V)l - r, thus, e.g., *b h rater 
> elbayr ‘brother’, *b h rewr > albewr ‘fountain’, Gr. cppeap. 

A small group of exceptions includes srunk' ‘leg, shank’ < *krusni- or the like (Lat. crus 
‘shank’, Lith. slaiinis ‘hip, thigh’), not *Vrunk c ; krunk ‘crane’, Lat. grus, not *Vrkunk; 
and possibly glia ‘head’, Lith. galva, etc., if this is from *g h luH- rather than the morpho- 
logically unlikely *g h 5lu-. These examples share the features of an initial velar or palatal 
and M-vocalism, so perhaps a sequence *KRu- would develop a prop vowel to *KuRu- 
whence, with loss of the unaccented -«-, KRu-. 


Clusters of two resonants 

In a few cases we have evidence for a development *-ln- > -1-, e.g. folum ‘let’ < *tol-nu-, 
Lat. tollo (cf. Klingenschmitt 1982: 243fi), while it is more difficult to determine the 
conditioned distribution of wn beside mn from the sequence *mn (cf. the discussion in 
Olsen 1999: 792fi). 

Clusters of s + stop/stop + s 

The cluster *st is preserved initially as well as in original intervocalic position, thus stipem 
‘press’, Gr. oisipco; sterj ‘barren’, Gr. crtsipa; similarly *zd in ost ‘branch’ < *(H)o-zdo-, 
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Gr. oCpq. The regular reflex of *sp- is more uncertain: sp- as in sparnam ‘threaten’, Lat 
sperno, or p c - as in p c oyf ‘zeal’, Gr. O3iou8f). An even more complicated matter is the fate of 
.sK/Ks-clusters. According to Godel (1975: 80),£and *kare neutralized in connection with 
*s, and, indeed, we find quite a number of sound etymologies including such a cluster with 
a common reflex c c , e.g. *-ks in vec c ‘six’ < *suweks, *-sg w - in erec c ‘elder’ < *preysg w us, 
Cret. 7rp£ioyu<;, *sk- in c'urt ‘cold’, OCS seven ‘north’, and *sk- in c c ayfem and c c aytem 
‘bubble, sparkle’, Lat. seated, Lith. skaidi'us; similarly, the original three -consonant clusters 
*-s-sk- in avc c ‘visit, search’ < *h 2 ays-sk- and *-k-sk- in harc'anem ‘ask’ < *prk-sk-, Ved. 
prcchati. Another potential development is which may be found in *-sk-verbs of the 
type canac'em ‘know’ < *(gi-)gnh 3 -ske-, Gr. yiyvcboKco, alaefem ‘pray’ < *(si-)slh,-ske-, 
Gr. iMaKsofkn, but here it is a matter of dispute whether the Armenian verbs represent a 
faithful continuation of the Indo-European pattern (thus, e.g., Klingenschmitt 1982: 84) 
or an extended suffix *-sky-. Finally, there exists a small group where an initial s- appar- 
ently goes back to *sk- (or *skh-, cf. Martirosyan 2010: 516): set ‘oblique’ < *skel-no-, Gr. 
GKeXkoc, ‘bandy-legged’, Alb. gale Tame’, Lith. kulnas ‘heel’; sil ‘cockeyed’, OHG skelah; 
serf ‘splinter’, Lith. skeden’d; sertem ‘scratch’ < *skerd-, Lith. skerdziu. In all these cases, 
the clusters occur at the same time initially and before front vowels. 


7-clusters 

The correct interpretation of the fate of v-clusters is severely hampered by the scarcity of 
reliable examples. Thus, the regular continuation of *ty and *dy is assumed to be either 
the “shibilants” c c and c or the affricates c c and c, while the development of *d h y > / is 
quite uncontroversial, cf. me/ ‘middle’ < *med h yo- (with *e > e before a palatal, cf. also 
*(h,)ekwos > es ‘donkey’), Ved. madhya-, Lat. medius. Truly, the assumed continuations c° 
and c would mean a neat parallelism between the three series, but the best example of *ty 
> c c is the verb koe'em ‘call’, allegedly < *g w otye- and thus a cognate of Goth, qipan ‘say, 
speak’, and this may have been influenced by the almost synonymous god'em < *wok w ye- 
‘shout, cry, call out’. For *dy > c the phonetic details of the alleged connection between 
orocam, -em ‘chew, ruminate’ and Lat. rodo, Ved. radati ‘gnaw’ (Greppin 1993: 19) are 
unclear. According to an alternative theory (Olsen 1993), *ty is continued as the affricate <f 
in, e.g., k c ec c ‘isolated’ < *swetyo-, Lith. svecias ‘stranger, guest’ (from the reflexive stem 
*swe-) and the verb anc c anem ‘pass, cross’, Gr. avnaopai ‘stand opposite’, aviidco ‘come 
toward’ (cf. also under “Verbs” on the subjunctive). The existence of parallel examples of 
*dy > c (cf. Godel 1975: 82) largely depends on the interpretation of verbs such as hecanim 
‘ride’, Gr. eippou ‘sit’, anicanem ‘mock’, Gr. oveiSI^co, Goth, ganaitjan as the continuations 
of y-presents rather than s-aorists. The development of *k (w) y > c c is safely attested in ac c k c 
‘eyes’, Gr. oaas, OCS oci, while *ky most likely yields c c , cf. luc c anem Tighten’ < *l5wkye- 
(*k > *k after M-diphthong; Klingenschmitt 1982: 194). Intervocalic *-sy- is continued as 
-V-, thus the o-stem gen. sg. ending *-osyo > -ov, and in the position after a sonant, *-y- is 
reflected as -/(-), e.g. anurj ‘dream’, Gr. oveipo:;. 


17-clusters 

Some of the most bizarre phonetic developments in Armenian are found with w-clusters, 
though the end results are less surprising when we consider both the strengthening of *w 
> g and the general tendency toward metatheses. Thus, *sw and *tw are reflected as t in, 
e.g., *swesor > tovr ‘sister’, Ved. svasar- and *two- > Ifo ‘your’, Ved. tva-. As famously 
discovered by Meillet (1924) *dw is continued as (Y)i'k in examples such as erku ‘two’, 
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Lat. duo and erkar ‘long’, Gr. Srjpocg. This remarkable sound law can hardly be dismissed, 
though it has often been doubted and its implications widely debated, cf. Klingen- 
schmitt 1982: 225 and238f., Vennemann 1986, Kortlandt 1989 and Viredaz 2003. On this 
background one might expect a parallel development *d h w > rg, and in fact this is what 
Klingenschmitt (1982: 238f.) assumes for the verb argelum ‘hinder, keep back’, which he 
compares with the synonymous ONor. duelia, OHG twellen. The regular continuation of 
the cluster *kw is s, cf. sun ‘dog’, Ved. svS; es ‘donkey’, Ved. diva-. 


MORPHOLOGY 

The characteristic features of the Indo-European morphological system are only partly 
preserved in Armenian, with a number of restructurings, simplifications and losses in 
both the noun and the verb. Several endings and morphological markers have been ren- 
dered opaque by regular sound change, including the loss of final syllables, and to the 
extent the corresponding categories are maintained, we observe a series of countermoves 
against the regular phonetic development such as the construction of suffix clusters and 
the addition of clarifying postpositions or adverbs. However, except for the numerous Ira- 
nian noun suffixes, the morphological elements as such generally seem to have belonged 
to the protolanguage. 


Nouns 

The system of noun inflection in Classical Armenian has undergone significant changes: 
the category of grammatical gender has been given up altogether, the dual is lost, and 
the case system, while preserving a distinction between seven cases in the pronouns (the 
inherited eight minus a separate vocative), has been subject to several syncretisms, so that 
a paradigm will usually only distinguish between four different forms in the singular and 
four in the plural. On the other hand, an archaic ablaut pattern has been preserved to some 
extent, particularly in the inflection of the n-stems and a few inherited kinship terms. 


Vocalic stem classes 

The inflection of the four vocalic stem classes, o-, a-, i- and «-stems, is almost identical. 
However, the a- and /-stems merge in the gen./dat./loc. sg., and the o-stems exhibit some 
aberrations in the oblique cases of the singular. 


Vocalic stem classes: 


o-stems a-stems 
es ‘donkey’ am ‘year’ 


sg. nom./acc. 

es 

am 

gen./dat. 

is-oy 

am-i 

abl. 

is-oy 

am-e 

instr. 

is-ov 

am- aw 

loc. 

es 

am-i 

pi. nom. 

es-fc 

am-t 

gen./dat./abl. 

is-oc c 

am-ac c 

ace. /loc. 

es-s 

am-s 

instr. 

is-ovfc 

am-awk 6 


/-stems 

w-stems 

ban ‘word’ 

zard ‘ornament’ 

ban 

zard 

ban-i 

zard-u 

ban-e 

zard-e 

ban-iw 

zard-u 

ban-i 

zard-u 

ban-id 

zard-ld 

ban-ic c 

zard-uc c 

ban-s 

zard-s 

ban-iwk' 

zard-ut 
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A subtype of the a-stems with gen./dat./loc. sg. -ay is practically restricted to proper 
names, e.g. Vardan, -ay (Weitenberg 1989). 

The numerous case syncretisms are naturally explained by the fact that final syllables 
would generally be lost. Thus, the endingless forms of the nom./acc. sg. are probably 
the regular continuations of *-os, *-om (o-st.), *-ah 2 , *-ah,m (a-st.), *-is, *-i (7-st.) and 
*-us, *-u (u-st.); the missing ending of the old animate accusative of /- and u-stems, 
*-im and *-um, may be either regular or analogical. In the o-stems the disyllabic ending 
of the gen. sg. *-osyo > -oy (cf. Skr. -asva) was extended to the dative and the ablative, 
where the inherited ending *-5d or *-ad would undergo apocope, and the endings of 
the instr. sg. are unproblematically derived from the stem vowel + *-b h i (- u repre- 
senting *-uw), but otherwise most of the inherited endings would have been lost by 
apocope. Therefore, the loc. sg. of the o-stems is generally endingless (IE *-oy > -0), 
and the gen./dat. of the a- and /-stems in -/ and that of the u-stems in -u must go back 
to disyllabic forms *-i-os, *-u-os, etc. The locative in -/, also occasionally found with 
the o-stems, most likely has its origin in a postposition *-h,en. Similarly, the ablative 
in *-e may be explained as an original postposition *-eti, cf. Ved. ati ‘over, beyond’. If 
intervocalic *-t- is continued as *-w- before a back vowel, as seems to be the case, the 
traditional derivation from *-etos corresponding to Ved. mukhatah ‘from the mouth’ 
etc. (Meillet 1936: 73) is phonetically impossible. 

In the plural the most troublesome ending is the nominative in -t where one expects 
a final *-s. Klingenschmitt (1982: 23f.) explains -Id in the nom. pi. beside the zero end- 
ing in the nom. sg. as sandhi variants: when in the a-stems the singular *-ah, > -0 was 
opposed to the plural *-ah,(e)s > *-0/-k c , -If was perceived as the characteristic plural 
marker, and in the course of time -If was generalized in all plural forms, nominal as well 
as verbal, thus also instr. pi. -ovlf etc., 1 pi. verbal ending -emt etc., pronouns such as 
melf ‘we’, dukf ‘you’, and, perhaps most significantly, erekf ‘three’ < *treyes and c'ork' 
‘four’ < *k w et(w)ores, the only numerals ending in *-s. Alternatively, Kortlandt (1984: 
89f.) assumes that -If is the only regular reflex of postvocalic *-s. In that case, however, 
the nominative must have been analogically replaced by the accusative, and it would be 
difficult to understand the general merger of masculine o-stems and neuter 5-stems. 

The -5 of the acc./pl. is the regular outcome of the m./f. ending -Vns in the vocalic 
classes, while the same ending in the loc. pi. may have been extended from the M-stems, 
i.e. *-(V)n-su > -5 with analogical restitution of the n in the n-stems themselves, e.g. 
harsn ‘bride’, acc./loc. pi. harsuns for regular *harsus. Another possible source is the 
5-stems, where a geminate *-s-su would probably yield -5 as in *h,es-si > es ‘you are’, 
while *-s- of the vocalic stems would be lost in intervocalic position without a trace. The 
origin of the plural oblique ending -(V)c £ remains a bone of contention. An attractive 
solution has been offered by Seldeslachts (1991: 261), who suggests a derivation from 
the original adjective suffix *-isko- > -ic c -, later analogically adapted to the other stem 
classes as -ocf, -ac° etc. Finally, the instr. pi. is formed by adding the plural marker -t to 
the instr. sg. Cf. also Matzinger 2005 for a more detailed discussion of the case endings. 

In some cases the vocalic stems are direct continuations of the inherited patterns, 
thus es ( o ) ‘donkey’ < *(h,)ekwo- ‘horse’, Lat. equus etc.; am (a) ‘year’ < *sm(h) 
ah,-, Ved. sama- ‘season’; ban (/) ‘word’ < *b h ah,-ni-, ONor. bon; z-ard ‘ornament’ 
( u ) < *-h,rtu-, Ved. rtu- ‘rule, order’, Lat. artus ‘limb’, but they have also been fed by 
moribund categories and by loanwords. Thus, e.g., neuter 5-stems like jet ‘tail’, Av. 
-zadah-, have been included in the o-stems, and root nouns like avc ‘goat’, Gr. diq, in 
the /-stems, presumably under the influence of the oblique plural *-isko- > -if-. Com- 
pare also the old root noun otn ‘foot’ where the ace. sg. *pod-m explains the singular 
inflection as an «-stem, while the plural, otlf, otic c , is treated as an /-stem. A number of 
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u-stems such as haw ‘bird’, Lat. auis, go back to older /-stems as a consequence of the 
regular development of *-wu- > -wi-. 

The oldest layer of Iranian loanwords seems to have preserved the inflectional class 
they had in Old Iranian, as originally assumed by Meillet (differently, Schmitt 1983: 
100), e.g. the /-stem uxt ‘oath’, Av. uxti- ‘word, speech’; the u-stcm xrat ‘advice’, Av. 
xratu-; or, not least, the numerous a-stem loanwords and loan suffixes going back to 
Indo-European thematic stems. 


Alternating stem classes 

Except for the regular syncope and vowel reduction in unaccented syllables, the vocalic 
classes display no stem alternation. With the consonantal subtypes, i.e. the «-, r- and 
/-stems, on the other hand, a distinction is made between at least the stem of the nom./ 
acc. sg. and that of the oblique forms. 


TV-stems 

From a historical point of view, the Armenian n-stems have several sources: some, like 
gain ‘lamb’, Gr. dpf|v, go back to various types of Indo-European n-stems, including 
-men-stems, Hoffmann formations and -nt-stems; others have been adapted from orig- 
inal root nouns, e.g. otn ‘foot’ from acc. sg. *podm, and still others derive from stems 
in *-no-/-na- such as t c orn ‘grandchild’ < *torno-, Lith. tarnas ‘servant, farm boy’ and 
hern ‘burden’ < *b h emah 2 -, Gr. (pepvf| ‘dowry’. As an archaic feature of the n-stems the 
paradigms display three or even four stem alternants: nlinlan, nlunlan and nlinlanlun, 
where -in-, -un- and -an- in principle represent the e-, o- and zero grade of the suffix 
*-en-, *-on-, *-n-, so that it is still possible to distinguish between various inherited ablaut 
types. Thus, e.g., anjn ‘person; self’, gen. anjin, nom. pi. anjint, continues a hystero- 
dynamic paradigm corresponding to ONor. angi ‘smell’, while the compound mi-anjn 
‘one -person’, i.e. ‘monk’ with gen. mi-anjun, nom. pi. mi-anjunk c , points to a composi- 
tional o-grade in the suffix with a similar allomorphy as Gr. jionf|p, acc. Jiaispa ’father’ 
vs. stmaicop, acc. eintdiopa. An example of an alternating (proterodynamic) paradigm is 
harsn ‘bride’ < *prkon (root *prek- ‘ask’; cf. Hamp 1988). 


The H-stem harsn ‘bride’ 


sg. nom./acc. 

harsn 

< *-on 

gen./dat./loc. 

harsin 

< *-en- 

abl. 

harsne 

< *-en- 

instr. 

harsamb 

< *-n- 


pi. nom. 

harsunt' 

< *-on- 

acc./loc. 

harsuns 

< *-on- 

gen./dat./abl. 

harsanc c 

< *-n- 

instr. 

harsambt 

< *-n- 


An extremely widespread subtype consists of nouns in -iwn, in particular abstract 
nouns in -ut c iwn, gen. -u-t c ean, e.g. cerutHwn ‘old age’ from cer ‘old’. Finally, the mono- 
syllabic tun ‘house’, Gr. 5c5, and sun ‘dog’, Ved. svS, Gr. kucov, have the oblique forms 
tan and san. 

N- stem plurals are also found in some nouns denoting persons or animals, e.g. nu (gen. 
nuov) < *snuso- ‘daughter-in-law’ with the plural nuant, erec c ‘elder’ < *preysg w us with 
the plural eric'unk 0 . Here the n-extension probably corresponds to the individualizing 
H-stem suffix of Germanic. 
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R-stems 

On the whole, the /-inflection is a relatively rare type, but, still, it contains some of the 
most essential elements of the vocabulary, the basic kinship terms. We may distinguish 
between three subtypes: 

1) “regular” /'-stems in nom./acc. sg. -r, gen./dat./loc. -er, e.g. dustr, dster ‘daughter’ < 
nom. *d h ug3 2 ter, with generalized suffix -ter- in the rest of the paradigm; similarly 
loanwords such as kaysr, kayser ‘emperor’. 

2) a small number of *-wer-/-wen-heteroclitics, e.g. alewr (later aliwr) ‘flour’ < 
*h 2 leh,wr, Gr. akeiap, gen./dat./loc. aler (Eichner 1978). 

3) the kinship terms hayr ‘father’, mavr ‘mother’ and elbavr ‘brother’ with preservation 
of the inherited ablaut pattern, here exemplified by the paradigm of hayr. 


Paradigm of hayr ‘father’ 


sg. nom./acc. 

hayr 

< *pa 2 -ter 

pi. nom. hart 

< 

*pa 2 -ter-es 

gen./dat./loc. 

hawr 

< *pa 2 -tr-6s etc. 

acc./loc. hars 

< 

*pa 2 -ter- + -s 

abl. 

hawre 

< *pa 2 -tr- + -e 

gen./dat./abl. hare c 

< 

*p3 2 -tr-sko- 

instr. 

harb 

< *p3 2 -tr-b h i 

instr. harbt 

< 

*p3 2 -tr-b h is 


The paradigm of tovr ‘sister’ is quite isolated and highly archaic: nom./acc. sg. tovr < 
*swes5r; gen./dat./loc. ter < *swesros etc.; instr. t'erb < *swesrb h i, analogical for *k c arb 
< *k c earb; nom. pi. tort < *swesores. 


Z-stems 

Besides a few secondarily adapted loanwords like skutl, gen. sktel ‘plate’, Lat. scutella, 
the only member of this stem class is astl, gen. astel ‘star’. The reason why this word is an 
/-stem rather than an /-stem like, e.g., Gr. daifip is unclear: perhaps analogical influence 
from the word for ‘Sun’, *sah 2 wol, which is, however, not transmitted in Armenian, or 
perhaps even a trace of an old /'//-alternation as suggested by Olsen (2010: 81). 


Heteroclitics and irregular paradigms 

A number of //-stem adjectives exhibit a peculiar heteroclitic pattern with -r in the nom./ 
ace. sg. and /7-stem inflection in the plural, e.g. barjr ‘high’, gen./dat./loc. barju, nom. 
pi. barjunt from the root *b h erg h -, where the //-stem has a parallel in Hitt, parkus and 
the /7-stem may go back to the same nt-stem as Ved. brhant-, Av. borszant-. The origin of 
the -/' of the nom./acc. is more doubtful. It hardly reflects the competing adjective suffix 
*-ro- as in Toch. A. parkar, B. parkare since //-adjectives in general seem to be replaced 
by /'o-adjectives in Tocharian. Moreover, a nom./acc. in -/• also occurs in a few original 
//-stem neuters, artawsr, pi. artasut ‘tear’ < *draku, cf. Gr. Soncpu; cunr ‘knee’, Gr. yovu; 
and melr ‘honey’, which is usually considered a contamination between *melit ‘honey’ 
(Hitt, melit etc.) and the //-stem *med h u, Ved. madhu ‘honey, mead’, while it is missing 
in masculine //-stems such as z-gest ‘clothing, garment’ < *westu-. On the whole, the best 
option may well be an idea by Alexis Manaster Ramer (p.c.), who stipulates a regular 
sound change of final -// > -/', which would imply that the neuter gender was generalized 
in the adjectives. 
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A modest number of nouns cannot be fitted into the regular inflectional types, thus, 
e.g., akn ‘eye’, gen. akan, pi. ac c k c , cf. Gr. oaae; unkn ‘ear’ with the plural akanjfk f; ayr 
‘man’ < *h,ner, Gr. dvip, gen. arm, metathesized from *h,nros, Gr. dvdpoq; kin ‘woman’ 
< *g w ena, with the enigmatic oblique form knoj and the plural kanavt with a remarkable 
similarity to Gr. yuvcdiceq. 


Gradation 

The Armenian adjective has no obligatory gradation (cf. Jensen 1959: 68fi). For the 
expression of the comparative, the positive is generally used either alone or with k'an (= 
Lat. quam) or k'an z-, cf. e.g. Matt. 18.6: law e nma ‘it is better for him’ or Mark 14.5: 
p c otr e If an zamenayn sermanis ‘it is smaller than all seeds’. Flowever, the original 
intensive in -agovn, an Iranian loan suffix, also serves as a comparative. For the expres- 
sion of the superlative function there are several possibilities: the simple positive, e.g. 
Matt. 5.19: p c otr koc'esti ‘he shall be called the smallest’; reduplication, e.g. mecamec 
‘the largest’ (or more commonly ‘very large’); or compounds with amena- ‘of all’, e.g. 
amenasurb ‘the holiest’ (also ‘very holy’). 


Pronouns 

The pronominal system of Armenian consists of the following categories: the personal 
pronoun; the reflexive, reciprocal and possessive pronouns; the interrogative, indefinite 
and relative pronouns; and the demonstrative pronouns. Only the pronouns have a dis- 
tinction between the nominative and accusative and between the genitive and dative of 
the singular. 


The personal pronouns 


Singular 

First person 

Second person 

nom. 

es 

du 

acc./loc. 

is 

tez 

gen. 

im 

to 

dat. 

inj 

tez 

abl. 

inen , injen 

ten, tezen 

instr. 

inew 

tew 

Plural 



nom. 

me/f 

dut 

ace. 

mez 

jez 

gen. 

mer 

jer 

dat./loc. 

mez 

jez 

abl. 

men] 

jenj 

instr. 

mewk f 

jewt 


1 sg.: the nominative es seems to be a sandhi variant for *ec < *(h 1 )ego, while the gen. 

im, from an original possessive pronoun *(h[)emo-, with assimilation to in- from 
the acc./loc. is < *im-s, dat. inj, apparently provided the stem of the remaining case 
forms. The dative contains the same element *-g h i as Lat. mihi. 

2 sg.: nom. sg. du for expected *t c u < *tu has the same (enclitic) development of 

*t- as the demonstrative d- < *to-. Flere again, gen. to < *two- (/-twe-) is the old 
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possessive pronoun whose stem has spread to the rest of the paradigm. The dat. 
Idez < *twe-g h i must be a substitution for *-b h i, cf. Lat. tibi, under the influence of 
the 1 sg. 

1 pi. meld goes back to either *mes like Lith. tries in analogy with the verbal ending or 

*meyes for *weyes with initial *w-, cf. Skr. vayam, Lat. uos etc. The acc./dat./loc. 
mez has probably taken over the element *-g h i from the singular, and the gen. trier 
contains the suffix *-(t)ero-, cf. Gr. ppsiepot;. 

2 pi. duk' for *yus > *juk c (?) is influenced by the singular du. 


The reflexive, reciprocal and possessive pronouns 

The reflexive pronoun iwr is common for the genitive, dative and locative; there also 
exists an abl. iwrtne, an instr. iwreaw and a plural acc./loc. iwreans, gen./dat./abl. -eanc c , 
instr. -eambld. Traditionally the stem iwr(-) is derived from *sewero-, but this is formally 
problematic as *-ewe- would be expected to yield *-e- (cf. gen./dat./abl. aler ‘flour’ < 
*-eweros etc.), and it is difficult to provide a functional justification for a vrddhi deriv- 
ative *sewero-. According to an alternative explanation by de Lamberterie (2014), iwr 
rather derives from *setro- (reflexive stem *se- beside *swe-), which, in the correspond- 
ing paradigm of the possessive pronoun, would be continued as ewr, gen. iwrov with the 
same change of ew > iw in unaccented syllables as the loanword ewl (later iwl), iwlov 
‘oil’, and hewr (later hiwr), hiwroy ‘stranger, guest’, allegedly a lexicalized variant of 
iwr. Another reflexive stem is ink'n ‘-self’. 

The function of reciprocal pronouns is fulfilled by the apparently reduplicated stems 
mimeans and irears (ace.) ‘each other’, the former presumably from *smi-smia(h 2 )-, cf. 
Lat. alter . . . alterum ‘one . . . the other’. 

As in other Indo-European languages the possessive pronouns are o-stem adjectives: 
im ‘my’, k°o ‘your’, trier ‘our \jer ‘your’. The forms are identical with the genitive of the 
personal pronouns. 


The interrogative, indefinite and relative pronouns 

In the protolanguage the interrogative and indefinite pronoun were derived from an alter- 
nating stem *k w e/o- : *k w i-, cf. Ved. *k w as > kah ‘who?’ vs. the indefinite particle -cit < 
*k w id; Lat. neuter adjectival quod vs. substantival quid, both interrogative and indefi- 
nite; or Av. gen. sg. kahiia < *k w osyo beside cahiia < *k w esyo. The original distribution 
between *k w e/o- and *k w i- is not entirely clear, but at least it seems likely that *k w e- is 
an archaism as opposed to *k w o- with the generalized o-vocalism of the thematic vowel 
known from the noun inflection. Despite the general loss of gender distinctions in Arme- 
nian the interrogative and indefinite pronouns distinguish between an animate ‘who?’, 
nom./acc. o(v), gen. ovr, dat./loc. um, abl. ume, instr. orov and a neuter ‘what?’, nom./ 
ace. zi, gen. er, dat./loc. (h)im, abl. itne, instr. iw. If *k w merges with *p before *o (cf. also 
oln ‘spine, backbone’ < *k w olso-, Lat. collus, collum, Goth, hals; apparently no coun- 
terexamples), we are dealing with the stem *k w o- in the animate forms with gen. ovr < 
*k w osyo- + -r (Ved. kasya, Av. kahiia) and dat./loc. um < *k w o-sm- (Ved. dat. kasmai, 
loc. kasmiri). The neuter z-i, on the other hand, cannot go back to *-k w id (> *(-)e c (i), cf. 
the equation oc c ‘not’ = Gr. ouki < *h 2 oyu-k w id for *ne h,oyu-k w id ‘not in a lifetime’, cf. 
Cowgill 1960), so here we have to assume analogical absence of palatalization in anal- 
ogy with the animate ov; the gen. er seems to continue *k w esyo- + -r, again with missing 
palatalization, but otherwise corresponding to Av. cahiia. An adjectival variant or may go 
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back to *k w otero- = Ved. katara-, Gr. iroispoq (Schmitt 1981: 123). The same forms are 
also used in the function of relative pronouns, as, e.g., in the Italic languages. 

The indefinite ot' ‘someone’ and -t in the phrase c c -ik c ‘there is not’ are constructed 
by the addition of -If < *-k w e to the stems o- and i- of the interrogative pronoun, cf. Ved. 
kas-ca ‘someone’, Lat. quisque ‘everyone’. Another indefinite pronoun is omn ‘someone’. 


Demonstrative pronouns 

A striking feature of Armenian morphology is the consistent distinction between three 
demonstrative markers with first, second and third person deixis, s ‘hie’, d ‘iste’ and 
n ‘file’ from *ko-, *to- and *no- respectively. These stems are the basis of the enclitic 
postponed articles - 5 , -d, -n; the anaphoric sa, da, na (probably from *so-ay, *do-ay, *no- 
ay); the demonstrative ays, ayd, ayn; the pronouns expressing identity sovn, dovn, novn 
‘this, that same’; various adverbs such as avsr, aydr, andr ‘here, hither; there, thither’, 
asti, avti, anti ‘from here, from there’, avspes, avdpes, aynpes ‘in this, that way’; the cor- 
responding adjectives ayspisi, aydpisi, avnpisi ‘of this, that kind’; and the interjections 
a(ha)wasik, a(ha)wadik, a(ha)wanik Took here/there’. 

The anaphoric pronouns, here exemplified by sa (similarly da and na), are inflected 
as follows: 


The anaphoric pronoun sa 

Singular nom./acc. s-a 

Plural nom. 

sofc-a 

gen. 

sor-a 

acc./loc. 

sos-a 

dat./loc. 

sm-a 

gen./dat./abl. 

soc c -a 

abl. 

sm-ane 

instr. 

so/f-aw/f 

instr. 

sov-aw 




The basic stems are so-, do-, no- with “internal inflection” where the genitives sora, dora, 
nora may be analyzed as *-otero- + the enclitic particle -ay, and the dat./loc. and abl. sg. 
as *so-sm-. 


Numerals 

The cardinal numbers from one to ten can all be derived from their Indo-European proto- 
forms with a few minor adjustments: mi ‘one’ < *smih 2 , the old feminine, cf. Gr. pia; erku 
‘two’ < *dwd, Lat. duo etc.; ereif ‘three’ < *treyes, Lat. tres, Ved. trayair, c c ork c ‘four’ < 
*k w etores, dissimilated from *k w etwores, Ved. catvdrah; hing ‘five’ < *penk w e, Gr. 7ievie, 
Ved . panca; veef ‘six’, perhaps from a Lindeman variant *suweks (Klingenschmitt 1982: 
61), cf. W chwech; ewt c n (with the variant eawt c n) ‘seven’ < *septm, Lat. septem etc.; 
ut c ‘eight’, probably from *optd rather than *okto under the influence of ‘seven’; the 
somewhat enigmatic inn ‘nine’, Gr. ewea vs. Lat. nouem etc., whether from a metathe- 
sized *enun as assumed by Eichner (1978: 152) or actually exhibiting a laryngeal-based 
prothetic vowel; and, finally, tasn ‘ten’ < *dekm with unclear vocalism. The singular 
mi, which may also have the function of an indefinite article, has an aberrant inflection 
with gen. mioj beside mioy (cf. kin, knoj ‘woman’, also a stem originally ending in *-h 2 ). 
Apart from Ifsan ‘20’, Lat. uiginti etc., the decadic numerals are characterized by the 
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element -sun < *-(d)komt3 2 , Gr. -kovtoi, in some cases with compensatory lengthening 
of the preceding element: eresun (< *eria-) ‘30’; tarns un ‘40’ as if < *k w twr-, cf. Lat. 
quadraginta; the formally remarkable yisun ‘50’ beside hing ‘five’; vafsun ‘60’ beside 
vet ‘six’; ewfanasun ‘70’; ufsun ‘80’; and innsun ‘90’. However, the inherited word 
for ‘ 100’, *kmtom, has been replaced by the etymologically unclear hariwr, and the Ira- 
nian loanwords hazar and biwr have been introduced for the larger numbers ‘1000’ and 
‘10,000’. 

Ordinals, apart from arajin ‘first’ < *prh 3 wyo- + -ino- (cf. Ved. ad v.purvyam ‘first’) are 
derived with the suffixes -(i)r, -ord, or the combination (-e-)rord: erkir and erkrord ‘sec- 
ond’, erir and errord ‘third’, c'orir and c c orrord ‘fourth’, hngerord ‘fifth’, with preserva- 
tion of the final -e- of *penk w e and hence vec'erord ‘sixth’. Erkir and erir also have the 
function of numeral adverbs ‘in the second place, zweitens’, ‘in the third place, drittens’, 
perhaps from the adverbial *dwis, *tris ‘twice, three times’ with RUKI development of 
*-s rather than the otherwise isolated protoforms *dwiro-, *triro-, cf. Ved. dvth, trih ; Gr. 
Sfg, xpiq, Lat. bis, ter. The semantic development would be ‘two, three times’ — > ‘the sec- 
ond, third time’ — > ‘second, third’, cf. also the partly competing erkic's ‘twice, the second 
time’ and eric c s ‘three times, the third time’. As for the productive suffix -ord, the most 
reasonable reconstruction may be *k w ortos, cf. Lith. kartas, OCS b-atr, ‘time’ (Winter 
1992) or even *-k w rt as in Ved. sakrt, Av. haksrst ‘once’ (with special development of the 
sonant after labial/labiovelar), in particular in consideration of the alternative function 
of -ord as an agent noun suffix, e.g. ors-ord, ‘hunting-maker’, i.e. ‘hunter, fisher’: Ved. 
mantra-krt ‘mantra-maker’ (Olsen 1999: 527-529). 

Verbs 

The Armenian verb has two aspect stems (imperfective and aorist), two verbal voices 
(active and mediopassive), two verbal tenses (present and preterite), three modes (indic- 
ative, optative and imperative), three persons (1,2 and 3) and two numbers (singular and 
plural). Thus, the perfect, the distinction between optative and subjunctive, the injunctive 
as a specific category, and the dual have been lost. 

With a distinction between four vocalic stem types, e-, /-, a- and M-stems (the single 
o-stem gom ‘be, exist’, is based on an old perfect *-h 2 wos-), the synchronic formation of 
the present indicative is entirely regular, even including the verb em ‘be’, which is iden- 
tical with the endings of an e-verb: 


The present indicative 

The present indicative 



e-stems 


/-stems 

a-stems 

M-stems 


em 

berem 

berim 

lam 

helum 


‘I am’ 

‘I carry’ 

‘I am carried’ 

‘I weep’ 

‘I pour’ 

sg. 1 

em 

berem 

berim 

lam 

helum 

2 

es 

beres 

beris 

las 

helus 

3 

e 

here 

beri 

lay 

helu 

pl.l 

emfc 

beremk 

berimt 

lamt 

helumb 

2 

ek? 

beret 

berit 

lavb 

helut 

3 

en 

beren 

berin 

lan 

helun 
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The paradigm seems to have arisen through a compromise between the athematic verb em 
and the thematic inflection with generalization of the thematic vowel *-e-, thus: 

1 sg. *h,es-mi — > em (for *im) => berem 

2 sg. *h,es-si > es => beres (for *bere) 

3 sg. *b h ere-ti > here => e (for *est?) 

1 pi. *b h eromes — > beremk' (analogical stem vowel) => emt 

2 pi. *b h eretes > herein => ek‘ 

3 pi. *h,senti (> *in) — » en\ *b h eronti — > beren (analogical stem vowel) 

Similarly the a- and M-stems. In the mediopassive /-inflection the stem vowel goes back 
to the stative marker *-eh,-. 


The imperfect 

In the imperfect there is no distinction between active and mediopassive, thus berei ‘I was 
carrying’, ‘I was being carried’: 


The imperfect 

Singular Plural 

1 berei bereat 

2 bereir bereilf 

3 berer herein 

Similarly, ei ‘I was’ etc., from a-verbs layi ‘I was weeping’, from M-verbs heiui ‘I was 
pouring’ etc. Here the endings are much less transparent than in the present, and their 
historical background is a matter of debate. The -/- endings of the singular are variously 
derived from either the imperfect *e-h[es- (Jasanoff 1979; Klingenschmitt 1982) or the 
athematic optative (Winter 1975) with reference to English constructions like ‘he would 
carry’. In particular, the 2 sg. ending (-)ir may be a direct continuation of *e-h,es-s under 
the assumption of a regular RUKI development (Olsen 1989a: 9). 


The imperfective (“present”) imperative 

The imperative based on the imperfective stem is only used in the 2 sg. and pi. with the 
particle mi (cf. Ved. ma, Gr. pf|) in the function of a prohibitive. Formally, the 2 sg. is 
characterized by the ending -r, while the 2 pi. is identical with the indicative, e.g. 2 sg. 
berer, 2 pi. berei P. If the enigmatic /"-ending does not go back to the athematic *-d h i as 
suggested by Jasanoff (1979: 146), it may have been transferred from aorist imperatives 
of the type dir ‘put’, tur ‘give’ from the old aorist injunctives *d h eh 1 -s, *deh 3 -s with RUKI 
development of the final *-s (Olsen 1989a: 9). 


The aorist indicative 

The Annenian aorist stems are found in two variants, the so-called root and weak aorist. 
While the root aorist is based on the synchronic root and may thus go back to either an old 
imperfect or an aorist stem, e.g. 3 sg. e-ber ‘(s)he carried’ < impf. pret. *e-b h er-e-t, Ved. 
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a-bhar-a-t vs. e-t ‘(s)he gave’ < aor. *e-deh 3 -t, Ved. a-dat, the weak aorist is characterized 
by a morpheme -c% in e-verbs -ec c -, 3 sg. -eac c . In the aorist the mediopassive is distin- 
guished from the active by a separate set of verbal endings, e.g. beri ‘I carried’, sirec c i ‘I 
loved’ vs. beray ‘I was carried’, sirecfay ‘I was loved’: 


The aorist indicative 
Active 


sg. 1 

beri 

sirec c i 

2 

berer 

sirec c er 

3 

e-ber 

sireac c 

pi. 1 

beraf 

sirec c at 

2 

beref, -if 

sirec c et, -it 

3 

berin 

sirec c in 

Mediopassive 


sg. 1 

beray 

sirec c ay 

2 

berar 

sirec c ar 

3 

beraw 

sirec c aw 

pi. 1 

beraf 

sirec c at 

2 

beravf 

sirec c ayt 

3 

berar) 

sirec c an 


Apart from the endingless 3 sg. going back to *-et and supplied with the augment in what 
would otherwise be a monosyllabic form, the active endings are similar to those of the 
imperfect. The mediopassive is characterized by the vowel -a-, which would be regular 
in the 3 pi. *-nto > -an and the 3 sg. of set roots *-ato > -aw, e.g. cnaw (< *cinaw) ‘begot, 
gave birth’, Gr. eyevexo. 

While neither the consonantal element -c c - nor the segment -ea- (> -e-) of the weak 
aorist is fully understood, there exists a more archaic-looking type where -c c - directly 
follows the root vowel, thus lc c i ‘I filled’, 3 sg. e-lic c (root *pleh r ), bac c i ‘I opened’ (root 
*b h eh 2 -). Traditionally -<?- is derived from *-sk-, but a connection with the s-aorist may 
be more likely, cf. lc c i : Gr. e;iA.r|aa or the type in -ac c i compared to Greek denominative 
5-aorists like exi|.ir|aa ‘I honored’. According to Klingenschmitt (1982: 286ff.), we are 
dealing with a “strengthened” *-ss- > -c c - despite the apparent counterevidence of *hjessi 
> es ‘you are’. At any rate it is remarkable that the source of the -c c -aorist seems to be 
roots and stems ending in a postvocalic laryngeal, so as a variant of Klingenschmitt’s the- 
ory one might perhaps suggest a development *-Hs- > -c c -, similar to *-Ks- > -c c - under 
the assumption of “laryngeal hardening”. 


The aorist imperative 

As opposed to the prohibitive imperfective/“present” imperative, the aorist imperative is 
only used in positive orders. When the aorist is of the “root” type, the corresponding imper- 
ative active 2 sg. is identical to the naked root (diachronic stem), e.g. ber ‘carry’ < *b h ere 
(aor. beri, 3 sg. e-ber), while the 2 pi. vacillates between the endings -ef (< *-etes) and 
-if. The 2 sg. imperative of the weak aorist equals the aorist stem without the aorist marker 
-(?(-), thus 3 sg. aor. ind. sireac c : 2 sg. aor. imp. sirea beside the 2 pi. sirecfef, sirecdf. In 
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the mediopassive the forms of the root aorist are 2 sg. berlberir, 2 pi. beraruk, and those 
of the weak aorist are 2 sg. sireacf, 2 pi. sirec'aruk , where (-a-)ruk possibly derives from 
*(-a-)d h uwe + plural marker -k (cf. Skr. mid. imp. bharadhvam), as suggested by Jasanoff 
(1979: 144f.), if -r- is a (conditioned) continuation of *-d h -, possibly before -u-. 


The subjunctives 

Armenian has two subjunctives, one derived from the imperfective (“present”) stem, the 
other from the aorist stem with the primary function of future. In both cases the subjunc- 
tive marker is -icf- (unaccented -c c -). 

The primary type is the aorist subjunctive with the following inflection: 


The aorist subjunctive 



active 

mediopassive 

sg. 1 

bericf 

berayc c 

2 

berc c es 

bercfis 

3 

berc c e 

bercfi (< *-iy) 

pi. 1 

bercfuk 

bercfuk 

2 

berjik 

berjik 

3 

bercfen 

bercfin 


When the subjunctive marker is added to a weak aorist stem, *-c c -c c - is dissimilated to 
-s-c c - and *-e c -J- to -s-j-, e.g. from sirem ‘love’: sirec°ic c , siresc c es, siresc c e, sirescfuk, 
siresjik, sirescfen. The ending -uk of the 1 pi. may derive from *-omes, the only case 
apart from the participial -un (< *-ont-/-om3,no-) where the o-alternant of the thematic 
vowel is preserved, while -jik probably represents an old optative *-yeh,-tes as suggested 
by Klingenschmitt (1982: 40). 

In the imperfective (“present”) subjunctive, -ic c - is simply added to the stem vowel 
and followed by the present endings, thus berem: *bere-ic c -em > *berec c em > beric c em 
etc.; berim : bericHm ; lam : laycfem; and helum : helukum with secondary adaptation to 
the M-stem inflection. 

Traditionally the element -cf- is derived from the Indo-European suffix *-ske/o- (e.g. 
Schmitt 1981: 143) with various explanations of the preceding whether as a match of 
Gr. -ioK(B in verbs like eupioKco ‘find’ or a combination of the optative morpheme + *-sk-. 
Alternatively, Liihr (1994) assumes a reinterpretation of the indicative haycfem ‘ask (for)’ 
as a subjunctive. A different scenario is proposed by Olsen (1988): if the 3 sg. thematic 
subjunctive *-e-ti, the unmarked form of the paradigm, was reinterpreted as a stem and 
supplied with a new set of personal endings we would end up with the attested paradigm of 
the aorist subjunctive: *b h erety-o > beric c , *b h erety-essi > bercfes, *b h erety-eti > bercfe, etc. 
The pattern would be similar to that of, e.g.. Modem Pers. hast ‘he is’ => hastam ‘I am’ etc. 


Verbal stem formation 

To some extent the vocalic stem classes reflect inherited types: 

e-verbs with corresponding mediopassives in -i- < stative *-eh,-, e.g.: 
simple thematic verbs: berem ‘carry’, Ved. bharati 

denominatives in *-eye-: srbem ‘cleanse’ from surb ‘pure’ 
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y-verbs : 
*-ske-verbs: 

inchoative statives: 

old perfects: 


mrmnjem ‘mumble’ (-«/- < *-ny-) 

cana£ c em (assimilated from *canac c em) ‘know’ < *gnh,- 
ske-, Gr. yiyvcboKco 

hangcdm ‘rest’ < *sm-k w i(e)h 1 -ske-, Lat. conquiesco 
(Klingenschmitt 1982: 78f.) 
gitem ‘know’ < *woyd-e- 


a-verbs, e.g.: 
root-verbs: 
reduplicated verbs: 


bam ‘speak’ < *b h eh,-, Lat .fan 

tarn ‘give’ < * di-da,-, Gr. SiScopt, 1 pi. SlSopsv 


More complex formations include variations of nasal presents, e.g. It'anem ‘leave’, Ved. 
rinakti, Lat. linquo, Gr. Lquravco; Ved. sparnam ‘threaten’, Lat. sperno; z-genum ‘dress’ 
< *-wes-nu-, Gr. ewupt; and the productive type of denominatives in -anam, e.g. cera- 
nam ‘grow old’. A particularly important subtype is the causatives in -uc c anem, aor. -uc'i 
(3 sg. -oyc c ), e.g. usanim (aor. usav) ‘learn’, cf. Lith. jirnkti ‘get used to’, vs. causative 
usucfanem ‘teach’ . In the paradigmatic relation between imperfective and aorist stems we 
find traces of the inherited state of affairs as when a nasal present is matched by a “root” 
aorist, e.g. bekanem ‘break’, Ved. bhanakti : aor. beki, 3 sg. e-bek. 


Non-finite verb forms 

The Armenian system of infinitives and participles (monographically treated by Stempel 
1983) consists of an o-stem past participle in -eal, mostly based on the aorist stem, e.g. 
bereal ‘having carried’, cf. OCS nesh jesmb ‘I have carried’, a voice-indifferent o-stem 
infinitive in -l < *-Vlo-, e.g. berel ‘carry/be carried’ (from berem/berim ), lal ‘weep’, 
helul ‘pour’, which is the basis of an indeclinable future participle in -eloc c , e.g. bereloc c 
‘to be carried’ and an ea-stem gerund in -eli, e.g. sireli ‘lovable, beloved’, formally and 
functionally matching Tocharian constructions like Toch. B ma vokalle ‘there is not to 
be drunk, one should not drink’. To these one may add the mostly substantivized verbal 
adjectives in -un, e.g. sirun ‘beloved’ and ‘loving’, probably a merger of the active *-ont- 
and the mediopassive *-om3,no-, and the agent nouns/active participles in -oll-awl, e.g. 
cnawl ‘parent’ (see under “Nominal Word Formation”). As the suffix of the old passive 
participle in *-to- lost its phonetic transparency and was thus unsuitable as a marker of 
a whole morphological category, it left only isolated traces in lexicalizations such as 
mat'd (o-st.) ‘man’ < *mrto- (isolated from the privative *n-mrto- ‘immortal’), li ‘full’ 
< *pleh r to- and lu ‘heard (of)’, i.e. ‘well-known’ < *klu-to-. 


NOMINAL WORD FORMATION 

Due to the dramatic phonetic changes, including loss of the vowels in final syllables, 
many inherited noun suffixes are rendered opaque. Thus, e.g., the Indo-European suffix 
*-ti-, used for the formation of verbal abstracts, is reflected as -f(i)- after M-diphthongs, 
e.g. alawf-k * (z'-st.) ‘prayer’ < *slh,-ti-, and as -d(i)- after sonants, e.g. bard (z'-st.) ‘pile’, 
Ved. bhrti- ‘support’, Av. boroti-, and is lost intervocalically in, e.g., lir (z'-st.) ‘fullness’ 
< *pleh r ti-s, Ved. prati-. As a countermeasure against this development, several suffixes 
are occasionally linked together to create more characteristic and easily recognizable suf- 
fixal chains, e.g. the extremely productive abstract suffix -ufiwn < *-e(h 1 )u-ti-h 3 on(h 2 )- 
(cf. Lat. -tio, -tionis), which may be derived from verbal, adjectival and nominal stems 
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alike, e.g. spananem ‘kill’ — » spanut c iwn ‘killing’, cer ‘old’ — » cerutHwn ‘oldness’, 
elbavr ‘brother’ — > elbayrut c iwn ‘brotherhood’. 

Examples of inherited suffixes and suffix clusters based on inherited elements are: 

-ac c i: adjectives denoting descent or appurtenance, e.g. galileacfi ‘Galilean’, k'alak- 
c ac c i ‘citizen’ < *-skyo/ah 2 -, cf. Goth, judawisks ‘Jewish’ 

-awl (and -ol): agent nouns/active participles, e.g. cnawl ‘parent’, probably < *geno r 
tlo-, cf. Gr. ysveicop, Lat. genitor; the connecting vowel -a- may originate in 
set roots like *gen3,-, or in derivatives based on denominative a-verbs, cf. Lat. 
imperari => imperator, or simply derive from Iranian bases such as nmanawl ‘sim- 
ilar’ from *nimana- 

-eair. adjective suffix, typically used in patronymics of the type Abrahamean ; accord- 
ing to Olsen (1992 and 1999: 385-391) from IE *-i-h 3 no- = Lat. -Tnus etc. 
adjective and noun suffix, e.g. asui ‘woollen’ from asu ‘wool’, kogi ‘butter’ = Ved. 
gavya-, Av. gaoiia-, Gr. -(3oio<; ‘cow-’, Toch. B kewive ‘butter’; IE *-(i)yo/ah 2 - 
-ic c -\ agent nouns, e.g. ergic c ‘singer’; IE *-ikyah 2 -, cf. Slavic derivatives in -acb and 
-icb such as OCS kovacb ‘blacksmith’, kotoricb ‘quarrelsome’ 

-oyt c : abstract nouns derived from verbal or nominal stems, e.g. erewoyt c ‘appearance’ 
from erewim ‘appear’ (Gr. npenai); identical with the Greek suffix -euaiq and the 
basis of the suffix conglomerate -ufiwn 
-urmr. action nouns, e.g. gelumn ‘rolling’, cf. Gr. eUmpa, Lat. uolumen 
- un : verbal adjectives, originally participial, but mostly substantivized, e.g. zerun 
‘creeping’, i.e. ‘snake’; anasun ‘not talking’, i.e. ‘animal’. Since Meillet (1928: 
2), traditionally interpreted as the middle participle corresponding to Gr. -pevoq, 
but, as pointed out by Stempel (1983: 81-83), the function is often active, and the 
active participle in *-ont- would give the same result. Most likely, we are dealing 
with a merger of the two forms. 

Numerous other common suffixes or original final members of compounds are borrowed 
from Iranian (cf. Greppin 1973, Jahowkyan 1993 and Olsen 1999). Examples of Iranian 
loan suffixes and suffix clusters are: 

-agovir. adjectives denoting manner and appearance; intensive and comparative suffix, 
from MIran. -gaona-, e.g. vardagoyn ‘rosy’, Sogd. wrdywn 
-ak\ diminutives, e.g. naw-ak ‘small ship’ 

-akair. adjective suffix, e.g. giser-akan ‘nightly’ 

- apet : nouns with the meaning ‘master of . . . ’, from Iran. *-pati-, e.g. talat'apet 
‘ruler of a city’ 

-astair. local nouns, e.g. herastan ‘a place far away’ from heri ‘distant’ 

- aran : nouns denoting holders or containers, from Iran. *-8ana/i-, e.g. ganjaran 
‘treasury’ 

- ik : adjectives of appurtenance and diminutives, e.g. hayrik ‘little, dear father’ 

- pes : adverbs of manner, e.g. avspes ‘in this way’, cf. Av. -paesa(h)- 

Nominal compounds 

Most categories of compounds as known from other Indo-European languages such as 
Indo-Iranian, Greek and Germanic are maintained as living, productive types in Armenian, 
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where only prepositional compounds and compound verbs are rarely found. In many 
cases we are dealing with simple loans from an Iranian source or calques/semi-calques 
from either Iranian or Greek, e.g. Gr. ppipoiroA-iq — » mayrak'alak' ‘mother city’, but there 
is ample evidence for indigenous creations such as tn-ank ‘poor’ (‘(whose) house (tun) 
(has) fallen’). The following examples will serve to illustrate the most common subtypes: 

Dependent determinatives: ter ‘master’ (ti- < *dems- + ayr ‘man’, cf. Gr. Seanoiqi;, 
Ved. dam-pati-; tikin ‘lady, mistress’ (ti- + kin ‘woman’); skesrayr ‘father-in-law’ 
( skesur ‘mother-in-law’ + ayr ‘man, husband’) 

Possessive compounds (bahuvrlhis): jernunayn ‘empty-handed’ (lit. ‘hand-empty’); 

c‘arakn ‘envious’, lit. ‘evil-eye’, a caique from MIran., cf. MPers. duscasm 
Verbal governing compounds, active or passive meaning: dimadarj ‘opponent’, lit. 
‘face -turning’; amawt c alic c ‘shameful’. 

SYNTAX 

In general, Classical Armenian syntax is similar to that of other ancient Indo-European 
languages with preservation of a rich case system. The word order is relatively free, with 
SVO as the unmarked pattern. In translations from Greek originals it has to a large extent 
been possible to maintain the Greek word order. A few characteristic points deserve to 
be mentioned: 

- the concordance between adjectives and substantives in case and number is not 
always observed. Most often a preceding adjective is left uninflected, e.g. bazum 
awurld but awurt bazumt ‘many days’ (Meillet 1936: 137). 

- the definite object is marked by a preceding z-, reminiscent of the Middle Parthian 
use of o < *h,mb h i in connection with both the indirect and the direct object. 

- a peculiar construction, the “transitive perfect” is found in connection with the 
participle in -eal, where the agent is in the genitive and the object in the accusative, 
including the marker z-, e.g. nora gorceal e z-gorc ‘he has done the work’. 

FURTHER READING 

The literature on Armenian historical grammar in English is rather scarce, but for a gen- 
eral introduction one may refer to Godel’s sketch (1970) and his concise monograph 
(1975). A good historical introduction to the classical language in German, including text 
samples, is Schmitt 1981, and for a contemporary French introduction de Lamberterie 
1988-89 is recommended. Otherwise, the brief, but wonderfully clear classic grammar 
by Meillet (1936) remains an unsurpassed treasure trove of information. 

A comprehensive etymological dictionary is still a desideratum. Evidently Hiibsch- 
mann’s pioneering work (1897) is severely outdated, and Acaryan’s four-volume dic- 
tionary in Armenian is practically inaccessible to the beginner. To some extent the need 
has been satisfied by Martirosyan’s recent and in many ways excellent work (2010), but 
unfortunately only part of the inherited vocabulary is covered. As a supplement one may 
consult the lexical study by Sofia (1960). 

Monographic studies on specific subjects include Klingenschmitt 1982 on the verb, 
Olsen 1999 on the noun and Matzinger 2005 on nominal inflection. Holst 2009 is a highly 
idiosyncratic treatment of selected phonological and morphological problems. 
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The important early articles on Armenian by Pedersen (1982) and Meillet (1977) have 
now been made more easily accessible in collected volumes, and the same goes for Kort- 
landt’s very original work on the subject (2003). 
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PART 10 


TOCHARIAN 

Douglas Q. Adams 


INTRODUCTION 

The speakers of the Tocharian languages enter history in the first century BC, when the 
Han Empire sent military expeditions into the Tarim Basin (what is now the southern 
half of the Chinese province of Xinjiang). Our linguistic records date only to the late 
fourth century AD in the case of Tocharian B and to the seventh century AD in the case 
of Tocharian A. The latest documents in the two languages are probably no later than 
the ninth century AD (the last dated Tocharian B texts are from the final decade of the 
eighth century AD). The age of our texts, somewhat older than has often been thought 
in the past, is confirmed by both paleography and radio-carbon dating. It is also sig- 
nificant, with respect at least to Tocharian B, that our texts cover a half millennium of 
time, time sufficient for us to see significant change. When it was thought that all the 
attested texts from Tocharian B were more or less contemporary, within a century or so 
of one another, the linguistic differences among the texts were reasonably enough taken 
as the result of there being regional dialects (Winter 1955). However, now it is quite 
certain that at least most of the differences are the result of the documents’ different 
ages (Peyrot 2008). 

Tocharian B was the native language of the Kingdom of Kucha (Tocharian B Kusi, 
Uyghur Kucha, Sanskrit Kuci, Chinese Qiuci), and Tocharian A that of the Kingdom 
of Agni (Uyghur Qarashehr, Sanskrit Agni, Chinese Yanqi) to the east of Kucha. Both 
languages were also used, at least as liturgical languages and languages of learning, in 
the Turpan Basin to the northeast. Almost all of the Tocharian A texts and the majority 
of those written in Tocharian B are of Buddhist content. But there are non-religious texts 
as well: medical texts, economic texts of one sort or another (inventories, bills of sale, 
etc.), travel documents/passports, and graffiti. Rare are original literary works (though 
by far the longest text in Tocharian A is a dramatic narrative, the Maitreyasamiti, based 
on Indian Buddhist models but composed in Tocharian A). Missing entirely in both lan- 
guages are historical, mythological, or legal texts. 

The languages were called “Tocharian” by early investigators because of the presumed 
connection of the people who spoke them and the classical Tokharoi who lived in what 
is now Uzbekistan but were known to have been driven there by the Huns from Gansu in 
China (east of Xinjiang, between Tibet to the southwest and Mongolia to the northeast). 
Though often doubted, the connection is probably correct; we may note, for instance, that 
their Turkish-speaking neighbors referred to the speakers of Tocharian A as the Tuyr'i. If 
the connection is true, we can see that, at the very outset of their entering into Central 
Asian history, the Tocharian and Tokharoi would have extended from the western part of 
the Tarim Basin to the outer marches of the then Chinese Empire (see Adams 2000). We 
are not certain what the native speakers called the two languages, hence the neutral, but 
banal, designations “Tocharian A” and “Tocharian B.” 
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MAP 10.1 KUCHA (= KUCI) AND ITS DEPENDENCIES IN HAN TIMES 

To the west of Kucha, in the vassal Kingdom of Gumo (this is the Chinese name; in 
Sanskrit records it is Bharuka), was spoken a language but little different from that of 
Kucha according to early Chinese observers. And to the southeast of the Tarim Basin, in 
the Kingdom of Kroraina (Chinese Loulan), was apparently spoken a related language 
(“Tocharian C” if you will) known only through some loanwords and loan morphemes 
(e.g., Kroraina Prakrit -mci, an adjective suffix with the meaning ‘pertaining to whatever 
noun it’s derived from’ and Tocharian A -nci ‘id.’) from it into the official Indian lan- 
guage (“Kroraina Prakrit”) of the state (Mallory & Mair 2000: 278-279). To the south 
and southwest (beyond Gumo) in the Tarim Basin were Iranian languages, specifically 
two Saka Iranian languages, Tumshuqese and Khotanese. Tumshuqese was well within 
the cultural and political sphere of Kucha (Tumshuq’s “saka-raja” was a vassal of the 
Kuchean king), while Khotanese was the language of a powerful state in the southern part 
of the Tarim Basin. The linguistic situation of the Tarim Basin appears to have been quite 
stable during the first Christian millennium until the last century or so thereof when all of 
these languages were replaced by the Turkish Uyghur, originally spoken northeastward 
of the Tarim Basin in what is now western Mongolia. 


Characteristics of the Tocharian languages and Indo-European languages 

Though quite “advanced” phonologically when compared to contemporary Indo- 
European languages (and Tocharian A is more “advanced” than Tocharian B), the 
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Tocharian languages are otherwise very much “age-appropriate” Indo-European lan- 
guages. They preserve both nominal and verbal dual forms, and thus, for the middle of 
the first Christian millennium, are in this matched in conservatism only by Slavic and 
Baltic (admittedly we have no idea about pre -Albanian in this time period). They have 
a large array of nominal cases (again matched during the period of Tocharian attes- 
tation only by Slavic and Baltic), though in several cases the exponents of particular 
case functions are clearly innovative (i.e., the “secondary” cases, on which see below). 
Among nominal categories, only the effacement of the neuter (so that it is identical with 
the masculine in the singular and with the feminine in the plural) is a clear structural 
innovation, and its “retreat” is a development matched by contemporary Romance and 
Celtic languages. 

The verbal system is also “old-fashioned,” though of the Proto-Indo-European triad 
of present, aorist, and perfect, it has lost the finite forms of the perfect, preserving only 
its participle (combined with remnants of the aorist participle into a general “preterit” 
participle”). Of contemporary Indo-European verbal systems, it reminds one most of 
the Greek system and, to a somewhat smaller degree, the Romance system. Its core 
vocabulary is pretty much composed of straightforward inheritances from Proto-Indo- 
European (cf. the words for ‘father’, ‘mother’, ‘brother’, sister’, which are (in Tocharian 
B/Tocharian A order) pacerlpacar, macerlmacar, procer/pracer, ser/sar, which reflect 
pretty regularly PIE *ph,ter, *meh,ter, b h reh 2 ter, *swesor). However, that core vocab- 
ulary is supplemented by a large number of borrowings from Sanskrit for Buddhist 
concepts and ideas. 

What makes the Tocharian languages seem so far from the Indo-European mainstream 
is their pronunciation, their “innovative phonology.” Between the breakup of Proto-Indo- 
European and our first attestation of Tocharian, there were some major sound shifts. The 
PIE plain velars, (e.g., *k) had fallen together with the palatal velars (*k); Tocharian 
shares this development with all the Indo-European groups of the west (Germanic, Italic, 
Celtic, Greek). Particular to Tocharian was the merger of the three series of PIE stops: 
voiceless, voiced, and voiced aspirate. Thus, for instance, the bilabials, *p, *b, *b h , had 
all merged into a single Proto-Tocharian voiceless stop (e.g., *p). By these two changes 
the fifteen PIE stops had become only four (*p,*t, *k, *k w ) in Proto-Tocharian. On the 
other hand, pre- or Proto-Tocharian underwent a very thorough-going shift from unpala- 
talized consonants before front vowels to palatalized, so that *t, *k, and *k' v appeared in 
Proto-Tocharian as *c, *c, and (also) *c in that environment (in attested Tocharian, both 
A and B, Proto-Tocharian *c had become s). Similarly, *s, *n, and *1 were palatalized to 
*s, *n, and (written as <ly>). Exceptionally, dentals had become *ts when before *y 
and PIE /d/ had disappeared (via 67) or become affricated (ultimately to become <ts>) 
under certain circumstances difficult to specify. The PIE vowels also underwent major 
changes that are perhaps most easily grasped in list form; see Table 10.1 (the closing 
apostrophe, ’, indicates that the preceding consonant had been palatalized). 

Taken together, these changes, to consonants and vowels, may make the PIE anteced- 
ents of a Tocharian form momentarily (or not so momentarily) opaque, e.g., PIE 
*d h ewk- ‘hide’ > Toch. AB cuk-, PIE *h,ekwos ‘horse’ > Toch. B vakwe [A yuk], PIE 
*widwo- ‘wise’ > Toch. B uwe, PIE *pode ‘(two) feet’ > Toch. B pai(ne) [A pe(m)]. The 
merger of the three orders of PIE stops (voiceless, voiced, and voiced aspirated) makes 
Tocharian etymologies more difficult to do and feel certain about than those in other 
major Indo-European groups because, say, a Tocharian verbal root tak- might reflect a 
PIE initial *t-, *d-, or *d h - and a final *-k-, *-k-, *-g-, *-g-, *-g h -, or *-g h -. 
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TABLE 10.1 THE DEVELOPMENT OF PIE VOWELS IN TOCHARIAN 

PIE Toch. B/Toch. A 


*e 

* 

*e 

*a 

*a 

*o 

*5 

*u 

*u 

*ey 

*oy 

*ew 

*ow 

*ay 

*aw 


’a I’d (but a in the neighborhood of PIE *s or *w; ’i/’i when part of a living ablaut series) 

’i/’i 
’a I’d 

a/a (automatic "reduced” vowel inserted to break up difficult consonant clusters) 

’e/’a 

aid 


aid (*H indicates any vocalized laryngeal, vocalized only when between two obstruents) 
aid 

e/a (but a when unstressed before word-final *m, *n, *y, *w, and *r) 
ala 

ala (later than the change of *i/e to ’a; u/u when part of a living ablaut series) 
o/u 


’i/’i 


ei (later ai)/e 
’u I’u 


eu (later au)/o 

ai/e 

au lo 


Place within Indo-European 

It is universally assumed that the pre -Anatolians were the first Indo-European subgroup 
to separate themselves from the rest, and it is widely assumed that the pre-Tocharians 
were the second subgroup to separate themselves. Both assumptions are probably correct. 
But the gap between the time of the pre- Anatolians’ separation and that of the pre-Toch- 
arians’ was much longer than the gap between the pre-Tocharians’ going their own way 
and the time of the more general breakup of the remaining PIE speech group. Just one 
example of the difference in the relationship of Anatolian and Tocharian with the rest of 
the Indo-European speech community is the complete lack of both subjunctive and opta- 
tive in Anatolian but the presence of both in Tocharian, albeit in a clearly less developed 
form (cf. Verbs infra) than shown by other Indo-European groups. 

While the Proto-Indo-European that pre -Tocharian separated from cannot have 
been completely uniform, Tocharian does not show a close resemblance to any other 
Indo-European branch. Thus, the exact location where, dialectologically speaking, that 
separation took place is not easy to determine. However, the thoroughness with which 
Tocharian nouns and adjectives of whatever declension have become mixed with /7-stem 
forms strongly suggests that at some point Tocharian had developed productive //-stem 
by-forms to other declensional types and that these //-stem forms had a definitizing value 
that contrasted with a non-definite value of the other kind of declensions. This develop- 
ment is seen in an even fuller form in Germanic (so also apparently in the history of Alba- 
nian- Matzinger 2006: 200). So also do some very specific lexical comparisons point in 
the direction of “northeastern Indo-European” (i.e., Germanic and Balto-Slavic) such as 
Germanic *deug- ‘hide’ (cf. Old English deogol ‘secret, hidden’) and Toch. tuk- ‘hide’, 
both from a latish PIE *d h ewk-, a metathesized variant of the more widely distributed 
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*kewd h - (cf. Germanic *hud- ‘hide’, Greek Ksu&e/o- ‘hide’, and Iranian cognates), or 
Toch. B proksa (pi.) ‘millet’ beside Slavic proso (sg.) ‘millet’ (< PIE *prokso/eh 2 -). The 
Toch. B ablative ending -mem, immediately from PIE *-mons, points in the same direc- 
tion, in that it shows an affinity with the plural oblique cases in -m- in Germanic and Balto- 
Slavic (compare particularly Old Prussian -mans; note that the *-ns of Proto-Tocharian 
and Old Prussian are independent innovations on a more original *-s) as opposed to the 
*-b h - that characterizes these cases in other Indo-European languages (such as Italic, 
Celtic, Greek, Indo-Iranian, and perhaps Albanian). Even more specific is the preser- 
vation of the reduplicating syllable, PIE *Ce- (where C = the reduplicated word-initial 
consonant), that is the mark of the PIE perfect as such, with a morphologically restricted 
retention of Ce- in both Gothic and Tocharian and not as **Ci- (in Germanic) and **C’a- 
(in Tocharian) as the sound-laws of Germanic and Tocharian would otherwise require. 

The dating of the breakup of that Proto-Indo-European unity that remained after the 
pre-Anatolians had departed is very much a matter of discussion. However, a date of 
4,000 BC (± 500 years or so) is likely. As stated above, the pre-Tocharians probably left 
relatively early in the dissolution of the Proto-Indo-European speech community. How 
they got from the northern rim of the Proto-Indo-European-speaking world, adjacent to 
the pre-Slavs and pre -Germanics, and presumably somewhere on the Russo-Ukrainian 
steppe, to the Tarim Basin remains archeologically opaque. They are often thought to 
have been members of the Afanasevo culture of south central Siberia (so, for instance, 
Mallory & Mair 2000). That is not impossible. But any archeological traces of an exten- 
sion of Afanasevo southward towards the Tarim Basin are faint. The timing would be 
problematic as well. The Afanasevo culture begins about 3,500 BC (lasting until about 
2,500 BC), and if the pre-Tocharians were its bearers, there would have been precious 
little time for any interaction with early Indo-Iranian as described in the next paragraph. 
More importantly, perhaps, there are no traces of cereal agriculture associated with the 
Afanasevo culture, and cereal agriculture clearly had a continuous history among Toch- 
arian speakers, from attested Tocharian back to the Proto-Indo-European homeland (wit- 
ness proksa [pi.] ‘millet’, Proto-Slavic *proso ‘millet’). Another possibility is that they 
were somehow participants in the Andronovo culture (or cultures) of northern Kazakh- 
stan and adjacent southern Siberia (in various forms dating from 2,000-900 BC). Among 
the Andronovos some agriculture was always present. The graves of the earliest culture 
found in the Tarim Basin, the Qawrighul culture (beginning about 1,800 BC; presumably 
the culture of the pre- or Proto-Tocharians), unhelpfully show some similarities with both 
their Afanasevo and Andronovo counterparts (Mallory & Adams 1997, s.vv. Afanasevo, 
Andronovo, Qawrighul; Mallory & Mair 2000). 

The pre-Tocharians and the pre-Indo-Iranians were all on the eastern frontier of the 
Proto-Indo-European world at one time or another. They would certainly have run into 
one another if the pre-Tocharians had spent an appreciable amount of time as members of 
the Andronovo culture. It is worth noting that there is no particular evidence for any early, 
i.e., PIE-era, association of Tocharian with the other eastern Indo-European languages, 
the Indo-Iranian languages. Certainly, as mentioned already above, Tocharian is a clear 
centum- language (i.e., the PIE palatal velars, *k, *g, and g h , remain unpalatalized and 
fall together with PIE dorsovelars *k, *g, and *g h ), just as Indo-Iranian languages are 
clearly satem-languages (where PIE *k, *g, and g h did undergo palatalization and remain 
distinct from the dorsovelars *k, *g, and *g h ). There is only one lexical item clearly and 
certainly shared by just Tocharian, Iranian, and Indie: B masce ‘fist’, Avestan musti- ‘id.’, 
Sanskrit musti- ‘id.’ All go back unproblematically to PIE *mustey-. A second example 
is far less certain (because of the inexactness of the semantics): B welke (part of a plant), 
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Avestan varaka- ‘leaf’, Sanskrit valka- ‘bark’. Any random Indo-European threesome 
might, quite accidentally as it were, share exclusive ownership of two words. 

Interestingly enough, however, given the lack of shared items of certain Proto-Indo- 
European age, there is a small set of really quite ancient borrowings from Iranian to Toch- 
arian and, in one case, from Indie to Tocharian. Examples (all from Tocharian B) are tsain 
(pi. tsainwa) ‘arrow’ (< *Proto-Iranian *dzainu-, cf. Avestan zaena- armor, weapon’, 
Armenian zen (a u-stem) ‘id.’, also borrowed from Iranian), tsainne ‘ornament’ (< Proto- 
Iranian *dzai- ‘to ornament’), waipecce ‘possession’ (< Proto-Iranian *hwaipatya-, cf. 
Avestan hvaepaiOva- ‘ones’ own’), sent ‘axle’ (< Proto-Iranian *axsam, cf. Gathic Ave- 
stan asa-), iscem ‘brick, tile’ (i.e., ‘fired clay’) (< Proto-Iranian *istyam, cf. Late Ave- 
stan istiia- [n.]), eksinek ‘dove’ (< Proto-Iranian *aksinaka-, cf. Ossetic cexsinceg), and 
*wastare ‘camel’, unattested but cf. wdstarye ‘pertaining to a camel’ (< Proto-Iranian 
*ustra- ‘camel’, cf. Avestan ustra-). These borrowings are significant because they show 
an early stage of Iranian phonological development: preservation of the affricate *-dz- (in 
attested Iranian always *-z-), and of the cluster *-ty- (rather than later *-0y-); word-final 
stress; or even, once, a fonn that had not undergone the RUKI rule, i.e., *ustra-, not 
*ustra-. The Iranian we see through the window of these early Tocharian borrowings 
noticeably antedates the Avestan of Zarathustra. Since Zarathustra is commonly, though 
not universally, dated to about 1,000 BC, that puts these early Iranian-Tocharian contacts 
well into the second millennium BC. Whether these contacts took place inside the Tarim 
Basin or outside cannot be said. 

The singular example of a demonstrably early borrowing from Indie is kercapo ‘ass’ 
(= Sanskrit gardabha- ‘id.’ < pre-Indo-Aryan *gordeb h 6-). It is particularly significant, 
however, because that borrowing must have taken place before the merger of PIE *e, *a, 
and *o in Indo-Aryan and so even earlier than the early borrowings from Iranian where 
that merger had already taken place. This merger had already been accomplished by 1500 
BC as seen in Indo-Aryan loanwords borrowed into Mitanni in eastern Asia Minor. In 
this case we can be certain the borrowing must have occurred before the pre-Tocharians 
entered into the Tarim Basin or the pre-Indo-Aryans left Central Asia for India (the latter 
event being, on most accounts, sometime between 2,000 BC and 1,500 BC). 

Out of all these considerations a picture, both fuzzy and speculative, begins to emerge 
of the pre-Tocharians leaving the northern frontier of the Proto-Indo-European world and 
moving east about 4,500 BC or so to the southern Urals-North Kazakhstan steppes. There 
they had limited contact with both Proto-Indo-Aryans (first) and Proto-Iranians (second), 
already dialectally distinguished from one another, in the period, say, 2,500-2,000 BC. 
After that contact the Proto-Tocharians moved further east, into the Tarim Basin, while 
the Proto-Indo-Aryans and Proto-Iranians, in that chronological order, moved further and 
further south (with one arm of the Proto-Iranians, the Khotanese and Tumshuqese, also 
eventually entering the Tarim Basin and settling the southwestern portion of it). Such 
a scenario would allow the pre-Tocharians to be in the Tarim Basin as the bearers of 
the Qawrighul culture of the eastern Tarim Basin (beginning about 1,800 BC), a culture 
known almost exclusively from its burial sites and spectacular mummies. The area occu- 
pied by the Qarighul culture was later the nucleus of the historical Kingdom of Kroraina 
(Chinese Loulan), where we find the remnants of “Tocharian C.” 

Contact with Iranian was maintained, or renewed (and now clearly with the Tocharians 
in the Tarim Basin), in the middle of the first millennium BC or thereafter. We find such 
borrowings with later Iranian phonology as newiya ‘(a kind of) canal’ (< Proto-Iranian 
*nawiya- ‘navigable’), murye ‘(akindof) canal’, artte ‘(feeder) canal \peri ‘debt’ (< Proto- 
Iranian *parya-), sate ‘rich’ (< Proto-Iranian *cyata-), which suggest a very different 
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cultural level of inter-language contact. Somewhat later, the advent of Buddhism in the 
Tarim Basin brought with it a host of religious/cultural terminology, either as borrowings 
or as caiques, from Sanskrit and, to a lesser extent, from Iranian. 


PHONOLOGY 

The two Tocharian languages have exactly the same phonemic inventories. 


TABLE 10.2 TOCHARIAN CONSONANTS 



Bilabial 

Apico- 

Dental 

Apico- 

Alveolar 

Lamino- 

Palatal 

Dorso-Velars 

stops, affricates 

P 

t 

ts 

c 

k 

continuants 



s 

s 


nasal stops 

m 

n 


n 


laterals 


l 


6 <ly> 


trill 


r 




approximants 

(w) 



y 

w 


TABLE 10.3 TOCHARIAN VOWELS 


The symbol <a> represents something like IPA [i] and <a> something like IPA [a]. 

Stress: It is possible that both Tocharian languages have contrastive stress, but only 
Tocharian B shows it in ways visible to us, as described below (under the heading 
“Morphophonology”). This contrastive stress may result in minimal pairs, e.g., takam 
‘we will be’ (< /takam/ and takam ‘we were’ (< /takam/)). 

Writing System: Though neither Tocharian language has been spoken for a thousand 
years or so, we can be quite confident that our knowledge of the phonology of both 
languages is accurate and substantially complete. That is so because they are written in 
a form of Brahmi, the Indian alphabet whose variants have been used to write various 
Indian and Indo-Chinese languages since the last centuries before the Christian era. The 
phonetic values of the signs of the alphabet are well known. Unique to the Tocharian 
variant of this writing system are that so-called Fremdzeichen (‘foreign signs’) created 
by those who adapted Brahmi to Tocharian to signal the vowel /a/, unknown to Sanskrit 
and other Indie languages. 

Morphophonology: In Tocharian there are three major, productive, morphophono- 
logical processes, which shape many Tocharian paradigms: palatalization, umlaut, and 
ablaut. In addition, there are widespread, purely phonological, processes that account for 
the distribution of [a], [a], and [a] in Tocharian A and Tocharian B. We will take up these 
latter processes first. 

Distribution of [a], [a], and [a] in Tocharian B: In Classical and later Tocharian B 
the central vowel /a/ is strengthened in stressed syllables to [a]. In unstressed closed 
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syllables it remains as [a], while in unstressed open syllables it disappeared completely 
(e.g., /stamo-/ ‘having stood’ > stmo-). Similarly, in stressed syllables /i / and Ini may be 
reflected in the spelling as <i> and <u> respectively (i.e., the long vowel signs are used to 
show stressed syllables). On the other hand, in stressed position /a/ remains unchanged, 
but, conversely, in an unstressed syllable /a/ appears as [a]. Both the strengthening of /a/ 
(and /i / and /u/) and the “destrengthening” of /a/ are shown orthographically in Classical 
and Late Tocharian B. To give but one example: in Archaic Tocharian B the third person 
singular of ‘(s)he released’ would have been carka, but in Classical and Late Tocharian 
B it was carka. It is from the strengthening of /a/, the destrengthening of la.!, and the loss 
of unstressed /a/ in open syllables that we derive the majority of our information about 
Tocharian B stress, since stress itself is never marked orthographically. 

Distribution of [a], [a], and [A] in Tocharian A: Superficially resembling in some 
ways the interchange of [a], [a], and [a] in Tocharian B is a similar interchange of the sur- 
face representatives of /a/ in Tocharian A. Though considerably disturbed by analogical 
restorations of one sort or another, the basic historical pattern is like this: if, in a word of 
two syllables or more, the first syllable contained /a/, /e/, or lol and the second syllable 
contained /a/, then the second syllable’s /a/ shows up on the surface (1) as [a] if the second 
syllable is word final and closed, (2) as [a] if the second syllable is not the final syllable 
of the word but is a closed syllable, and (3) as zero if the second syllable is not the final 
syllable of the word and is an open syllable. 


Condition 1: 

kotnas ‘(s)he cuts off’ from Ikotnasl (cf. Toch. B kautanam < PToch. *kautna-, or, 
within Tocharian A when the first syllable of the word is “light,” A karsnas ‘(s)he 
knows’) 

kropat ‘(s)he gathered’ (cf. Toch. B kraupate < PToch. *kraupate or, within Tocharian 
A, kalpat ‘(s)he achieved’) 
samam ‘monk’ (B samane < PToch. *samane) 
aknats ‘foolish’ (B aknatsa < PToch. *aknatsa) 


Condition 2: 

tapakyan ‘mirrors’ (cf. tapaki [< *tapaky] ‘mirror’) 
kakarpu ‘having descended’ (B kakarpau < PToch. *kakarpa-) 


Condition 3: 

bvpte ‘thou has gathered’ (cf. B kraupatai < PToch. *kraupatai, or, with Tocharian A, 
kalpate ‘thou hast achieved’) 

kroplune ‘gathering’ (cf. B kraupalne or, within Tocharian A, kalpalune ‘achievement’) 
kaktnu ‘having brought’ (cf. B kakamau < PToch. *kakama-) 

In addition, as in Tocharian B, /a/disappears in open syllables, even from those syllables 
that, on the basis of Tocharian B evidence, were stressed. 

Palatalization: Palatalization (as already outlined above) developed when a suscepti- 
ble root-initial consonant or consonant cluster preceded a PIE *-e- or *-i-, whether short 
or long (short *-i- does not palatalize if it is further in the environment of *-w- or *-s-). 
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Thus, we have k > s (via *c), t> c,ts > s (Tocharian A only), n >n,l> ly, w> y (Tochar- 
ian B only), s> s,sk> ss, st > sc, sp > spy p, m, r are not palatalizable in either language, 
and neither are any of the consonants that can be the result of palatalization (c, n, v, etc.). 
However, in Tocharian B certain preterits to marked causative roots exceptionally allow 
“palatalization” of most of the non-palatalizable consonants. Thus, we find the “special” 
or “secondary” palatalization: k > ky, n > fiv, p > pv, m > my, ts > tsv. Non-causative 
preterits show only “primary” palatalization, e.g., Toch. B lyaka ‘he saw’ and Toch. A 
(imperfect) Ivak ‘he was seeing’. 

Root-initial palatalization is strongly favored by causativity. Almost all preterits to 
morphological causatives in Tocharian B are preterits that uniformly show palatalization, 
regular and “special,” where possible. Causative presents or subjunctive forms with the 
suffix -ask- sometimes, but not commonly, also show palatalization: B sarsask’a/e- [A 
sarsas ’d/a-] ‘teach’, causative to kars- ‘know’ (non-causative present karsna- in both lan- 
guages), B satkask- (beside katkask-) ‘send over’ to katk- ‘cross over’ (both with preterit 
satka), B salask’d/e- ‘to throw’ (but A salds’d/a-) to sal- ‘fly’, B spantask’a/e- ‘con- 
vince’ to spant- ‘trust’, B sparkask’dte- ‘destroy’ to spark- ‘perish’, and sparttask’d/e- 
(~ sparttask’a/e — sparttask'd/e-) ‘turn (trans.)’ to spartt- ‘turn (intr.)’. Note, with 
“secondary” palatalization we have B pyutkask'd/e- [A pvutkas'd/a -] ‘come into exis- 
tence’. But palatalization may also mark the present as opposed to the subjunctive, e.g., 
cesam/cekem ‘touch(es)’ but tekam/takam* ‘will touch’ (A present cak'd/ a-, subjunctive 
taka-) or B cepiv(e)- ‘step forth, appear’ but tapa- ‘will appear’. 

Umlaut: Both Tocharian languages have undergone a change whereby an *-a- in a 
following syllable has changed a preceding *-(’)e- to *-(’)§-, a phonological process 
we can call “d-umlaut.” The change works somewhat differently in Tocharian A and 
B, and that difference suggests the change was only in process at the time, whenever 
that was, that the two varieties of Proto-Tocharian that were to become Tocharian A and 
Tocharian B were losing contact with one another. In Tocharian B any *-(’)e- became 
*-(’)a- when another *-a- followed in the next syllable. In Tocharian A the change 
occurred only when the *-(’)e- was, on the basis of the Tocharian B evidence, unstressed. 
There is a common subjunctive fonnation that has the root-vowel *-e- (< PIE *-o-) in 
the active singular and *-a- elsewhere. When the thematic vowel, the morpheme join- 
ing root and person-number endings, is -a-, the root-vowels remain unchanged, thus 
B tekam/takam (< *tekan/takan) ‘(s)he will touch/they will touch’. By normal change 
Proto-Tocharian *-e- becomes Tocharian A -a-, but otherwise there is no change in the 
latter language either. When the thematic vowel is *-a-, however, Tocharian B shows the 
effect of d-umlaut, e.g., tarkam/tarkam (< *terkan/tarkan) ‘(s)he will release/they will 
release’. In Tocharian A the result is tarkas/tarkenc with the regular change of *-e- to 
-a- but with no d-umlaut to **tarkas. However, PToch. *lyeka- ‘saw’ gives lyaka- in 
both languages (3 sg. B lyaka, A Ivak). The effect of d-umlaut is analogically effaced in 
some situations. Thus, the plural of B pile ‘wound’ is pilenta, with preserved -e-, and not 
the **pilanta required by d-umlaut. But in verbal morphology d-umlaut is quite regular. 
Similar to d-umlaut, but much rarer, is o-umlaut, e.g., /sesoyu-/ ‘satisfied’ > B sosovu-. 

There is another kind of vowel affection that is to be seen in Tocharian B: in many sit- 
uations in the Tocharian B verb where we expect a ... e or e ... a what we find is o . . . o 
instead. Thus, the imperative of aks- ‘speak, announce’ should be /pe-akse/ underlyingly 
but appears in the surface structure as pokse instead. Likewise, Class IV presents, parallel 
in every way to Class III presents save that Class IV’s have the root-vowel -d- instead of 
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Class Ill’s -a-, have the shape, for example, B kloyotar ‘(s)he falls’ rather than **klayetar 
(compare the Class III B palketar ‘bums’ (intr.)). 

Ablaut: Both Tocharian languages show abundant remains of PIE ablaut, particularly 
in verbal paradigms. Ablaut is not found paradigm-intemally in present formations but 
is relatively common in both the subjunctive and preterit. As described above, many 
show a pattern of ablaut wherein the active singular is distinguished by one vowel (-e- [A 
-a-] or -a-, while the rest of the paradigm has -a-). In the preterit there are two loci for 
inter-paradigmatic ablaut: ( 1 ) in certain a-preterits (“Class I”) and (2) in sigmatic preterits 
(“Class III”). 

For Class I preterits with ablaut, the usual ablaut situation is an -a- in the active with 
preceding palatalization and -a- with no palatalization in the medio-passive and preterit 
participle, e.g., B carka ‘(s)he released’, B carkare ‘they released’ vs. B tarka(n)te ‘(s)he 
was/they were released’ (and preterit participle tarkau). In A the situation is different, and 
demonstrably older. In the active singular there is an - ’a- (as in B), cark ‘(s)he released’, 
but the preterit (dual and plural) have -a-, e.g., tarkar ‘(s)he released’. The medio-passive 
(tdrka(n)t) and participle ( tdrko ) have non-palatalizing -a-, as in Tocharian B. Tocharian 
B has one example of the older, Tocharian A, situation, where we find underlying -a- 
with preceding palatalization in the active singular, -a- (< underlying (Toch. B) -e- by a- 
umlaut) in the non-singular active, and -a- with no palatalization elsewhere: scama [3 act. 
sg.], stamais [3 act. dual], and (once) stamare [3 act. pi.] from stam- ‘stand’. The active 
singular, then, represents an old PIE *e-grade and the medio-passive an old PIE zero- 
grade. In Indo-European terms the more archaic type would appear to reflect an (active 
singular) *e-grade, (active plural) *o-grade, and (medio-passive) zero-grade. A generally 
accepted explanation for this ablaut system remains to be found. 

Class III or sigmatic preterits offer two ablaut (or rather palatalization) patterns. 
In the first we have Tocharian B -e- in the active, usually in the medio-passive, and 
the preterit participle - all with no palatalization. In some verbs of this group the 
medio-passive (MP) and preterit participle (PP) have Tocharian B -a-. An example with 
constant e- is tam- ‘be born’ ( temtsamai [1 MP sg.], temtsate [3 MP sg.], temtsamte 
[1 MP pi.], temtsante [3 MP pi.]; PP tetemu). Examples with an interchange of -e- and 
a- are park- ‘ask a question’ (preksa [3 act. sg.], parksante [3 MP pi.]; PP pepai'ku; 
A prakas [3 act. sg.], praksat [3 MP sg.], PP paparku ) and rak- ‘extend [one’s hand]’ 
(reksa [3 act. sg.; cf. Latin rexit ], raksamai [1 MP sg.], raksate [3 MP sg.]; PP rei'aku*; 
A rakas [3 act. Sg.], PP rarku). In the second subgroup we find the active with pala- 
talization + -e- and the medio-passive and preterit participle with -a-. A good example 
is the causative paradigm of palk- ‘bum, torture’ (pelvkwa [1 act. sg pelyksa [3 act. 
sg .], palyksatai [2 MP sg.]; PP pepalykit*). In Indo-European terms the first, non-pal- 
atalizing, subtype would appear to reflect the (active) *o-grade and (medio-passive) 
zero-grade, while the second, palatalizing, subtype would appear to reflect the (active) 
*e-grade and (medio-passive) *e-grade. Tocharian A shows the same two patterns 
(with -a- of course rather than -e-) but differs greatly from Tocharian B in the lexical 
distribution of the two types. 

MORPHOLOGY 

The Tocharian languages are typically Indo-European in their complex morphology, 
using a rich set of inflections in both nouns and verbs. 
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Nouns: ATocharian noun belongs to one of three gender classes, masculine, feminine, 
and neuter. The last class is also referred to as alternating, as nouns in this class look 
identical to masculine ones in the singular and feminine ones in the plural. Nouns inflect 
for number: singular, dual, and plural, and case. (For a discussion of the Tocharian dual 
and the supposed “paral” (for natural pairs), see Winter 1962.) The various cases mark 
the nouns’ syntactic role in a sentence. In Tocharian B there are nine cases: nominative, 
accusative, genitive, vocative (only distinct from the nominative in the singular), abla- 
tive, causal (only in the singular), comitative, locative, and perlative. The first four are the 
so-called primary cases, and the last five are secondary cases. 

The primary cases are developments of PIE case endings; have different forms for 
singular, dual, and plural; are always marked on the noun; and, in many nouns, cause a 
shift in stress. For example, the nominative and accusative of vakwe ‘horse’ have stress 
on the first syllable, but the genitive is ydkwentse with stress on the second. (Evidence 
for accent-shifting can only come from Tocharian B.) This mobility of accent does not in 
general reflect anything of PIE age. By a variety of regular phonological and analogical 
changes, not always well understood, nominal stress came usually to be on the first (left- 
most) “strong” vowel (-e-, -o-, -a- and diphthongs) and not on a “light” vowel (-/-, -u-, 
-a-). Thus, an original PIE *h[ekwo- ‘horse’ became PToch. *yakwe-. Complicating the 
surface structure of end-stressed nouns, like *yakwe-, at least in Tocharian B, is a later 
phonological development, Marggraf’s Law, that retracted stress from absolute final syl- 
lables, and thus the nominative-accusative singular *yakwe > *yakwe, but the genitive, 
*yakwen(a)se, remained unaffected. 

Adjectives agree in case with their head nouns when the head nouns are in the pri- 
mary cases. Secondary cases, on the other hand, are thought to derive from PIE postpo- 
sitions that have become fused with the noun, have the same shape whatever the number 
of the noun they is attached to, occur only once on a series of nouns in the same case, 
and do not cause any accent shift. Adjectives do not agree with their head nouns when 
the head nouns are in any of the secondary cases. When modifying nouns in second- 
ary cases, adjectives are uniformly in their accusative shapes. However, the distinction 
between primary and secondary cases is not absolutely sharp. In particular, the Toch. B 
ablative in -mem often causes accent shift, giving rise to the possibility that it was once 
a primary case ending that has become “secondarily a secondary case” if you will and 
the same is true of the Toch. A ablative ending -s (< PIE *-ti, represented also by Hittite 
-(a)z < *-(o)ti). 

A rather typical nominal paradigm would be vakwe ‘horse’ from PIE *h[ekwos (the 
Toch. A equivalent yuk, is given in italics in square brackets); see Table 10.4. 


TABLE 10.4 AN EXAMPLE OF TOCHARIAN NOMINAL DECLENSION 



Singular 

Dual 

Plural 

Nom. 

yakwe [yuk ] 

yakwene [yukam] 

yakwi [yukan] 

Acc. 

yakwe [yuk ] 

yakwene [yukam] 

yakwem [yukas] 

Voc. 

yakwa [yuk ] 

yakwene [yukam] 

yakwi [yukan] 

Gen. 

yakwentse [yukes] 

yakwenaisan [yuknis] 

yakwemts [yukassi] 

Abl. 

yakwemem ~ 

yakwemem [yukas] 

yakwenemem [yuknas] 

yakwemmem [yukas as] 

All. 

yakwesc \yukac] 

yakwenesc \yuknac] 

yakwemsc [yukas ac] 
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Com. 

yakwempa [yufcassal] 

yakwenempa 
[ yuknassal ] 

yakwemmpa 

[yukasassal] 

Loc. 

yakwene [yukam ] 

yakwenene \yuknam] 

yakwemne \yukasam ] 

Perl. 

yakwesa \yukd ] 

yakwenesa [yukna] 

yakwemsa [yukasa] 

Instr. 

[ yukyo ] 


[ yukasyo ] 


This paradigm is directly descended from PIE masculine o-stems. However, like vir- 
tually every nominal or adjectival paradigm in Tocharian, it shows some importations 
from PIE 7?-stems. Thus, the genitive singular and genitive plural are from *-o-neso and 
*-o-nesom respectively, and the entire dual shows an -n- from the same source. 

The fusion of accusative plus following postposition that created the secondary cases 
was apparently only beginning in the last stages of Proto-Tocharian because the two 
languages show different results. For instance, the allative postposition must have been 
PToch. *-ca (= Greek -del), and the perlative postposition must have been *-a. When 
added to the singular accusative we would have had *yakwe-ca for the allative; when 
added to the plural, *yakwens-ca. For the perlative it would have been singular *yak- 
w(e)-a and plural *yakwens-a. At some point in the history of Tocharian A the accusative 
plural ending was reduced from *-ns to *-s. The allative would then have been singular 
*yakwa-ca and plural *yakwas-ca. The latter form was rebuilt, by analogy to the singular 
as *yakwas-aca, whence the actual forms yukac and yukasac. The perlative followed 
the same process, and *yakwa-a and *yakwas-a gave actual yuka and vukasa. In pre- 
Tocharian B, however, word-final *-ns gave *-n (written -m in attested Tocharian). Origi- 
nal *yakwe-ca and *yakwens-ca would have given **yakwec and yakwensc respectively. 
Original *yakwe-a and *yakwens-a would have given **yakwa and *yakwensa respec- 
tively (with a morphological division *yakwen-sa because now the accusative plural was 
*yakwen). In both cases the plural form was generalized in Tocharian B, thus singular 
*yakwesca and *yakwesa. Also generalized from the plural was the Toch. B ablative end- 
ing -mem. In Tocharian A it was the singular ablative ending, in -s, that was generalized 
to the dual and plural. 

Adjectives: Tocharian adjectives are morphologically much simpler than nouns. They 
have at most four case forms (nominative, accusative, genitive, and vocative). When they 
modify nouns in other cases (e.g., ablative, instrumental) the accusative form is used, 
and, more often than not, nouns in the genitive are also modified by adjectives in the 
accusative. In form the most common kinds of adjectives are medleys of PIE o-stem, yo- 
stem, and 7 ? -stem adjectives. As in the case of nouns, the many 77-stem forms that show 
up in many places in the Tocharian adjective reflect the pre -Tocharian situation whereby 
nouns and adjectives had productive 77-stem derivatives that were individualizing and 
definite when compared to the non-77-stems that underlay them. With regard to adjectives, 
the situation must have been very similar to that of Proto-Germanic with the latter’s dis- 
tinction of definite and indefinite adjectives, the former being built on 77-stem antecedents 
and the latter not. In Tocharian the two types of adjectives were fused. The fusion seems 
to have been largely post-Proto-Tocharian because the mechanics of that fusion are often 
quite different in the two languages. In Table 10.5 we give examples of an inherited (PIE) 
o-stem ( iruwe ‘new’ [Toch. B only; Toch. attests only nu (m. sg.) and nwam (f. pi.)]), 
yo-stem (ndkciye \nakci\ ‘divine’), and two inherited 77-stems ( krosce [ k/as\ ‘cold’ and 
klvomo [klvom\ ‘noble’). 
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TABLE 10.5 SOME EXAMPLES OF TOCHARIAN ADJECTIVAL DECLENSION 


m. sg. nom. 

nuwe 

nakciye [nakci] 

krosce [kuras] 

klyomo [klyom] 

acc. 

nuwe 

nakciye [nakci (m)] 

kroscam [krossam] 

klyomom 

[klyomant] 

gen. 

nwepi 

nakciyepi [nakcinap] 


klyomopi 

[klyomdntdp] 

m. pi. nom. 

nuwi 

nakci [nakcini] 

krosci [krosse] 

klyomon [klyomas] 

acc. 

nuwem 

nakciyem [nakcinds] 

kroscam [krosses] 

klyomom 

[klyomancas] 

gen. 

nwemts 

nakciyemts 

[nakcinassi] 


klyomomts 
[klyom dncdssi ] 

f. sg. nom. 

nuwa 

nakciya [nakci (m)] 

[ krossi ] 

klyomna [klyomim] 

acc. 

nuwai 

nakciyai [ndkcyam] 

[ krossam ] 

klyomnai 

[klyominam] 

gen. 

nuwai 

nakciyai [ndkcine] 


klyomnai [klyomine] 

f. pi. nom. 

nwona 

nakciyana [nakcydn] 

kroscana [krossdn] 

klyomnana 

[klyominan] 


nwona 

nakciyana [nakcyas] 

kroscana [krossas] 

klyomnana 

[klyominas] 


nwonamts 

nakciyanamts 
[ nakcinassi \ 




Similar in declension are the various productive adjectives in -sse [-«'] ‘pertaining to 
X’, -fine [-fii] ‘ibid.’, -tse [-te] ‘having X’, and -lie [-/] ‘X-able’. Adjectives in-//<? [-/], in 
both languages, and -tse, in Tocharian B only, show palatalization in the masculine singu- 
lar (except the masculine singular nominative), masculine plural, and feminine singular 
(e.g., Toch. B (m. sg.) -lie, -lye, -Ivepi, m. pi. -Ivi, -Ivem, f. sg. -Iva, -Ivai, f. pi. -liana, 
-liana). The palatalization is ultimately from the »-stems (in the masculine) andya-stems 
(in the feminine). Tocharian has, in addition, abundant traces of participles in *-nt- or 
*-us- (both of which have been worked into the Toch. A paradigm for klvom ) and the 
denominative *-went- ‘having X’. 

Deictics and Third Person Pronouns: The different Tocharian deictics (i.e., the 
equivalents of ‘the’, ‘this’, and ‘that’), used also as third person pronouns, are based 
on a single basic paradigm to which different particles are attached. In Tocharian B 
the basic deictic/pronoun, usually used as a simple anaphoric, is masculine se (nom. 
sg.), ce (ace. sg.), cev (nom. pi.), cem (ace. pi.), feminine sa (nom. sg.), ta (ace. 
sg.), toy (nom. /ace. pi.), neuter te (nom. /ace. sg.). Only here in Tocharian is there a 
neuter, used predicatively in reference to whole clauses, or concepts. Except for the 
word-initial palatalization in some forms, the Tocharian B forms of the first deictic 
match very well with their counterparts in Greek (se, ce, cev, sa, ta, te = Greek 6, 
xov, [Doric] xoi, oq, xf)v, xo). 

The second deictic, more or less equivalent to the English definite article, though used 
more sparingly, adds -w in Tocharian B and -m in Tocharian A. The third deictic, ‘this 
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one here’, adds -n (written <m>) in both languages. The fourth deictic, ‘that there’ (only 
in Tocharian B), adds -m(p). As is commonly the case, the basic system is shared by both 
Tocharian A and B, but the details, and the inter-paradigmatic analogies, are the province 
of each language separately. The Tocharian B paradigms are underdifferentiated in the 
genitive and masculine plural, while Tocharian A has generally created unique forms for 
each “slot.” 

TABLE 10.6 DEICTIC PRONOUNS IN TOCHARIAN 


first 

second 

third 


fourth 

B 

A 

B 

A 

B 

A 

B 

m. sg. nom. 

se 

sas 

su 

sam 

sem 

sam 

samp 

m. sg. acc. 

ce 

cas 

cew 

cam 

cem 

cam 

camp 

m. sg. gen. 

cwi 

cast 

cwi 

cami 

cwi 

cani 

cwimp 

m. pi. nom. 

cey 

ces 

cey 

cem 

cey 

cem 

caimp 

m. pi. acc. 

cemts 

cesds 

cemts 

cesam 

ceyna 

cesam 

*caimpa 

m. pi. gen. 

cemts 

cessi 

cemts 

cesmi 

ceynamts 

cesni 

cempamts 

f. sg. nom. 

sa 

sas 

sa,, 

sam 

sam 

sam 

somp 

f. sg. acc. 

ta 

tas 

ta u 

tarn 

tarn 

tarn 

tomp 

f. sg. gen. 

tay 


tay 

temi 




f. pi. nom. 

toy 

tos 

tom 

tom 

toyna 

*tom 

toym 

f. pi. acc. 

toy 

tosas 

tom 

tosam 

toyna 

tosam 

toym 

f. pi. gen 



tomts 

tosmassi 

toynamts 



n. sg. 

te 

tas 

tu 

tarn 

tern 

tarn 

tamp 

n. sg. gen. 



tuntse 

tmis 




du. 



tai 

tim 




du. gen. 



tainaisan 






Relative and Interrogative Pronouns: The commonest relative and interrogative pro- 
noun, one that does not distinguishes gender (in both languages) nor number (in Tochar- 
ian B), but that does distinguish case (in both), is a combination of the first deictic with a 
preceding kw- to give B nom. k t/ se (A kus ), ace. k„ce (A kuc). The genitive forms are both 
irregular and not reducible to a single Proto-Tocharian preform; we have B ket(e) and 
A ke. In Tocharian A, but not in B, the relative pronoun is distinguished from the inter- 
rogative by the addition of the particle ne to the former but not the latter (thus kusne and 
kucne ). In Tocharian A a nominative plural of the relative pronoun is also known, kucene 
(no accusative plural is known, perhaps simply because of a lack of attestation). Both 
languages have a relative or interrogative adjective derived from a deictic prefixed by in- 
(in B) and an- (in A). In Tocharian B it is the second deictic that is so used; in Tocharian 
A, the third. Thus, we have B intsu (nom.), incau (ace.), etc. ‘which’ [A antsam, dncam\. 
More commonly, in this function in Tocharian B we have the second deictic prefixed by 
mdk-, i.e., maksu (nom.), rndkcau (ace.), etc. 

First and Second Person Pronouns: The array of personal pronouns in the two Toch- 
arian languages is given in Table 10.7. 
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TABLE 10.7 FIRST AND SECOND PERSON PRONOUNS IN TOCHARIAN 



Toch. B sg. 

Toch. A m. sg. 

Toch. A f. sg. 

Toch. B pi. 

Toch. A pi. 

nom. 

nas 

nas 

huk 

wes 

was 

acc. 

nas 

nas 

nuk 

wes 

was 

gen. 

ni 

ni 

nani 

wesan 

wasam 

nom. 

tuwe 

tu 


yes 

yas 

acc. 

ci 

cu 


yes 

yas 

gen. 

tan 

tni 


yesan 

yasdm 


In addition there is a reflexive genitive B sail [A sni\ and, attested only in Tocharian B, 
dual nominative-accusative wene ‘we too’ and yene ‘you too’. 

All of the second person forms, and the dual and plural of the first, have direct PIE ety- 
mologies (tuwe < *tuHom [cf. Sanskrit tuvam ], wes < *wos [cf. Latin uos]) or are easily 
explained analogical developments of known PIE forms. Not so the first person singular. 
The initial n- may reflect a PIE oblique case stem *mne-, but essentially everything else 
is unclear or downright weird. The -s of Tocharian B and the -s of Tocharian A are not 
reducible to a common denominator, and the origin of the typologically very unusual 
distinction in gender in Tocharian A is without any accepted explanation. 

Numbers: Cardinal numbers are inflected for gender (‘one’, ‘two’, ‘three’, and ‘four’ 
in Tocharian A; ‘one’, ‘three’, and ‘four’ only in Tocharian B), number (‘one’ only), and 
case (a full panoply only for ‘one’, a single nominative-accusative form and a rarely 
attested genitive for the other numerals through ‘ten’). 


TABLE 10.8 TOCHARIAN NUMBERS FROM ‘ONE’ TO ‘TEN’ 



Toch. B nom. 

Toch. B acc. 

Toch. A nom. 

Toch. A acc. 

Toch. B gen. 

‘one’ m. 

se 

seme 

sas 

som 

(see below) 

‘one’ f. 

sana ~ somo 

sanai ~ somo 

sam 

som 


‘two’ m. 

wi 

wi 

wu 

wu 


‘two’ f. 

wi 

wi 

we 

we 


‘three’ m. 

trai 

trai 

tre 

tre 


‘three’ f. 

tarya 

tarya 

tri 

tri 

taryamts 

‘four’ m. 

stwer 

stwer 

stwar 

stwar 


‘four’ f. 

stwara 

stwara 

stwar 

stwar 


‘five’ 

pis 

pis 

pan 

pan 

pisamts 

‘six’ 

skas 

skas 

sak 

sak 

skassamts 

‘seven’ 

sukt 

sukt 

spat 

spat 


‘eight 

okt 

okt 

okdt 

okdt 

oktamts 

‘nine’ 

nu 

nu 

nu 

nu 


‘ten’ 

sak 

sak- 

sak 

sak 



It is interesting to look closely at the paradigm for ‘one’ in both languages to see how 
the two languages have innovated with the material common to them so as to come up 
with quite different paradigms. The more usual forms of ‘one’ in the feminine singular, 
sana/sanai/sam, start from the PIE neuter singular *sm after final *-m had become *-n. 
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The non-nominative forms are generally from PIE *somos (cf. English ‘same’). The pal- 
atalization of the word-initial *s- presumably began in the masculine singular and plural, 
where it was analogically introduced for the same reason it was in the pronouns. In Toch- 
arian A it has spread to the feminine as well. The rounding of PToch. *-e- (< PIE *-o-) 
before the bilabial -m- is regular in Tocharian A. 


TABLE 10.9 THE DECLENSION OF ‘ONE’ IN THE TOCHARIAN LANGUAGES 



Toch. B m. 

Toch. A m. 

Toch. B f. 

Toch. A f. 

sg. nom. 

se 

sas 

sanai ~ somo 

sam 

acc. 

seme 

som 

sanai ~ somo 

som 

gen. 

semepi 

somap 



pi. nom. 

semi 

some 

somona 

somam 

acc. 

semem 

somes 

somona 

somam 

gen. 

sememts 

somessi 

somonamts 



The meaning of the plural of ‘one’ is ‘some’ (cf. the identical phenomenon in the 
Romance languages). Other numbers may also show a plural under the right condi- 
tions; A sdptdnt ‘sevens’, i.e., ‘weeks’; or B skanma ‘tens’, i.e., ‘decades’; B kantenma 
‘hundreds’. 

In Tocharian B the second decade is composed of the numbers ‘one’ through ‘nine’ 
compounded with ‘ten’: sak-se ‘eleven’, sak-wi ‘twelve’, sak-trai, sak-stwer, sak-pis, 
sak-skas, sak-sukt, sak-nu. The structure is essentially the same in Tocharian A, save for 
the addition of a conjunction -pi; thus sak-wepi ‘twelve’ (f.), sak-nupi ‘nineteen’, etc. The 
decade numbers, ‘twenty’ through ‘ninety’, show (partly opaque) derivatives of the num- 
bers ‘two’ through ‘nine’: B ikatn [A wiki] ‘twenty’, taiyaka [taryak] ‘thirty’, stwarka 
[stwarak ] , pisaka [pnak], skaska [silks ilk], suktanka [sdptuk], oktanka [oktuk ] , ftumka 
[: nmuk ]. As can easily be seen a lively analogy has been at work in both languages, but 
in quite different ways. The suffix reflects a PIE *-(d)komt, which, at an early point in its 
history became pre-Tocharian *-kom. The *-om part was identical with the usual neuter 
singular noun suffix, whose singulamess was felt to be inappropriate for these numbers 
and changed to the neuter plural *-a. Thus PToch. *-ka and thence, regularly, Toch. B -ka, 
A -k. The word for ‘hundred’ is inherited: kante [kant] (< PIE *kmtom), as is probably the 
word for ‘thousand’: yaltse [waits] , but higher numbers are borrowed: tmane [tmam] ‘ten 
thousand’ and kor [kor] ‘ten million’. 

The ordinal numbers from ‘second’ to ‘tenth’ are extremely transparent derivatives of 
the corresponding cardinal number and the suffix -te (Toch. A -t)\ wate [wdt] ‘second’, 
trite [ trit ] ‘third’, starte [start], pinkte [pant], skaste [skiist], suktante [sdptdnt], oktante 
[oktant], nunte [*nunt], skante [skant] ‘tenth’. In Tocharian B the process continues with 
ikante, but no higher ordinal numbers are attested. In Tocharian A we do not have ‘twen- 
tieth’, but there are taryakinci ‘thirtieth’, stwarakinci ‘fortieth’, and sakskinci ‘sixtieth’ 
with the suffix -nci. The word for ‘first’ is Toch. B parwe/parwesse (cf. Sanskrit purva-), 
Toch. A maltowinu (but compare A parwat ‘first-born son’). 

Verbs: The Tocharian verb is the most complex part of Tocharian morphology. 
A given verb can distinguish two voices: active and medio-passive (intransitive, 
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reflexive, or passive); two aspects: imperfective and perfective (but only in the past, in 
the distinction of imperfect [impfl] and preterit [pret.]); three moods: indicative (present 
and past), subjunctive (mostly with future meaning in main clauses, while signaling 
uncertainty or possibility in subordinate clauses), and imperative; number: singular, 
dual, and plural; and three persons: first, second, and third. In these distinctions it is 
fully as complex as the “classical” Indo-European languages such as Sanskrit, Greek, 
or Latin. In addition, there is an infinitive (made from the subjunctive stem in Tocharian 
B, from the present stem in Tocharian A); two verbal adjectives, one from the present 
stem, signaling necessity, and the other from the subjunctive stem, signaling possibility; 
a preterit participle; and an agent noun. The verbal adjectives can be combined with 
‘to be’ to create modal-like forms (‘one is to do X’, ‘one could do X’), and the preterit 
participle can also be combined with ‘to be’ to form a (rare) perfect (‘one has done X’). 
Most intransitive verbs, and many transitive ones, allow the derivation of a causative 
(“the window broke” > “John broke the window”, “John knows the rules” > “Tom 
showed/taught John the rules”). A small number of verbs, transitive or intransitive, have 
derived intensives, e.g., kaln’ale- ‘resound’ > kalnask’d/e- ‘howl [of the wind]’, oksd- 
‘awaken’ [intr.] > aksask’dle- Tie awake’. 

The complexity of Tocharian verbal morphology is wholly consistent with that 
of its Proto-Indo-European ancestor. Not only is the system clearly Proto-Indo- 
European, but its expression is largely inherited as well. Among present stems, for 
instance, we find descendants of PIE e/o-presents, also ve/o-, ske/o-, se/o-, eh,-, 
neh,-, nh,ve/o-, and nasal-infix presents, as well as old denominatives reflecting PIE 
*-n-ye/o- and *-eh 2 -ye/o-. Thus, we have, using Tocharian B examples, parale- (< 
*b h er-e/o-) ‘bear’, ceppi(ve)- (< *topye/o- [with innovative word-initial palataliza- 
tion]) ‘step forth \ pask’a/e- (< *peh 2 -ske/o-) ‘guard’, luks’ale- (< *luk-se/o-) ‘illu- 
minate’, laka- (< *l e g-eh 2 -) ‘see’, tarkna- (< *TrK-ne-h,-) ‘release’, mdntdnna/e- (< 
*mnt-nh 2 -ye/o-) ‘harm’, pihkd- (< *pi-n-k-) ‘paint, write’, kwipehha/e- (denominate 
to kwipe ‘shame’) (as if pre-Tocharian < * kwipen -ve/o-l ‘be ashamed’, klautko- 
(denominative to klautke) (as if < *klowt s k-eh 2 -ye/o-) ‘change’. The Tocharian sub- 
junctive shows many of the same formations, and, indeed, there are many instances 
where the present stem and subjunctive stems are identical. The most common sub- 
junctive, however, is a formation with suffixal -a- (of unclear and perhaps multiple 
origins). The present stems in the previous list are accompanied by the follow- 
ing subjunctives: parale- (suppletive subj. kama-), ceppi(ve)- (subj. tappa-, pret. 
*tappa-), pask’ale- (subj. pask’ale- [identical to present], pret. passd-), luks’ale- 
(subj. luk’a/e-, pret. Ivauk(sa) — lauksa-), laka- (subj. laka-, pret. lyaka-), tarkna- 
(subj. tarka — tarka-, pret. carka- ~ tarka-), mantahha/e- (subj. manta — manta-, pret. 
manta-), pihkd- (subj. paika-, pret. paika-), kwipehha/e- (subj. kwipehha/e-, pret. 
kwipehha-), klautko- (subj. klautka-, pret. klautka-). Unlike the other non-Anatolian 
branches of Indo-European, Tocharian shows no traces of “long vowel” subjunc- 
tives, i.e., those subjunctives formed by adding the thematic vowel, in its role as a 
subjunctive marker, to the thematic vowel to give subjunctives in *-e/o-. Nor are 
there thematic optatives in *-o-T- (< *-o-ih r ). Rather, we have subjunctives and 
optatives without the thematic vowel, namely, subjunctives in *-e/o- and optatives in 
thus pask’d/e-, both present and subjunctive < *peh 2 sk-e/o- and optative pask’i- 
(< *peh 2 sk-ih,-). The absence of “long vowel” subjunctives and thematic optatives 
in *-o-T- is clearly an archaic trait. 
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TABLE 10.10 CONSPECTUS OF THE TOCHARIAN VERB (Tocharian A forms in italics) 


non-causative 

active 


medio-passive 


tense 

NON-MODAL 

MODAL 

NON-MODAL 

MODAL 

present 

karsanam 

karsam [subj.] 

karsanatar 

karsatar [subj.] 


[karsnas] 

[krasas] 

[kdrsnatar*] 

[kdrsdtdr] 

non-present 

karsanoy[Impf] 

karsoy [Opt] 

karsanoytar [impf.] 

karsoytar [opt.] 


[sars] 

[karsi] 

[kdrsnitdr*] 

[karsi tar*] 

preterit 

sarsa 


karsate 



[sars] 


[kdrsdt] 


preterit 

karsau 




participle 

non-finites 

[karso] 




infinitive 

[karsnatsi] 

karsatsi 



gerundive 

karsanalle 

karsalle 




[karsnaf] 

[kdrsdl] 



abstract 


karsalne 

[kdrsdlune] 



m-participle 

karsanamane* 

[kdrsnam*] 




72/-participle 


karsauca 

[kdrsndnt] 



causative 

tense 

NON-MODAL 

MODAL 

NON-MODAL 

MODAL 

present 

sarsassam 

sarsassam[subj.] 

sarsastar* 

sarsastar* [subj.] 


[sarsas] 

[sarsas *] 

[sars as tar*] 

[sarsastar*] 

non-present 

sarsassi [impf.] 

sarsassi [opt.] 

sarsassitar* [impf.] 

sarsassitar* [opt.] 


[sars si] 

[sars as is*] 

[sarssitdr] 

[sarsasitdr*] 

preterit 

sarsa 


sarsate* 



[sasdrs] 


[. sasdrs at *] 


preterit 

sessarsu 




participle 

non-finites 

[sasarsu] 




infinitive 

[sarsassi] 

[sarsastsi*] 



gerundive 

sarsassalle 

sarsassalle* 




[sars ascii*] 

[sarsasdl*] 



abstract 


sarsassalne* 

[sarsaslune*] 



m-participle 

sarsaskemane* 

[sdrsdsmdm] 




^/-participle 

sarsassenca* 





NB: The finite forms are all third person singulars; the asterisk signifies a non-attested form/category 
(in all such cases, however, the reconstruction is certain). Here and elsewhere *a is an -a-, which causes 
obligatory palatalization of a preceding palatalizable consonant. Thus, in this case, we get alternation 
between passa- and paske- (e.g., third person singular passam and third person plural paskem, ultimately 
representing PIE *peh 2 sketi and *peh 2 skonti respectively). 
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SOME TOPICS OF WORD FORMATION 

As in other Indo-European languages, Tocharian has a rich set of derivational processes. 
The commonest of these are the creation of denominative verbs, adnominal adjectives 
and possessive adjectives, abstract nouns, and nominal compounds. 

Denominative Verbs: There are two ways by which denominative verbs may be cre- 
ated, by suffixation or by compounding with the verb yam- [A ypa-] ‘do, make’. The 
process of suffixation is inherited but probably no longer productive in either language. 
With just enough examples to assure us of its existence, there was a process by which a 
(PIE) o-grade noun might take the verbal suffix -a- to form a new verb. Thus, the Toch. 
B noun klenke ‘vehicle’ gave klanka- ‘travel by vehicle’ or B klautke [A lotak\ ‘change’ 
gave B klautka- [A lotka- (new analogical present lotanka -)] ‘change’ (vb.). A bit more 
commonly, we have the suffix -male- [A -inilal a-] (historically a denominative to n-stem 
nouns, i.e., PIE *-n-ye/o-); thus, B tankw [A tuhk\ ‘love’ gives B tankwanna/e- [A 
tunkinnala-] ‘love’ (vb.). Far, far more common than either of these “suffix solutions’’ is 
to combine a noun with the verb yam- \ypa-\ e.g., B apakartse yam- ‘be visible, be man- 
ifest’, B saimyam- [A semypa-\ ‘take refuge’. Of the approximately 625 verbs known in 
Tocharian B, 54 are denominatives made with yam-; the numbers in Tocharian A would 
appear to yield a similar ratio. 

Adnominal and Possessive Adjectives: Adnominal adjectives are those that mean 
something like ‘pertaining to X’ or ‘consisting of X’, while possessive adjectives are 
those that mean something like ‘having X’. In each case the ‘X’ is a noun. Thus, B 
kokale [A kukal ] is ‘chariot/wagon’, and the adnominal B kokalesse is ‘pertaining to a 
chariot/wagon’, while the possessive adjective B kokaletstse is ‘having a chariot/wagon’ 
(nominalized as ‘charioteer, carter’) or B stam [A stam] ‘tree’, where the adnominal B 
stamasse [A stamasi] is ‘pertaining to a tree’ and B stanatstste (built to the plural stand) 
is ‘possessing trees’. Adnominal adjectives are essentially universally productive in both 
languages and play an interesting syntactic role (cf. infra); the possessive adjectives are 
common in Tocharian B but quite rare in Tocharian A. 

Abstract Nouns: Both Tocharian languages have fully productive means by which 
to create abstract nouns from adjectives and nouns. In Tocharian B there is -mine (less 
commonly -mine or -auile), and in Tocharian A there is -une (less commonly -one). (The 
Tocharian B and A suffixes are presumably ultimately related, but the relationship is not 
well explained.) Thus, in Tocharian B we have astare ‘pure’ > astarne ‘purity’ [A as tar 
> astrone ], B parkar ‘long’ > parkarne ‘length’ [A pdrkar > parkrone ‘length’], or B 
lant ‘king’ (ace.) > lantuilne ‘kingship’ [A lant > lantune ]. The creation of abstract nouns 
from adjectives includes creation from the productive deverbal adjectives in -lie [A -/]. 
Usually it is the verbal adjective built on the subjunctive stem (potential gerundive) that 
forms the basis of the verbal abstract noun, but sometimes one also finds abstracts built to 
the verbal adjective derived from the present stem (the gerundive of necessity): B tsrdlile 
‘separation’ based on the subjunctive stem tsara- but also tsrelile ‘id.’ based on the pres- 
ent stem tsdre- or A kalpalune (from the subjunctive) and kalpnalune (from the present), 
both ‘achievement’. 

Compounds: The Tocharian languages are rich in nominal compounds. Essentially all 
types of compounds that are familiar in other Indo-European languages can be found. 
There are copulative compounds (the dvandas of Sanskrit grammatical tradition), e.g., 
B nem-kalywe ‘fame’ [A nom-klyu] (‘name-fame’), B pacer-macer [A pacar-macar ] 
‘parents’ (‘father-mother’), B swatsi-voktsi [A (pi.) swatsi-yoktsintu \ ‘food and drink’. 
There are also numerous examples of determinative compounds acting as adjectives (the 
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Sanskrit grammarians’ bahuvrlhis): as B astre-palsko ‘with pure spirit’, B taiyd-ydkne 
‘three-fold’, B totka-yarm ‘of small measure’ [cf. A tsru-yarm ‘id.’], B treya-meskem 
‘having three joints’ [cf. A tri-asamkhesi ‘pertaining to the three ages’], B parb'e-saul 
‘long-lived’, B ska-maiyya ‘having ten powers’ [cf. A ska-tempevum ‘having ten pow- 
ers’], B seme-ydrm ‘having a single measure’, B se-suke ‘having a single taste’, A 
soma-pacar ‘having the same father’. Sometimes the bahuvrlhis are provided with an 
adjectival derivational suffix (B -tstse, A -(u)m) as well, e.g., B orotstse-campamnetstse 
‘having great capabilities’, B volo-pilkotstste ‘with bad insight’, A nom-kalywats 
‘famous’, A nom-klyum ‘famous’, A ska-tempevum ‘having ten powers’. Finally, we find 
substantial numbers of endocentric determinative compounds (tatpurusas), though this 
latter sort of compound may be less numerous than, say, in Germanic, since they have 
heavy competition in the form of adnominal adjectives plus nouns (cf. supra and infra). 
With a regular verbal derivative (participle, verbal noun, agent noun) as the second mem- 
ber we have, for instance, B osta-smenca ‘householder’, B kartse-aksu ‘well instructed’, 
B kartse-rita ‘seeking good’, B telki-vamor ‘sacrifice’. With nouns not directly derived 
from verbs as second members we have B kartse-palsko [A kaswa-paltsak] ‘having a 
good thought’, B orotstse-pacer ‘grandfather’, B kasar-wastsi ‘Msdva-clothing’ [cf. 
A kasari wsal ‘id.’], B kuni-mot ‘grape-alcohol’ (i.e., ‘wine’), B kaum-nakte [A kom- 
nkat\ ‘sun-god’, B panakte ~ pud-ndkte [Apta-nkat\ ‘Buddha’, B povsi ‘Buddha’ (i.e., 
‘all-knowing’) [cf. A puk-knanmam ‘id.’], B san-anm [A sni-diicdm ] ‘oneself’, B san- 
saumo ‘relative, retainer, follower’. 


SYNTAX 

The Tocharian languages are typical Indo-European languages in that they have a rela- 
tively “free” word order because the rich morphology will often tell the relationship of 
a noun, say, to other nouns or to the verb in a sentence. The result of this morphological 
robustness is that rarely is it the case that, unlike the situation in English, some partic- 
ular constituent of a sentence (e.g., subject, verb, or adjective) has to be in a particular 
syntactic position in a sentence. Still, the order of elements in a Tocharian B sentence 
is not altogether random. It is true, far more often than not, that in independent clauses 
the semantically neutral, or unmarked, word order in Tocharian prose is subject-ob- 
ject-verb. A Tocharian B example, from a letter with no literary pretensions, is Silarikite 
[sic] Aryawarmem tsamo vsuwarsa preksam ‘STlarksita inquires with great friendship of 
Aryavarman’. In copular (or “nominal” sentences) the order is subject-predicate-(BE), 
where the BE, at least in the present tense, is often not present (e.g., from another letter, 
sankantse avato nesanne ste ‘the situation of the community is satisfactory’ with BE 
present, and, in a literary text, \c]m(e)lassai ytdri ydtsi tumem (pre)[k\e : ‘therefore [it is] 
time to go the way of births’ with BE absent). 

Particularly in poetry different orders appear metri causa or for literary effect. But even 
in non-literary texts, emphasis or focus or new information may be shown by fronting 
the syntactic constituent involved. It is almost certainly the case that any element of a 
clause may be fronted. Because of both emphasis and focus criteria, imperative verbs 
are commonly, though not universally, fronted, e.g., pluwas wesi sak menatse-ne wai 
pardce komne ploriyacem ‘send us, on the tenth of the month and the day of recitation, 
musicians’. Also normally fronted are interrogative and relative pronouns. With interrog- 
ative pronouns: k u ce klautkesa aisalyi ‘by what behavior [are they] to be recognized?’, • 
k tl cesa akalkantats yatalyne masketra : ‘by which is the taming of desires?’; with relative 
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pronouns: ket ait voktsi samtk=onwanne pelaivknesse lyaitkem teki mantanta sap 
srukentdr cai • ‘to whom thou givest the righteous, immortal medicine to drink, they will 
avoid sickness and never die’, k„ce te mant wnawa, tu nke wenau anaisai ‘what I have so 
said, that will I now speak clearly’. 

Constituents can be shifted rightward as well. Right-detachment is a process whereby 
appositional phrases or other adjuncts are moved to the end of a grammatical sentence, 
e.g., vsare lac pis cakanma wi tauwa swer; wasokas aka las cak pis tom ‘Wheat went out, 
five caks, two tom; for the [making of] wasok [= beer?] barley went out, one cak, five 
tom ’. In both clauses we have intransitive sentences (‘wheat went out’, ‘barley for beer 
went out’) where the subject has been expanded by the addition of measurements, but 
those measurements come after the verb. 

The usual place for an adjective or genitive is before the noun it modifies. Non- 
literary monastic records provide abundant examples, e.g., olvTskasse Sinkene wer- 
wivesa ‘for Sinke’s o/v/sfe-garden’, wace krai ‘in the second market/on the second 
market-day’, ptamasse werwiyesa ‘for the stupa’s garden’ (for the adjective), or 
stere Nanatewetse sotri ‘the sign of the elder Jnanadeva’, kapvares swasisse ‘food- 
provisions for the monastic workers’ (for the genitive). Other kinds of modifiers, e.g., 
prepositional phrases, also tend to proceed, e.g., ysine cokis salvwe ‘oil for the night 
lamp’. But even in documents such as these we find the reverse order, e.g., cevnas pito 
koromne ‘the price of their [transportation] by mules’. In more literary documents, the 
orders N-A and N-G occur freely. Particularly in more literary texts the modifier and 
noun may be separated from one another (“distracted” element underlined), e.g., se 
wcevi ake slokantse ‘the end of the second stanza’, wesan nake sarnene kekamu nest 
‘into our hands thou hast now come’, snai kes Ikassam kektsenan ‘he sees countless 
bodies’. A rather extreme case of distraction, and in general rearrangement of expected 
word order, is keklyausormem krent nem-kalywe wroccu wlo tail ‘having heard of thy 
good renown, [O] great king’, where tan belongs with krent nem-kalywe and *tan krent 
nem-kalywe would normally precede keklyausormem. Nevertheless, distraction is pos- 
sible even in a non-literary letter, e.g., sankantse avato nesanne ste ‘the situation of the 
community is satisfactory’. 

There are three kinds of dependent clauses in Tocharian B: (1) those that modify a 
particular noun or pronoun, (2) those that function as part of the verb phrase as an adverb 
of some sort, and (3) those that function as a subject, direct object, or predicative nomina- 
tive. The first group is composed of relative clauses; the second of temporal, local, man- 
ner, causal, and result clauses; and the third group of noun clauses. Dependent clauses 
almost invariably precede the main clause. Relative clauses are a partial exception in 
that we finds both orders without apparent difference in meaning; some follow (relative 
pronoun bolded): cauwak vakne enkaskemttar {makcau procer ensate} ‘we take that very 
way that [our] brother has taken’, III peparkormem vamor krenta { miikcpi \ okonta warpa- 
novtra} ‘having asked after the good deed whose fruits he enjoyed’, but some precede: 
{k^ce cai makci enkossem f pilkonta ‘these insights which they themselves [have had]’. 
Most commonly, however, relative clauses are “correlative,” e.g., {k u ce te mant wnawa}, 
tu nke weriau anaisai ‘what I have so said, that will I now speak clearly’. 

Dependent clauses are marked by a clause-initial subordinator (relative clauses are 
again an exception in that the subordinator, the relative pronoun, is not always clause ini- 
tial). The main clause, too, may be marked by a clause-initial particle, e.g., {elite palsko 
tsankam -ne) ot swatsi voktsi kalvdsdm ‘whenever the spirit arises in him, then he finds 
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food and drink’. Comparative clauses and correlative relative clauses are (almost) always 
so marked; other types show a main clause particle 20% or 30% of the time. 

As in other Indo-European languages with morphological verbal moods, the Tocharian 
subjunctive is commonly found in dependent clauses. Where both subjunctive and indic- 
ative can be found, the subjunctive reflects an indefinite or unspecified subject/condition 
or is future -oriented, or both. The optative in these dependent clauses is used either as the 
past equivalent of the subjunctive (by a Tocharian version of the sequence of tenses rule) 
or when the dependent clause is offered as unreal, e.g., in contrary-to-fact conditional 
clauses (“if I were rich . . . ”) or in relative clauses where the antecedent is denied (“there 
is no one who . . .’, e.g., B ma sii nesam {k u se onwanne takoy} ‘there is no one who is [lit. 
may/might be] immortal’). 

One of the characteristic syntactic phenomena of Tocharian is the widespread presence 
of adnominal adjectives. To a much greater degree than is the case for most Indo-Euro- 
pean languages, Tocharian has a productive grammatical rule whereby a modifying gen- 
itive noun may be converted into an adnominal adjective that, then, will agree in gender, 
number, and case with its head noun. Thus, one can say either B slentse [gen.] trofikne 
lyam=ompalskonne ‘he sat [pret.] in meditation in a hollow of the mountain’ or sieve [lege 
slive\ [adj.] gune cau sainy ompolskonne ‘he sat [impfi] in meditation in that mountain 
cave’, rupn= efywentse [gen.] ‘in the shape of a man’, but secakanne [adj.] rupsa ‘in the 
shape of a lion’, or further B sb-enantse [gen.] paruwa ‘crow- feathers’, but skrense [adj.] 
paivve ‘crow-foot’. These adnominal adjectives may take one of three possible shapes: (1) 
-ive (inherited directly from PIE *-iyo- and restricted to long-established inherited nouns, 
e.g -,ypiye ‘barley-’ [from yap ‘barley’], wastaiye ‘camel-’ [underlying noun unattested], 
kewiye ‘bovine’ (as a noun, ‘butter’) [ke„ ‘cow ’],patarye ‘paternal’ {pacer ‘father’], etc.); 
(2) -me, common only in Tocharian B, from PIE *-n-yo- and largely restricted to animate 
nouns (e.g., yakwenne ‘equine’ [yakwe ‘horse’]); and (3) -sse, largely restricted to inani- 
mates and dual or plural nouns of all degrees of animacy (e.g., klautsaisse ‘pertaining to 
the ear’, klautsanesse [A klosnasi] ‘pertaining to the two ears’). However, the distinction 
is not rigid, and there are nouns that give rise to more than one denominal adjective, e.g., 
yakwenne and yakwesse [A vukasi], both ‘pertaining to a horse’. 

Unlike the situation in other Indo-European languages in which this formation exists, 
where it can be used only when the modifier is singular and itself otherwise unmodified, 
Tocharian allows the modifying noun to be of any number (as also in Luvian among the 
Anatolian languages), and it, in turn, may be modified (thus, we have the two adjectives 
klautsaisse ‘of the ear’ and klautsanesse ‘of the two ears’, laklesse ‘pertaining to suf- 
fering’ (from lakle ‘suffering’) and laklentasse ‘pertaining to sufferings’ [Toch. Aklopl 
klopant ‘suffering(s)’, whence both klopasi and klopantwasi]). In many cases the genitive 
and adnominal adjective are semantically identical, or substantially so. In other cases the 
use of the genitive may signal a specified noun, while the adjective may be more generic, 
e.g., lante warksal ‘the king’s power’, but lantunne warksdl ‘royal power’. Animacy is 
also a factor: the more animate the modifying noun, the more likely that it will remain in 
the genitive and vice versa. 

Even more astonishing is the behavior of this phenomenon when it occurs in noun 
phrases with more than one level of modifier-embedding. A noun phrase (NP,) can con- 
sist of a noun (N A ) and a modifying noun phrase (NP 2 ), which, in turn, can consist of a 
noun (N b ) and noun phrase (NP 3 ), and so on. In the diagram below, either or both of N B 
and N c can be a noun in the genitive or a derived adnominal adjective. 
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N a 



N b NP 3 


N c 


If N c is a derived adnominal adjective, the adjective agrees in gender with the noun 
N B , even if the noun has itself been changed into an adnominal adjective (agreeing with 
the head noun, N A ). Let’s look at an example, this one from a list of medicinal recipes: 
laksahai [adj., f. acc. sg.] klautsaisse [adj., m. nom. sg.] spel [noun, m. nom. sg.] ‘poultice 
of fish ears/gills’. The N B slot is occupied by the adjective klautsaisse, which is regularly 
derived from the feminine noun klautso ‘ear/gill’. It is in the masculine nominative singu- 
lar form because it agrees with N A , spel ‘poultice’, a masculine nominative singular noun. 
The N c slot is also occupied by an adjective, laksahai, which is regularly derived from 
laks ‘fish’. However, it is in the feminine singular genitive/accusative form because it 
agrees with the feminine genitive singular of klautso, namely, klautsaintse, which under- 
lies the surface-structure klautsaisse. Particularly interesting is the double example such 
as makce u ykessa kektsehe ta„ kenasse sataslhe ‘whatever place the body [has], exhalation 
[has] that [place on] earth’. Here we have mdkce u , which is masculine, agreeing with the 
underlying ike ‘place’ (m.), even though ike has become ykessa, a feminine adjective, to 
agree with the feminine noun kektsehe ‘body’. Conversely, ta„ is feminine to agree with 
the feminine noun kem ‘earth’, which has been replaced by the adjective kenasse (mascu- 
line to agree with sataslhe ‘exhalation’). 


FURTHER READING 

Supporting the notion that “Tocharian” is a correct name for those languages so-called 
is Adams 2000. It is a review of the data and analysis that probably justify grouping the 
classical Tokharoi, the Uyghurs’ Tuyri, and the Tocharians into a single historical “ethnic 
complex.” It is certain that not all investigators will agree at every point. Balanced reviews 
of the prehistory of the Tarim Basin are Barber’s (1999) The Mummies of Urumchi and J. P. 
Mallory and Victor M. Mair’s (2000) The Tarim Mummies. Both books are readable and 
very infonnative accounts of the prehistory of the Tarim Basin, its peopling, and its culture. 

Overwhelmingly the best place to go for the Tocharian texts themselves is CEToM 
20 1 1 — , A Comprehensive Edition of Tocharian Manuscripts. The database is as yet 
incomplete but is designed to give digital access to very nearly all Tocharian manuscripts, 
accompanied, where possible, by (tentative) transcriptions and commentary; this will be 
a resource for Tocharianists that is unlike that of any other Indo-European group. Acces- 
sible now are the texts housed in London, those housed in Berlin, and many of those 
housed in Paris. 

Krause and Thomas 1960-1964 is a two-volume work containing a serviceable if brief 
grammatical sketch (phonology and morphology) of both languages, along with selected 
texts and glossaries from both. Sieg et al. 1931 is an early work that provides what is still 
in some respects the best and fullest grammar of the two languages, but clearly focused on 
Tocharian A. It is dated but still very serviceable, particularly with regard to morphology. 



TOCHARIAN 475 


Like Krause and Thomas 1960-1964, Pinault’s chrestomathy (2008) contains readings and 
grammatical analysis, along with considerable discussion of the development of Tocharian 
from Proto-Indo-European. Peyrot (2008) provides the primary discussion and analysis 
of the internal, half-millennium-long history of Tocharian B. Malzahn 2010 and Peyrot 
2013 together form an exceptionally thorough morphological description of the Tocharian 
verb. Thomas 1957 is a thorough examination of Tocharian aspect. None of these is for the 
neophyte. A full-fledged syntax of either or both languages is clearly a desideratum. (But 
see now Adams 2015.) 

The Tocharian A glossary of Krause and Thomas (1960-64) remains the best overall 
dictionary for that language, though it is supplemented by the incomplete dictionary of 
Carling et al. (2009). Adams 2013 is the most complete dictionary, synchronic or dia- 
chronic (it is both), of either Tocharian language, though by no means the “final word.” 
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CHAPTER 1 


BALTO-SLAVIC 

Steven Young 


The similarities in structure and lexicon, some quite striking, between the Baltic and Slavic 
branches of the Indo-European language family have long drawn scholarly attention. But the 
precise nature of the relationship between the two branches has been a source of debate for 
nearly as long. The prevailing view up to the early nineteenth century was that the Baltic 
languages were a kind of Slavic. It was through the work of the comparativists Franz Bopp 
and August Pott that Baltic was finally recognized as a language family in its own right (Petit 
2004: 17-19). The classic presentation of Balto-Slavic as an intermediate branch of the 
Indo-European language family, with Baltic and Slavic as parallel sub-branches, is reflected 
in August Schleicher’s Stammbaum, or family-tree, approach, in which “slawolitauisch” 
(Balto-Slavic) develops from an earlier “slawodeutsch” (Petit 2004: 19-20). Comparative evi- 
dence for a Balto-Slavic grouping was later supplied by Karl Brugmann ( 1 904: 1 8) in the fonn 
of eight shared innovations in phonology and morphology, among them the development of 
syllabic sonorants such as -iR-, the use of pronominal *-yo- to fonn definite adjectives, and the 
replacement of the inherited genitive singular form of o-stem nouns by the ablative. 

The first to offer a principled challenge to the neogrammarian tenet of a post-Indo- 
European period of Balto-Slavic common development was Antoine Meillet in 1908 (cf. 
1967: 59-67). 1 In Meillet’s view, the various agreements between the two language families 
reflect either inherited Indo-European archaisms (with both Baltic and Slavic developing 
from nearly identical Indo-European dialects) or later parallel developments in each of the 
branches, rather than shared innovations that would demonstrate a stage of linguistic unity. 

Jan Endzelin’s 1911 study of the agreements and differences between Baltic and Slavic 
represents an attempt to reconcile Meillet’s approach with that of the neogrammarians. 
While Endzelin agrees with a number of Brugmann’s shared innovations, adding to these 
syntactic comparisons (predicate instrumental and genitive direct object in a negated sen- 
tence) and a substantial number of exclusive lexical agreements, he accounts for these by 
positing a period of prolonged language contact between neighboring Baltic and Slavic 
communities, leading to a degree of linguistic convergence, particularly in the lexicon. 
According to Endzelin, while we can justifiably speak of a common Balto-Slavic period 
(‘bnoxa coBMecTHoir 3KH3HH”: 202), this does not imply a Balto-Slavic protolanguage, 
reflecting a stage of complete linguistic identity. 

Additional arguments, both in support of and against a period of Balto-Slavic linguis- 
tic unity, were put forward over the course of the twentieth century. Among the notable 
attempts in support of a “Sprachgemeinschaft” is Reinhold Trautmann’s 1923 Baltisch- 
Slavisches Worterbuch, in which Baltic and Slavic lexical correspondences are system- 
atized. A shortcoming of this work is the frequent inclusion of external cognates, and 
etyma that are represented in either Baltic or Slavic, but not in both (Petit 2004: 26). 

A review of the accumulated evidence in support of the notion of Balto-Slavic can be 
found in Stang 1966 (17-21). Shared phonological innovations include the development 
of acute and circumflex tone on non-short syllables (V, Vi/u, VR)\ Lith. karve ‘cow’ : 
SCr. b-'ava (reflex of acute tone), Lith. sausas ‘dry’ : SCr. suh (reflex of circumflex tone); 
and the development of the syllabic sonorants *r, *1, *m, *n into the sequences ir, it, im, 
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in: Lith. virsiis ‘top’ : ORuss. vbm>, vilkas ‘wolf’ : OCS vlbkb (*-i> 1-), Lith. desimtas 
‘tenth’ : OCS desgtb (*-i>n-), OPruss. insuwis ‘tongue’ : OCS jgzykb (*-i>n-) and (more 
controversially) after labiovelar stops into ur, ul, um, un (Vaillant 1950: 171): CS *g'tfdlo 
‘throat’, Lith. gurklys ‘crop’ reflect a reduced grade *gur- to the *g w erh 3 - of CS *zerti 
(OCS pozreti ‘to swallow, devour’), Lith. gerti ‘to drink’. Compare also the set OCS 
gbnati (reduced grade) ‘to chase’ : zenQ (full grade) ‘(I) chase’ and OPruss. guntwei ‘to 
chase, drive’, gunnimai ‘(we) chase’ and Lith. genii, Latv. dzgnu ‘(I) drive (cattle)’. 2 

To Stang’s examples of Balto-Slavic phonological innovations we can add the change of 
e to o before u when followed by a back vowel: Lith. tavas ‘your’, Latv. tavs, compare Gr. 
xeoq; OCS novb ‘new’, compare Gr. vsoq (Endzelin 1911: 84); the change of -eu- to -iau- 
(- ’au-) before a consonant: 3 Lith. liaudis ‘people, nation’, Latv. laudis (nom. pi.) ‘people, 
folk’ : OCS ljudbje ‘people’ (*lewd h -); Lith. liaupse (*lewb h -) ‘praise’ : OCS ljuby ‘love’. 

Innovations in nominal derivation (see Stang 1966: 18) include deverbal abstract nouns 
in - ima -: Lith. piesimas ‘drawing’ : CS *pistmo ‘writing’; abstract formations in 
Lith. draugybe ‘friendship’, Latv. draudziba : CS *druzi>ba; agent nouns in -a-io-, - e-io 
Lith. artojus ‘plowman’, OPruss. artovs : OCS ratajb ; Latv. suveja (*siuu-ei-) with an 
exact match in Russ. uieeA ‘tailor’ (Endzelin 1952: 37); agent nouns in -ik-o: Lith. siu- 
vikas, OPruss. schuwikis ‘shoemaker’ : ORuss. sbVbCb ‘tailor, shoemaker’; the agentive 
suffix represented by dialectal and OLith. -invkas (standard -ininkas), CS *-tnik'b: Lith. 
dial, arklinykas ‘stableman, groom’ : OCS dlbznnikb ‘debtor’; and the diminutive suffix 
*-Tt(i)io-: Lith. sunvtis ‘son (dim.)’ : OCS otrocistb ‘boy’. 

This list can be supplemented by denominal adjectives in -in-: Lith. kriivinas ‘bloody’ : 
OSC knvbiib (Endzelin 1952: 38); diminutives in - uk-o Latv. dgluks ‘son (dim.)’, OCS 
sviTbkb ; masculine derivations in -in- such as Lith. avinas ‘ram’ (compare avis ‘sheep’), 
Latv. dial, avins, OPruss. awins : OCS ovbin: and oblique cases of the first-person singu- 
lar personal pronoun based on a stem *me/on-, generalized from the genitive stem: Lith. 
gen. mangs, dat. man, ace. mane ; Latv. gen. manis, dat. man, ace. mani; OPruss. dat. men- 
nei : OCS gen. mene, ORuss. dat./loc. mwie; the reduced grade of the latter is matched by 
the dialectal (High) Latvian and Zemaitic Lithuanian dative mun. 

Innovations in the inflectional and derivational morphology of the verb (Stang 1966: 
18-19) include the -a formant marking a preterit/aorist stem, often accompanied by the 
reduced grade of the root: Lith. pirko (pres, perka) ‘bought’ : OCS zbda (pres, zidetb) 
‘waited, expected’; the Baltic preterit in *-e: Lith. vede Ted’, compared with the Slavic 
imperfect of the type vedeaxb ‘(I) led’; the verbal suffix Lith. -aujV- compared with 
Slavic *-ujV-, preterit Lith. -avo and Slavic -ova; and the distinctive formation of the 
present tense stem of ‘give’: OLith. duosti ‘gives’ : OCS dastb (*do-d(a)-ti). 

In addition, Baltic and Slavic share a large number of apparently exclusive lexical 
correspondences, which are not concentrated in any single semantic field. Wojciech 
Smoczynski (2001: 16-17), following Endzelin 1952 (44), finds at least two hundred, 
among them: Lith. ranka ‘hand’, Latv. riioka : OCS rgka: 4 Lith. ragas ‘horn’, Latv. rags : 
OCS rogb: Lith. liepa ‘linden’, Latv. liepa : OCS lipa: Lith. ledas ‘ice’ : OCS ledv, Lith. 
papartys ‘fern’ : CS *paporti>. These can be supplemented by the following comparisons 
from Endzelin 1952 (42ff); Lith. pirsvs (nom. pi.) ‘chest (of a horse)’ : OCS prbsi (*-i>r-) 
‘chest, bosom’; Lith. sietas ‘sieve’, Latv. siets : SCr. si to: Lith. mielas ‘nice, sweet, dear’, 
Latv. mils ‘sweet, dear’ : CS *mih>; Lith. slanka ‘woodcock’, Latv. sliioka (with anom- 
alous tone) : Russ. cjiAkw, and the precise correspondence sets Lith. varna ‘crow’, Latv. 
varna, OPruss. warne : CS *voma (whatever the interpretation of acute in this form may 
be) and Lith. varnas ‘raven’, OPruss. warnis : CS *v6rm>. 

Finally, there are a number of prepositions and preverbs with close or identical correspon- 
dences in the two language families: Lith. uz ‘at, within, instead of, in return for’, Latv. uz 
‘on, to’: CS *vr>z- ‘in return for, (preverb) up, back’; Lith.pne ‘at, with, to’, OPruss. prei ‘at, 
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with, to’ : CS *pri ‘at, with, by’; Lith. nuo, Latv. niio ‘from’, OPruss. no, na ‘onto, against, 
over’ : CS *na ‘on(to), in(to)’; Lith. dial, and OLith. iz (standard is), Latv. iz ‘out of : CS *ji>z 
‘from, out of’, with unexplained but matching initial *i-. A refinement of Balto-Slavic lexical 
comparisons, drawing on progress in dialectal data, can be found in Otkupscikov 1988. 

Perhaps the most compelling argument for a Balto-Slavic protolanguage, demonstrated 
on the basis of shared innovations, is to be found in the highly complex prosodic struc- 
tures of both language families, 5 which typically agree in accentual paradigms and lexical 
membership in these paradigms, stress placement (including the reflexes of Hirt’s Law), and 
phenomena associated with the rise of syllable tone (including the reflexes of Winter’s Law). 

The classic statement of tonal correspondences between Baltic and Slavic is reflected in 
Stang 1966 (125). In this traditional view, the correspondence set conventionally labeled 
acute, continuing Indo-European length (including length from loss of a laryngeal), shows a 
rising pitch in Slavic, Old Prussian and (together with “broken” tone) Latvian, but a falling 
pitch in (standard) Lithuanian. On the other hand, circumflex tone has a falling pitch in the 
former, but a rising pitch in Lithuanian. Stang (1966: 125) assumes a shift in tonal contours 
(“eine Umlegung von Tonverlauf ’) for Lithuanian. An attempt at resolving this anomaly is 
found in the work of Frederik Kortlandt (for example, 1985: 122ff), who has argued that 
the various tonal contours are later developments in the individual languages, and that the 
broken-tone (glottalic) acute of Latvian and the Zemaitic dialects of Lithuanian is in fact the 
prosodic continuation of an earlier laryngeal. (Baltic) circumflex tone is simply the continu- 
ation of a non-laryngeal length (lengthened grade or vowel contraction). 

An apparent lack of tonal correspondences between Baltic and Slavic can be found in 
stems belonging to the mobile accent paradigm (see below). Here the opposition of acute 
and (Balto-Slavic) circumflex was neutralized in Slavic; all such roots show “Slavic cir- 
cumflex,” which continues in Serbo-Croatian as long falling tone: SCr. glava ‘head’, ace. 
sg. glavu ± Lith. galva, galvq, Latv. galva, with acute; SCr. nag, naga ‘naked’ ^ Lith. nuo- 
gas, Latv. nudgs, with acute; SCr. zfv, ziva ‘alive’ 4 1 Lith. gyvas, Latv. dzivs, with acute. This 
development, first described by Antoine Meillet ( 1 902), is known as Meillet’s Law. 

The set of Balto-Slavic acute stems of laryngeal origin is supplemented by the reflexes 
of Winter’s Law, which likewise serves as an important argument for a Balto-Slavic lin- 
guistic unity (Derksen 2004: 82). Werner Winter (1976) showed that there is a relationship 
between instances of Balto-Slavic lengthened vowels and the distribution of stem-final 
Indo-European aspirated and unaspirated voiced stops; an original short vowel followed 
by an aspirated stop remains short in Balto-Slavic, while an original short vowel followed 
by an Indo-European voiced stop shows a lengthened (acute) vowel: 6 BS1. *med- ‘honey’ 
< *med h -, but *ed- ‘to eat’ < *ed-. Winter’s Law accounts for a number of cases that Stang 
had termed “theoretical metatony” (1970: 216), where the acute, rather than the expected 
circumflex (or short), occurs in a non-laryngeal base. The most comprehensive list of 
Balto-Slavic data supporting Winter’s Law is undoubtedly Dybo 2002. 

Baltic and Slavic also agree in showing unexpected root stress in a number of bases 
that are oxytonic in Greek and Sanskrit: Lith. dumai ‘smoke’, Latv. dumi, Slav. *dyrm> 
a.p. (a), but Skr. dhuma-, Gr. Oupoq; Latv. griva ‘mouth of a river’, CS *griva a.p. (a) 
‘mane’, but Skr. gnvS ‘neck’; Lith. pilnas ‘full’, Latv. pilns, CS *pblm, a.p. (a), but Ski'. 
purnd-; Latv. dieveris (*deh,iwer-) ‘husband’s brother’, SCr. dever, but Skr. (RV) deva 
‘(younger) brother of husband’, Gr. 8df|p [*Scnpf|p] ‘husband’s brother’. This set of corre- 
spondences was first noted by Hermann Hirt (1892: 39), who characterized it (originally 
for Lithuanian) as “eine Akzentverschiebung bei den Worten mit gestossenem Vokal in 
der Stammsilbe ...” ‘a stress shift in words with a broken (glottalic) vowel in the stem 
syllable’. This statement, known as Hirt’s Law, was revised in connection with laryngeal 
theory by V. M. Illic-Svityc (1963: 78-81); in Derksen’s updated formulation (2004: 84), 
“stress was retracted to an immediately preceding syllable containing a vocalic element 
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followed by a laryngeal.” Note that this formulation excludes the sequence *-VRH-: CS 
*golva ‘head’ and Lith. galva preserve the final stress of *golHuaH. 

Baltic and Slavic also show agreement in accentual paradigms, the identity of which 
was established for nouns by Illic-Svityc (1963), 7 who derived the tripartite system estab- 
lished by Stang (1957) for Slavic 8 from its simpler Baltic counterpart (fixed-stress and 
mobile-stress paradigms). Illic-Svityc showed that Slavic accent paradigms (a) and (b) 
are in fact in complementary distribution and can be reduced to a single non-mobile para- 
digm, comparable to Baltic fixed stress. According to Illic-Svityc, stress is retained on an 
acute syllable (a.p. a), but advances from a non-acute (short or Balto-Slavic circumflex) 
syllable onto an immediately following syllable, resulting in a.p. b. Thus, while the root 
stress of the acute base Lith. dumai ‘smoke’ corresponds to that of SCr. dim (a.p. a), the 
root stress of the short base Lith. bliisq ‘flea, ace. sg.’, is matched by end stress in Russ. 
6jioxv (a.p. b). Aside from the Slavic metatony associated with Meillet’s Law, Baltic and 
Slavic mobile bases retain their correspondence: Lith. galva ~ galvq ‘head, nom. sg. ~ 
acc. sg.’ (mobile a.p.) : Russ, zojioea ~ zonoey ‘head, nom. sg. ~ ace. sg.’ (a.p. c). 

The Baltic bipartite paradigmatic accentual system (fixed and mobile) thus represents 
an earlier state from which the Slavic system can be derived. The assignment of a particu- 
lar word to either the fixed or mobile class is, in Balto-Slavic terms, unmotivated. Accord- 
ing to Vladimir Dybo (especially 1980: 147-150), the two accent classes result from a 
phonologization of earlier sequences of morphemes of either “dominant” or “recessive” 
valencies, making up a word form or phonetic word, with a “contour rule” according to 
which word stress falls on the first morpheme of dominant valency. If a word form (or 
phonetic word) consists only of recessive morphemes, the initial morpheme is automat- 
ically stressed. The valencies in this approach are seen as continuing a tonal opposition 
(high and low tone) inherited from Indo-European. 

The derivation of the Slavic accentual model from an earlier Baltic model, noted above, 
suggests a lack of equivalence between the notions of Baltic and Slavic. Developing a point 
originally made by Endzelin (1911: 201-202), Stang (1966: lOffl) notes that while Com- 
mon Slavic presents a relatively monolithic system, Baltic is divided by a number of signifi- 
cant isoglosses, complicating the notion of Proto-Baltic. Certain of these isoglosses connect 
East Baltic (Lithuanian and Latvian) with a uniform Slavic, while others connect West 
Baltic (Old Prussian) with Slavic. The former include the Indo-European genitive singular 
*-s(y)o of o-stem nouns, apparently preserved in Old Prussian ( deiwas ‘God, gen. sg.’, and 
the pronominal stesse ‘that, gen. sg.’; cf. Slavic ceso ‘what, gen. sg.’), but replaced in East 
Baltic and in Slavic by a fonn that has ablative function outside of Balto-Slavic: Lith. rago 
‘horn, gen. sg.’, Latv. raga = OCS roga (East Baltic has extended this to the pronominal 
system as well: Lith. -kd ‘who, what, gen. sg.’, to ‘that, gen. sg.’); a present passive partici- 
ple in -m- shared by East Baltic and Slavic (Lith. vedamas ‘being led’, Latv. vgdams : OCS 
vedonrb), while Old Prussian continues an inherited -manas (in the hapax poklausnnanas 
‘being heard’); East Baltic and Slavic pronouns have -m- in the dative and locative singu- 
lar, while Old Prussian continues inherited -sm- (Lith. tam(ui) ‘that, dat. sg.’, Slav, tomu; 
OPruss. stesmu ); Lith. devyni ‘nine’, Latv. devini, Slav, devgtb, with an innovative d-, while 
OPruss. newints ‘ninth’ preserves inherited n-. Finally, the word for ‘third’ reflects a stem 
*tret- in Lith. trecias, Latv. tress, OCS tretbjb, while OPruss. tlrts agrees with Skr. trtlvah. 

Isoglosses that link Old Prussian with Slavic include the possessive pronouns OPruss. 
mais, twais, swais ‘my, your, one’s own’, which agree with OCS mojb, tvojb, svojb, as 
opposed to Lith. (manas), tavas, savas and Latvian (mans), tavs, savs; the dative singular 
of the second person and reflexive pronouns preserves *-b h - in Old Prussian ( tebbei , seb- 
bei) and Slavic ( tebe , sebe) (Stang 1966: 248; Lithuanian and Latvian have rebuilt these 
forms: Lith. tau, sau, Latv. tev, sev ); the Old Prussian preterit of ‘to be’, bei, be agrees 
with Slav, bexb, be, as opposed to the preterit stem *bi- found in Latv. dial, biju ‘(I) was’ 
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and OLith. bit(i); and the nasal-infixed Old Prussian participial forms sindats and syndens 
‘sitting’ (Catechisms I and II) can be compared with the infixed CS *s$d- ‘sit’. 

These isoglosses suggest a greater time depth for Common Baltic, “if indeed such a 
unified protolanguage ever was a reality” (Bimbaum 1970: 70-71). The resulting lack of 
equivalence between the notions of Baltic and Slavic (and the associated limitations of the 
family-tree model) is particularly emphasized by V. V. Ivanov and V. N. Toporov (1961). 
In reviewing the methodological preconditions for discussing the Balto-Slavic relationship, 
they argue that a relatively homogeneous Proto-Slavic can be derived from a considerably 
more archaic and heterogeneous Proto-Baltic linguistic model, in effect redefining the notion 
of Balto-Slavic by treating Slavic as a local development within a Baltic dialectal continuum. 

Among more distant post-Indo-European linguistic affinities proposed for Balto-Slavic, 
correspondences with Gennanic have been noted since Schleicher’s time (recall his “slawo- 
deutsch”). These include the marking of middle-intransitive voice through nasal infixation 
(a process that enjoyed particular productivity in Baltic) or nasal suffixation; the spread 
of the adjectival suffix *-isko- in a common function: Goth, -isks, OCS -bskb, Lith. -iskas 
(Stang 1942: 275-278); and case forms in -m- that correspond to *-b h - elsewhere (for exam- 
ple, OLith. dat. pi. -mus (modem -ms), OCS -mb, Goth, -m, as opposed to Skr. -bhyah, Lat. 
-bus\ Meillet 1967: 148-152). Also characteristic are exclusive lexical agreements (Stang 
1972: 79-82) across a number of semantic fields, including social relations: Lith. drau- 
gas ‘friend’, Latv. draugs, OCS drugb, Goth, driugan ‘to perform military service’; Lith. 
liaudis ‘people, nation’, OCS ljudbje ‘people’, OHG liut; Lith. valdvti ‘to govern, rule’ (3 
pres, valdo), OCS vladeti ‘rule’, Goth, waldan || technical culture (simple tools, materials, 
handicrafts, objects of daily life): Lith. aldija ‘dugout canoe’, OCS aldii, ladii ‘boat’, Norw. 
dial, olda Targe trough, often from a carved-out tree trunk’; OPruss. dalptan ‘punch (tool)’, 
RussCS dlato ‘chisel’, Russ. doji6umb ‘to chisel’, OEng. delfan ‘to dig, delve’; Lith. strele, 
strela ‘arrow, shaft’, Latv. strata ‘stripe, streak; jib, crane’, OCS strela ‘arrow’, OHG strata, 
OEng. strdel ‘arrow’ || products: Lith. vaskas ‘wax’, OCS voskb, OEng. weax; Lith. alus 
‘beer’, ORuss., RussCS olb ‘an intoxicating drink’, Sin. qI ‘beer’, ONor. qI ‘beer’ || agri- 
cultural terms: Lith. rugiai ‘rye’, OCS nzt, ONor. rugr (/-stem) || certain adjectives: Lith. 
surus ‘salty’, OCS svn ‘damp, moist’, ONor. suit ‘sour’; Lith. tingus ‘lazy, sluggish’, OCS 
tqzbkb ‘heavy, difficult’, ONor. pungr ‘heavy’ || numbers: OPruss. tiisimtons ‘thousand’, 
Lith. tiikstantis, OCS tvsgsti, tvsQSti, Goth .pusundi (which do not fully agree in formation). 
Stang ( 1 942: 278) suggests that these equivalences reflect a common preservation of inher- 
ited (perhaps northwest Indo-European) features, or represent convergent developments, 
and thus belong rather to the realm of Sprachbund phenomena. 

FURTHER READING 

Henrik Bimbaum (1970) offers a balanced summary of various approaches to Balto-Slavic. 
Daniel Petit (2004) presents a historically oriented account of the Balto-Slavic question, 
from the Renaissance to the present day; he includes the findings of archaeology and pro- 
vides an extensive list of references. Accentological issues are addressed by Rick Derk- 
sen (2004) in the same volume; an up-to-date overview of Balto-Slavic accentology can 
be found in the introduction to Derksen 2015. For a detailed presentation and evaluation 
of the data relating to Balto-Slavic, the careful and cautious studies of Christian S. Stang 
found in the references have not lost their relevance. 

NOTES 

1 Baudouin de Courtenay (1903: 330-331) had earlier rejected a Balto-Slavic unity, call- 
ing it a “scholarly fiction” (yueHaa (jwiama), but without providing argumentation. 



484 STEVEN YOUNG 


2 There are disagreements regarding the -uR- reflex of syllabic sonorants after labiovelars 
as a regular phonological change; see the discussion in Stang 1966 (77-82). Still, the fact 
of specific Baltic and Slavic agreements in the roots for ‘chase, drive’ and ‘drink, swal- 
low’ is revealing. The -i- (rather than -u-) of Lith. girtas ‘drunk’, and the infinitives Lith. 
gihti ‘to drive (cattle)’, Latv. dzit, is due to an extension of the productive ablaut pattern. 

3 Derksen (2010) notes that this change was not shared by Old Prussian. But this argu- 
ment depends on one’s interpretation of the orthographic evidence of the Old Prussian 
texts; see, for example, Schmalstieg 1976: 120. 

4 Baltic characteristically preserves the original verbal root here, lacking in Slavic: Lith. 
rihkti ‘gather’ (3 pres, rehka ), OPrass. senrinka. 

5 The role of accentology in the Balto-Slavic question was anticipated by Endzelin as 
early as 1911 (202): “aajiBH'bHmafl pa3pa6oTKa ygapeHm . . ., BhpoaTHO, enje 6onie 
ocBtTHTb 3totb BonpocB” (further developments in stress . . . will undoubtedly shed 
more light on the issue). 

6 There are a number of interpretations of Winter’s Law and its conditions of operation; 
for a review and critique of the literature, see Derksen 2002. Kortlandt (1988: 387) 
amends “lengthened vowels” in Winter’s original statement to “acute vowels,” since 
he rejects a connection between (non-laryngeal) length and acute tone. In Kortlandt’s 
view, Winter’s Law represents “the merger of the glottal element of the preglottalized 
voiced stop with the reflex of the Indo-European laryngeals, which had become a glot- 
tal stop in Balto-Slavic” (Derksen 2004: 83). 

7 This progressive stress shift was demonstrated for Slavic verbs at approximately the 
same time by V. A. Dybo (1962), and is therefore referred to as either the Law of 
Illic-Svityc or Dybo’s Law. 

8 Accentual paradigm (a.p.) a is characterized by columnar barytonic stress; a.p. b by 
columnar oxytonic stress; and a.p. c by final-initial mobile stress, which is compared 
with the oxytona of Greek and Vedic in thematic stems. 
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CHAPTER 2 


BALTIC 

Steven Young 


INTRODUCTION 

The Baltic language family is represented by Lithuanian and Latvian; a third major Bal- 
tic language, Old Prussian, became extinct by the beginning of the eighteenth century. 
Lithuanian and Latvian are traditionally grouped together as East Baltic, in contrast to 
Old Prussian (West Baltic); from a historical perspective, Lithuanian and Latvian may 
be viewed as representing core dialects within a Baltic (or Balto-Slavic) dialectal con- 
tinuum, with Old Prussian as peripheral (Toporov 2006: 20-21). The original Baltic- 
speaking territory was once much larger, extending eastward into the upper Dniepr river 
basin and beyond. 1 These Baltic populations, among them the Goljadb (Balt. *Galind-) 
of the eleventh- and twelfth-century Russian chronicles, were eventually assimilated by 
the eastern Slavs, a process continuing into modem times as former Lithuanian-speaking 
communities in what is now Belarus have become Slavicized. The paucity of surviving 
Baltic languages is somewhat compensated by the rich dialectal diversity within these 
languages; a number of dialectal features are undoubtedly due to the substratum influence 
of now-extinct Baltic languages - Curonian, Zemgalian, Jatvingian, and Selonian- about 
which little is known linguistically. An additional source of Baltic linguistic data are 
the many early (Bronze Age) Baltic borrowings into Baltic Finnic (see especially Kallio 
2008) and Volga Finnic. 

No common ethnonym for the ancestral Balts survives in the Baltic languages. The 
terms “Balt” and “Baltic” are secondary, after the name of the Baltic Sea ( mare Balticum 
first appears in the eleventh century). The use of Baltic for the language family was first 
proposed in 1845 by Konigsberg professor Georg H. F. Nesselmann in the introduction to 
his Die Sprache der alten Preussen (p. xxix). The lack of a common ethnographic desig- 
nation is suggestive of the greater time depth and diffuse nature of the Baltic languages 
in comparison with the closely related (and more monolithic) Slavic language family; 
the reality of a Baltic protolanguage has even been questioned (Bimbaum 1970: 70-71). 
Lexical differences among the Baltic languages, and even between East Baltic Lithuanian 
and Latvian, can be quite striking. For example, Latvian and Lithuanian (together with 
Old Prussian) preserve different Indo-European words for ‘son’: Lith. siinus, OPruss. 
souns (cf. OCS svrib, OInd. sunuh), but Latv. di> Is, related to Lat. films', Lith. kraujas 
‘blood’, OPmss. crauvo, krawia (cf. OCS krbVb), but Latv. asins, an Indo-European r/n 
heteroclite cognate with OInd. asrk, gen. sg. asnah ; Lith. kan’e ‘cow’ (cf. Russ, kordva), 
OPmss. curwis ‘ox’, but Latv. giiovs ‘cow’, continuing PIE *g w eh,us. 

Nevertheless, the notion of Common Baltic is justified by a number of characteristic 
innovations among the Baltic languages. These include a refashioned verb system, with 
the loss of a number distinction in the third person and the generalization of an -a (PIE 
*-o) theme vowel in the present tense; a widespread nominal stem type in -e; a number 
of common diminutive formations; an -st- formant in the present tense marking mid- 
dle/intransitive meaning; and many characteristic lexical items, among them Lith. labas 
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MAP 11.1 BALTIC TRIBES AT THE BEGINNING OF THE SECOND MILLENIUM ad 
Source: Lithuanian Encyclopedia , Boston: Lithuanian Encyclopedia Press, Inc., vol. II, 1954: 148 


‘good’, Latv., OPruss. labs', Lith. qzuolas ‘oak’, Latv. udzuols, OPruss. ansonis; Lith. 
briedis ‘stag, elk’, Latv. briedis, OPruss. bravdis; Lith. pele ‘mouse’, Latv., OPruss. pele\ 
Lith. tureti ‘to have’, Latv. turet, OPruss. turit; see Stang 1966: 7-9. 


Early texts and the emergence of standard languages 

The oldest known text in a Baltic language is the Basel Epigram, two rhymed lines in Old 
Prussian from the mid-fourteenth century (see McCluskey, Schmalstieg, & Zeps 1975). 
The oldest more substantial text is the Elbing Vocabulary, a German-Old Prussian word 
list of 802 thematically arranged nouns and adjectives, making up the final seventeen 
pages of the Codex Neumannianus, which dates from about 1400 and is itself a copy 
of an original from the early fourteenth century (Schmalstieg 1976: 68). Another, rather 
inferior list of some one hundred German items with Old Prussian equivalents is found in 
Simon Grunau’s early sixteenth-century Preussische Chronik. A sense of the morphology 
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of Old Prussian is revealed in three sixteenth-century translations of Lutheran catechisms 
from German into Old Prussian; all suffer from a rather slavish translation from the Ger- 
man, and therefore shed little light on Old Prussian syntax. The first two, published in 
1545 (the second is a “corrected” version of the first), each consist of sixteen pages, only 
about a third of which is actual Old Prussian text. The third catechism, published in 1561, 
includes German and Old Pmssian versions of Martin Luther’s Enchiridion, or small 
catechism. The Old Prussian text makes up about 54 pages of the 134-page publication 
(Kabelka 1982: 54). The third catechism is our only source of Old Prussian word stress 
(and syllable tone). 

The earliest monuments of Latvian and Lithuanian date only from the sixteenth cen- 
tury, when these languages already have a modem appearance. The earliest-known Lith- 
uanian text is a handwritten set of prayers dating from the early 1500s. Book publication 
in Lithuanian began in East Prussia (which had a substantial Lithuanian population at the 
time), in connection with the spread of the Reformation. The first book published in Lith- 
uanian is a 1547 translation of a Lutheran catechism by Martynas Mazvydas (Martinus 
Masvidius). The foreword begins with a personal appeal to the reader: “Bralei seseris 
imkiet mani ir skaitikief’ ‘Brothers, sisters, take me and read’. The language reflects 
Mazvydas’s native South Zemaitic dialect, with Aukstaitic elements. Subsequent East 
Pmssian Lithuanian publications are written in an increasingly normalized variety of the 
local West Aukstaitic dialect, codified in Daniel Klein’s 1653 Grammatica Litvanica, 
the first grammar of Lithuanian. In the Catholic Grand Duchy of Lithuania, two writing 
traditions emerged, one based on the East Aukstaitic dialect of Vilnius, and the other rep- 
resenting the Central Aukstaitic dialect of the Kedainiai region. The latter is reflected in 
Mykalojus Dauksa’s 1595 translation of Jacobus Ledisma’s popular Catholic catechism, 
and his lengthy 1599 translation from the Polish of Jakub Wujek’s collection of sermons, 
the Postilla Catholicka. Dauksa’s works are the first accented texts in Lithuanian, making 
them a valuable source for the study of Lithuanian historical prosody. The present-day 
standard language has its roots in the late nineteenth century and is based on the dialect 
of the southern West Aukstaitic region. Among the factors in the establishment of this 
variety as the national standard are the prior literary tradition of the virtually identical 
Aukstaitic dialect of neighboring East Prussia and the authority of the Lithuanian gram- 
mars of the Indo-Europeanist August Schleicher and Friedrich Kurschat, which described 
the same East Pmssian Lithuanian speech. 

The beginnings of a Latvian literary tradition also date from the Reformation. The first 
book published in Latvian is a 1585 translation of a Catholic catechism; this was soon 
followed by other religious texts, chiefly translations. The language of these early texts is 
to varying degrees influenced by the German speech of the authors, especially in syntax. 
Efforts to establish a national standard language were begun in the mid-nineteenth century; 
the Latvian traditional folksongs ( daujas ), which preserve a number of archaic features, 
served as an important source for norms. A milestone in the codification and description 
of modem Latvian was the appearance in 1 922 of Jan Endzelin’s Lettische Grammatik and 
the four- volume Latviesu valodas vardmca ( Dictionary ofLatx’ian) (1923-32), begun by 
Karl Muhlenbach (Karlis MTlenbahs) and completed and edited by Jan Endzelin. 


East Baltic dialects 

There are two major dialects of Lithuanian, the more conservative (at least in its south- 
western subtype) Aukstaitic ( aukstafciif tarme), and Zemaitic ( zemaiciif tarme; Samogi- 
tian), spoken in the northwestern quarter of Lithuania. Zemaitic presents a number of 
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structural and lexical similarities with Latvian, which may reflect the substratum influ- 
ence of Curonian, an extinct Baltic language that was absorbed in the north by Latvian 
and in the south by Lithuanian. 

The Latvian standard language is based on the central Latvian dialect ( vidus dialekts ), 
which is further divided (from west to east) into Curonian, Zemgalian, and Vidzeme 
varieties. The central dialect, together with Tamian in northwestern Curonia and along the 
northeastern coast of the Gulf of Riga, is known as Low Latvian. High Latvian (Selonian 
and Latgalian, the latter with an independent literary tradition; see Nau 2011), is found 
in the eastern third of the country. Latvian is traditionally viewed as representing a syn- 
thesis of the language of the early Latgalians ( Letthigalli , Letngola, Lotvgola of medieval 
chronicles), also known simply as Letts ( Letthi , Letti ), and neighboring closely related 
Baltic languages, now extinct, among them Zemgalian (along the Lielupe river), Curo- 
nian (in southwestern Latvia; a subtype of this dialect was spoken until the mid-twentieth 
century by the kursenieki of the Curonian Spit), and Selonian (along the middle Daugava 
river, extending into Lithuania). 


PHONOLOGY 
Baltic vowels 

The vowel system of Common Baltic 

The vowels of the Baltic languages can be derived from a Common Baltic system that, 
except for the merger of short *a and *o (shared by Germanic, Slavic, and Albanian) as *a 
(or perhaps * 3 ), reflects the vowels traditionally established for late Proto-Indo-European 
(for an alternative view, see the Kortlandt 1977 and 1985): 


TABLE 11.1 THE VOWEL SYSTEM 
OF COMMON BALTIC 



The long vowels of Common Baltic ultimately reflect sequences of vowel plus 
laryngeal, the effects of Winter’s Law (see below), and also (in the case of *e and *5) 
lengthened grade. In addition, we can establish a set of Common Baltic diphthongs con- 
sisting of *e, *a and a high vowel *i, *u (with a Balto-Slavic change of *eu > *iau, see 
p. 480), and a set of so-called mixed diphthongs, consisting of a vowel plus sonorant 
(*r, *1, *m, *n). The high-vowel mixed diphthongs continue Proto-Indo-European syl- 
labic sonorants (see p. 479f.). The long vowels and diphthongs of Common Baltic were 
further distinguished (as they are in Common Slavic) by a prosodic opposition conven- 
tionally referred to as acute and circumflex; these structure points were subsequently 
realized, for the most part, as rising and falling contour tones in the daughter languages. 

The Common Baltic vowel system is fairly well reflected in the Old Prussian of the 
Elbing Vocabulary (EV); a notable change is the raising and rounding of the low back 
vowels *a (mainly after labials and velars) to *o (spelled <o>) and *a to *5 (spelled <o> 
or <oa>; Maziulis 2004: 17). The Old Prussian catechisms, especially the third, reflect a 
number of innovations in the long vowels (to the extent that one can trust the orthography, 
which, after all, is the representation of a non-native speaker): the raising and rounding of 
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*a (and some instances of *5) to *u after velars and labials (muti ‘mother’ < *mate : EV 
mothe; mergu ‘girl’ : EV mergo, Lith. merga), the raising of *e to *1 (Jst ‘to eat’ : Lith. esti, 
Latv. e. st < *ed-; Maziulis 2004: 16) and the (inconsistent) diphthongization of acute high 
vowels: geiwan ‘life, ace. sg.’, alongside grwan, gijwan; bouton, bauton ‘to be’ alongside 
buton. Distinctive length was lost in unstressed syllables (Maziulis 2004: 14). 


The vowel system of East Baltic 

The somewhat skewed vowel system of Common Baltic, in which *5 (with a close artic- 
ulation: *5) lacks a short counterpart, emerges as a more balanced system in East Baltic. 
Here, the apparently quite open inherited pair *e, *e was reinterpreted as the front coun- 
terpart of *a, *a; the resulting gap in the system was filled by a new long front mid-vowel 
with a close articulation, *e (parallel to *o), which developed under uncertain conditions 
from the diphthongs -ei- and -ai-, preserved in Old Prussian (for a detailed discussion, 
see Karaliunas 1987: 152-179); compare Lith. sniegas ‘snow’, Latv. sniegs and OPrass. 
snavgis; Lith. dievas ‘God’, Latv. dievs and OPrass. deiwas, as well as the Baltic Finnic 
borrowings Finn, taivas ‘sky, heaven’, Est. taevas\ Finn, heina ‘hay’ < CBalt. *seTna-, 
Lith. sienas. The new *e, together with its back counterpart *o, gave rise to the East Bal- 
tic “gliding” diphthongs, ie, uo (approximately [is], [us]). 2 

TABLE 11.2 THE VOWEL SYSTEM 
OF EAST BALTIC 

*e (> ie) *o (> uo) 

*e, *e *a, *a 


This realignment of the East Baltic system of long vowels was reflected in ablaut alter- 
nations, which were still productive at this period. Rather than the inherited alternation of 
an e-grade opposing o-grade, the productive pattern now becomes e ~ a, parallel to the e ~ 
a that continues Indo-European e ~ o; cf. Lith. sodinti ‘to plant’ (*sad-), the causative to 
*sed- ‘to sit’. Lith. suodvs ‘soot’, Latv. suodreji, with old o-grade *s5d- to *sed- ‘to sit’, 
and Lith. tiodas ‘gnat, mosquito’, Latv. udds, with old d-grade *od- to *ed- ‘to eat’, are 
isolated relic forms. 


The vowel system of Lithuanian 

The above vowel system underlies later individual developments in Lithuanian and Lat- 
vian and their dialects. In both Lithuanian and Latvian, the high vowels and the short 
vowels have remained fairly faithful to the Common East Baltic system. Characteristic 
of the development of the Lithuanian vowel system was the repeated raising of long low 
vowels, with new low vowels emerging to take their place. The inherited long low *e and 
*a acquired a more close articulation and rose to fill the mid-vowel gaps created by the 
diphthongization of earlier *e and *5: ejo ‘went’ < *eia (the new, close mid-vowel e is 
spelled <e> ). The vacated slots were in turn filled by new long low vowels (the e, e are 
quite open and can be represented as e, ce:), resulting from the denasalization of nasalized 
vowels; the latter arose from sequences of a short vowel plus tautosyllabic n before a con- 
tinuant or in final position. Since this development post-dates the appearance of printed 
texts, the orthography reflects nasalization, denoted by a hook under the corresponding 
vowel character: zqsis [-a:-] ‘goose’, tqsti [-£:-] ‘to continue’, sivj.sU [-u:-] ‘to send’, / 
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[-1:-] ‘into’. The functional load of the new a and its e counterpart was augmented by the 
results of a lengthening under stress (with concomitant circumflex tone) of most non-final 
short *e and *a; vowel length here is not indicated orthographically: ledas [T'tLdas] ‘ice’, 
vakaras ['va:karas] ‘evening’ (compare Latv. Iqdus, vakars, which preserve short e and 
a). There are exceptions to this general lengthening: infinitive roots (and derived forms) 
with a short low vowel remain short: vesti ‘to lead’, rasti ‘to find’, vesk! ‘lead!’, rask! 
‘find!’, vesiu ‘(I) will lead’, rasi ‘(you) will find’, as do the possessive pronouns mano 
‘my’, tavo ‘your’, savo ‘his/her/its/one’s’, leading to the possibility of minimal pairs such 
as mano ‘thinks’ : mano ‘my’. 

After palatalized consonants and j, -a- and -a- merge with -e-, -e-, respectively: gilids 
‘deep, ace. pi. f.’, and giles ‘acorn, ace. pi.’, are homophonous: [g j Tl>es]), as are gilq 
‘acorn, ace. sg.’, and giliq ‘deep, ace. sg. f.’: ['g'll'ae:] (Girdenis 2003: 194). 

The above changes result in the following vowel system for the Lithuanian standard 
language, represented in a broad phonetic transcription (corresponding graphemes are 
given in brackets): 

TABLE 11.3 THE VOWEL SYSTEM OF LITHUANIAN 

i <i>, i: <y, f> u <u>, u: <u, if> 

e: <e> o: <o> 

e <e, io>, x: <e, ia (under stress); e, iq> a <a>, a: <a (under stress); q> 


This system has been rounded out by new short mid-vowel counterparts of e and o, but 
these (in particular e) may be considered marginal phonemes, characteristic of borrowed 
words. 

This inventory of Lithuanian vowels is supplemented by the “gliding” (sutaptiniai) 
diphthongs ie and uo (ia, us), which arose from East Baltic *e (< *ei) and *6 (< *5) and 
function as long monophthongs, and by the pure diphthongs au, ai, ei, ui, and the mixed 
diphthongs, composed of a short vowel plus tautosyllabic sonorant: i, u, e, a + r, l, m, n. 


The vowel system of Latvian 

The Latvian vowel system generally reflects that of Common East Baltic, apart from 
a split in the quality of the low front vowels, originally conditioned by the nature of a 
following vowel: e, e acquired a more close articulation before front vowels and palatal 
consonants; otherwise, they retained a more open articulation. Subsequent phonological 
and morphological processes rendered this split phonemic: nesu ['naesu] ‘(I) carry’ : nesu 
['nesu] ‘(I) carried’. The distinction between open and close e, e is not reflected in Latvian 
orthography, although in linguistic texts the open e, e is often denoted by f, #: the above 
“ nesu ” would thus be disambiguated as nqsu ['nassu] : nesu [’nssu]. Long vowels are 
denoted in standard orthography with a macron. In linguistic works that include informa- 
tion on tone, the macron is replaced by one of the diacritics for tone; thus, the standard 
spelling rit is a homograph for rit ‘to swallow’ and rit ‘tomorrow’. 

TABLE 11.4 THE VOWEL SYSTEM 
OF LATVIAN 


i, i: 

E, e: 


*, s: 


a, a: 
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The functional load of the long high vowels f and it and the gliding diphthongs ie, uo 
(*e < *ei, *ai; *5) has been augmented by the results of a denasalization of the tautosyl- 
labic sequences *in, *un; *en, *an, respectively: mit ‘to tread’ : Lith. minti;jutu ‘(I) feel’ 
: Lith. juntu; pieci ‘five’ : Lith .penki; riioka ‘hand, arm’ : Lith. ranka. Occasional excep- 
tions to this process are generally assumed to reflect substratum borrowings from the 
now-extinct Curonian, which did not undergo the change: menca ‘cod’, dzintars (along- 
side dzitars ) ‘amber’, rihda ‘row, line’. 

The inventory of Latvian syllabics includes the pure diphthongs, among them ei, 
ai, au; the gliding diphthongs ie and uo, which function (as they do in Lithuanian) as 
long monophthongs (uo is spelled <o> in standard orthography but retained in linguis- 
tics texts, especially when syllable tones are marked), a practice adopted in this chapter: 
riioka ‘hand, arm’, otherwise spelled i vka); and the mixed diphthongs, composed of a 
short vowel plus tautosyllabic sonorant, although most instances involving tautosyllabic 
-«- will have become high vowels or gliding diphthongs (see above). 

Characteristic of Latvian is the tendency to shorten final syllables in disyllabic and 
polysyllabic words. 3 Word-finally, all short vowels, with the exception of -«-, disappear: 
vilks ‘wolf’ : Lith. vilkas; ass ‘axle’ : Lith. as is; met ‘throws’ : Lith. meta ; but (with -«-) 
m^dus ‘honey’ : Lith. medus. Long vowels and diphthongs are shortened, with ai, ei, ie 
reduced to i, and au, uo to u\ riiokas ‘hands, arms’ : Lith. raiikos (*-as); lacis ‘bear’ : Lith. 
lokvs (*-Ts); saki ‘(you) say’ : Lith. sakai; saku ‘(I) say’ : Lith. sakau; mgdus ‘honey, gen. 
sg.’ (thus homonymous with nom. sg. modus') : Lith. medaus. Long vowels and diph- 
thongs are preserved in monosyllabic forms: ta ‘that, gen. sg. m.’ : Lith. to, tai ‘that, dat. 
sg. f.’ : Lith. tai; tie ‘that, nom. sg. m.’ : Lith. tie. 


Prosodic structure 

The prosodic evidence of the Baltic languages (together with that of Slavic) provides cru- 
cial information for reconstructing aspects of the phonological and accentual systems of 
Proto-Indo-European. In Common Baltic, as noted above, all long syllabics (long vowels 
or diphthongs, including sequences of vowel plus tautosyllabic sonorant) were character- 
ized by a prosodic opposition of acute and circumflex, which have varied reflexes in the 
daughter languages. 


Tonal oppositions in Lithuanian 

In standard Lithuanian, acute and circumflex are distinguished on stressed long vowels 
and diphthongs. Acute is realized as a falling tonal contour (in a mora analysis, the first 
mora is stressed), while circumflex is level or slightly rising (stress on the second mora of 
a long syllable). The tonal opposition is perceptually clearest on pure diphthongs: (acute) 
sd.uk! ‘shoot!’ : (circumflex) sauk! ‘shout!’ The tones are indicated in reference works by 
diacritics: acute by ' (placed over the first element of a diphthong) and circumflex by ~ 
(placed over the second element of a diphthong). Short stressed vowels, which do not dis- 
tinguish tone, are marked in reference works with a grave accent (’); the grave is also used 
(in place of an acute diacritic) over a high vowel in an acute mixed diphthong: -ir-, -ill-. 

The tonal contours of the standard language may be viewed as representing a point in 
a continuum of dialectal tonal variation. In most of Zemaitija, particularly the northwest, 
acute is a broken tone ( lauztine priegaide [”]), a rising-falling contour interrupted at its 
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peak by a glottal stop; in eastern Zemaitija and neighboring western Aukstaitic areas it 
weakens to the so-called Stosston ( stumtine piiegaide ), which has a sudden rise followed 
by a falling contour. Further east, the acute loses its characteristic “Stoss” to become a 
simple falling tone. This is the variant of acute that has entered the linguistic literature 
(through the work of Friedrich Kurschat, who was the first to provide a description of 
the Lithuanian tones, based on his own East Prussian Lithuanian dialect) and is consid- 
ered normative. Circumflex has a weak rising-falling contour in Zemaitija and western 
Aukstaitija, which, in moving east, “flattens out” on long monophthongs to simple length 
(protracted tone, tgstine piiegaide). 

Unlike the Zemaitic dialects, where the peak of intensity for both acute and circumflex 
diphthongs occurs in the initial part of a syllable, Aukstaitic acute has its peak of intensity 
on the first element of a diphthong (which, if a low vowel, is lengthened: kelti ‘to raise’ 
[ kJEe litk], while the circumflex reaches its peak on the second element of a diphthong, 
resulting in contour tones that are more stress- (or mora-) based than pitch-based; this 
is reflected in the Lithuanian terms for the tones: tvirtapra.de (piiegaide) “strong-initial 
(tone)” for acute and tvirtagale (piiegaide) “strong-final (tone)” for circumflex. 


Saussure s Law and Leskien ’s Law 

While the acute and circumflex tonal opposition in modern Lithuanian is distinguished 
only on stressed syllables, tones (or their prosodic antecedents) must once have charac- 
terized all long syllables, stressed and unstressed. This is shown by an earlier advance- 
ment of the ictus from a short or circumflex syllable to an adjacent acute syllable that had 
previously been unstressed: (nominal inflection) ace. sg. raiikq ‘hand, arm’, but nom. 
sg. ranka (*-a); (nominal derivation) kiaule ‘pig’ + -ien-a ‘meat of x’ > kiauliena ‘pork’; 
(verbal inflection) losia ‘plays’ : losiii (*-uo) ‘(I) play’; (verbal derivation) laiko ‘holds, 
keeps’ + -y-ti (verb class marker, infinitive suffix) > laikvti ‘to hold, keep’. This progres- 
sive shift in ictus onto a following acute syllable was first recognized and described by 
Ferdinand de Saussure (Saussure 1896: 157) and is therefore known as Saussure ’s Law. 4 
In final syllables, the results of the operation of Saussure ’s Law were obscured by a late 
post-dialectal process known as Leskien’s Law, according to which (for disyllabic and 
polysyllabic words) word-final acute syllables (including ie and uo) were shortened, 
forfeiting tone: *ranka > ranka. The gliding diphthongs ie and uo were replaced by i 
and w; compare the long-form (definite) adjectives nom. pi. m. geiieji ‘good’, instr. sg. 
m. geruoju, with the short forms geri, gerii, which show the effects of Leskien’s Law. 
Diphthongs did not shorten but underwent metatony (change of tone) from acute to cir- 
cumflex: *matau > matau ‘(I) see’ *matai > matai ‘(you) see’. The third-person future 
forms are exceptional in that non-high long monophthongs in both monosyllabic and 
polysyllabic words retain length, with circumflex metatony: duds ‘give, 3 sg. fut.’ : inf. 
duoti; des ‘put, 3 sg. fut.’ : inf. detv, zinos ‘know, 3 sg. fut.’ : inf. zinoti. Monosyllabic 
third-person future forms with acute high vowels were shortened: bus ‘be, 3 sg. fut.’ : 
inf. buti; lis ‘rain, 3 sg. fut.’ : inf. lyti. 

As a result of Saussure ’s Law, the fixed and mobile nominal accent paradigms inher- 
ited from Common Baltic each underwent a split into two paradigms, resulting in the 
four accent paradigms of modem Lithuanian: a.p. 1: fixed stress (non-stem-final ictus, or 
stem-final acute), a.p. 2: fixed stress with an overlay of Saussure ’s Law (stem-final short 
or circumflex), a.p. 3: mobile stress (non-stem-final ictus, or stem-final acute), and a.p. 4: 
mobile stress plus an overlay of Saussure ’s Law (stem-final short or circumflex). 
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TABLE 11.5 LITHUANIAN ACCENT PARADIGMS FOR/I-STEM NOUNS 


singular 

a.p. 1 

a.p. 2 

a.p. 3 

a.p. 4 

nom. 

vyras ‘man’ 

pirstas ‘finger’ 

langas ‘window’ 

vilkas ‘wolf’ 

gen. 

vyro 

pirsto 

lango 

vilko 

dat. 

vyrui 

pirstui 

langui 

vilkui 

acc. 

vyrq 

pifstq 

langq 

vilkq 

instr. 

vyru 

pirstu 

langu 

vilkii 

loc. 

vyre 

pirste 

lange 

vilke 

voc. 

plural 

vyre 

pirste 

lange 

vilke 

nom. 

vyrai 

pirstai 

langai 

vilkai 

gen. 

vyrq 

pirstq 

langq 

vilkq 

dat. 

vyrams 

pirstams 

langams 

vilkams 

acc. 

vyrus 

pirstus 

langus 

vilkus 

instr. 

vyrais 

pirstais 

langais 

vilkais 

loc. 

vyruose 

pirstuose 

languose 

vilkuose 


Tonal oppositions in Latvian 

In Latvian, tonal oppositions are found on (long) initial syllables, since initial stress has 
been generalized under the influence of neighboring Baltic Finnic languages (Stang 1966: 
46). As in Lithuanian practice, the tones are indicated with diacritics in linguistic refer- 
ence works; otherwise, they are not represented. Three phonemic tones are distinguished 
in the prosodically more conservative dialect of central Vidzeme: sustained (or weakly 
rising) tone ( stiepta intonacija, denoted by ”, which in diphthongs is placed over the 
second element): mate ‘mother’ (Lith. mote), saule ‘sun’ (Lith. saule), tilts ‘bridge’ (Lith. 
tiltas); broken tone 5 ( lauzta intonacija, denoted by ", which in diphthongs is placed over 
the second element): rits ‘morning’ (Lith. jytas), galva ‘head’ (Lith. galvq, ace. sg.), but 
‘to be’ (Lith. buti); and falling tone ( kritosa intonacija, denoted by ', which is placed over 
the first element of a diphthong): riioka ‘hand, arm’ (Lith. raiikq, ace. sg.), draugs ‘friend’ 
(Lith. draugas), pirkt ‘to buy’ (Lith. pirkti). Tonal minimal pairs (or triplets) include dust 
‘to dawn’ : aust ‘to weave’; rit ‘to swallow’ : tit ‘tomorrow’; ta ‘that, nom. sg. f.’ : ta 
‘that, gen. sg. m.’ : ta ‘thus’; mit ‘to tread’ (Lith. mind), nut ‘to change, exchange (arch., 
poetic)’, mit (inf. mist) ‘lives, dwells’ (Lith. mint a ‘feeds on, nourishes oneself’). 

Most Latvian dialect areas distinguish only two of these three tones. High Latvian 
has lost the distinction between falling and sustained tone in favor of falling; outside 
of High Latvian, broken and falling tone merge as either broken (in Kurzeme and Zem- 
gale) or falling (Vidzeme), opposing sustained tone. The classic dictionary of Latvian, 
Mulenbachs-Endzelins (Mulenbachs 1923-32), uses a superscript 2 to indicate forms 
taken from dialect areas in which only two tones are distinguished. 


Endzelin s Law 

It follows from the above comparisons that Latvian sustained and broken tone both cor- 
respond to Lithuanian falling tone. In what has since become known as Endzelin’s Law, 
Jan Endzelin (1899: 267-268; 1922: 22) demonstrated that Latvian sustained tone corre- 
sponds to Lithuanian falling tone in a fixed-stress paradigm (a.p. 1): Latv. bralis ‘brother’ : 
Lith. brolis; Latv. berzs ‘birch’ : Lith. berzas, while Latvian broken tone corresponds to 
Lithuanian falling tone in a mobile paradigm (a.p. 3; that is, where there is an alternation 
between final and initial stress throughout the forms of a paradigm): Latv. dzivs ‘alive’ : 
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Lith. gvvas, gvva (nom. sg. m./f.); Latv. sirds ‘heart’ : Lith. sirdis, sirdj (nom./acc. sg.). 
The remaining Latvian tone, falling, corresponds to Lithuanian rising tone (circumflex) 
in both fixed (a.p. 2) and mobile (a.p. 4) paradigms: Latv. riioka ‘hand, arm’ : Lith. rankd, 
rankq (nom. sg., ace. sg.; a.p. 2); Latv. ziioss ‘goose’ : Lith. zqsis, zqsj (nom. sg., ace. sg.; 
a.p. 4); Latv. draugs ‘friend’ : Lith. draugas (a.p. 4); Latv. dzimt ‘to be bom’ : Lith. gimti. 

A phonetic motivation for the twofold reflex of Baltic acute in Latvian was first pro- 
posed by Richard Ekblom (1933: 69), who connected the development of Latvian broken 
tone with stress retraction onto an initial syllable. If this syllable was acute (which he 
assumed to be rising tone), it underwent an abrupt rise in pitch under new stress, leading 
to a “break” (“Umbruch”) in the syllable, represented by glottal closure and release. But 
an association between retracted stress and broken tone presents a number of difficulties 
and contradictions, which are avoided if broken tone, rather than sustained (or rising) 
tone, is taken to be primary; see Young 1994. 

As a result of the distribution associated with Endzelin’s Law, Latvian occasionally 
preserves more information about the original fixed or mobile accent paradigm of an 
acute base than Lithuanian does. For example, the evidence of Latvian is needed in order 
to establish the original accentual class of disyllabic adjectives, since Lithuanian has 
generalized the mobile accent paradigm here: Latv. baits ‘white’, ilgs Tong \ pilns ‘full’ 
indicate original fixed stress, while Lith. baltas, -a; ilgas, -a; pilnas, -a are secondarily 
mobile (a.p. 3). Latvian also continues an original fixed or mobile accent paradigm in 
infinitive forms, where Lithuanian provides no direct information: augt (Lith. augti) ‘to 
grow’, begt (Lith. begti ) ‘to run’, duot (Lith. duoti) ‘to give’ point to original mobile 
stems, while drazt (Lith. drozti) ‘to carve’, set (Lith. seti) ‘to sow’, saut (Lith. sauti) ‘to 
shoot’ reflect original fixed stress. 


Tonal oppositions in Old Prussian 

The Old Prussian Enchiridion also points to a tonal distinction between acute and cir- 
cumflex. Assuming the traditional view that the macron that appears on vowel graph- 
emes indirectly reflects stress (the macron marks long vowels, which were preserved 
only under stress), the representation of diphthongs with either the first or second ele- 
ment marked by a macron would suggest a falling or rising tonal contour (Fortunatov 
1880: 153ff.; for a discussion of tones and tonal marking in Old Prussian, see Derksen 
1998). Old Prussian examples reflecting Baltic circumflex include eit ‘goes’ : Lith. eiti 
‘to go’; laiku ‘(they) hold to, keep’ : Lith. laiko\ ausins ‘ear, ace. pi.’ : Latv. auss, Lith. 
ausj (ace. sg.). Among examples of Baltic acute are kaulins ‘bone, ace. pi.’ : Lith. kaulus 
(ace. pi.); pogaut ‘to receive, get’ : Lith. pagauti ‘to catch’; ain an ‘one, ace. sg. f.’ : Lith. 
vlenq (ace. sg.). Moreover, acute (but not circumflex) T and u often diphthongize to ei, 
ou (au), allowing for placement of the macron on the second element of the diphthong 
(eT, ou): bout ‘to be’ : Latv. blit, Lith. buti\ soiins ‘son’ : Lith. sunq (ace. sg.); geiwans 
‘alive, ace. sg.’ : Lith. gvvas. In the Elbing Vocabulary, a number of fonns containing a 
diphthong that have circumflex counterparts in East Baltic show apparent lengthening 
of the first element, suggesting falling tone for circumflex: doalgis < *dolgis < *dalgis 
‘scythe’ : Lith. dalgis; moasis ‘bellows’ < *mo(i)sis < *ma(i)sis : Lith. maisas ‘bag’ 
(Maziulis 2004: 14). 


Interpretation of Baltic tonal correspondences 

We arrive at the following sets of tonal correspondences for the Baltic languages (the 
diphthong au here represents all long syllabics): Latv. au (falling tone) = OPruss. au 
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(falling tone) = Lith. au (rising tone) for Baltic circumflex; and Latv. au/au (rising or 
broken tone) = OPruss. af (rising tone) = Lith. au (falling tone) for Baltic acute. These 
equations, supported by the evidence of Slavic (as traditionally presented), led to the 
assumption of a falling pitch contour for circumflex and a rising pitch contour for acute in 
Proto-Baltic (and Balto-Slavic). Lithuanian was assumed to have undergone an inversion 
of tonal contours (see p. 481). 

This statement of tonal correspondences has served as a traditional point of departure 
in discussions of Balto-Slavic tonogenesis. But acoustic studies by Aleksas Girdenis and 
Antanas Pakerys have shown that the Lithuanian tones (on diphthongs) differ more in the 
nature of the first component of the diphthong than in a prosodic contrast between the 
first and second components (Girdenis 2003: 288). Listening experiments using reverse 
recordings have shown that speakers of both Zemaitic and West Aukstaitic dialects of 
Lithuanian distinguish tones reproduced in the reversed direction just as well as original 
tones (Girdenis 2003: 273), which would not be expected if the primary exponent of tone 
was a rising or falling pitch contour. An acoustic characterization of the tones of prosod- 
ically conservative (North) Zemaitic Lithuanian is instructive here: for acute, acoustic 
energy is concentrated at a single point in the syllable nucleus and changes abruptly; 
circumflex is characterized by a lack of concentrated energy. A pitch contour in acute 
syllables can be explained as a side effect of the glottalization associated with broken 
tone (Vaillant 1936: 114-115, 1950: 244—245; Girdenis 2003: 272). This suggests that it 
is the broken (glottalized) tone of the Zemaitic Lithuanian dialects and Latvian that is the 
original reflex of Proto-Baltic (Balto-Slavic) acute. 


Baltic consonants 

The Proto-Baltic consonant system 

The Proto-Baltic consonant system, which is essentially identical to that of Balto-Slavic 
(except perhaps for the Balto-Slavic reflex of the PIE palatovelars as s and z), 6 may be 
represented as follows: 

TABLE 11.6 THE PROTO-BALTIC CONSONANT SYSTEM 


labial dental alveopalatal velar 


stops 

voiceless 

voiced 

fricatives 

voiceless 

voiced 

nasals 

liquids 

glides 


P 

b (PIE *b, b») 


m 


w 


t 

d (PIE *d, d h ) 


s 

n 

1, r 


s 

z (PIE *g, g h ) 


y 


k 

g (PIE *g, g h ; *g w , g wh ) 


Although the reflexes of Indo-European aspirated and plain voiced stops have merged 
as plain voiced segments, the original distribution is maintained in an adjacent syllabic: 
those preceding an original plain voiced stop are lengthened (if short monophthongs) and 
acquire acute tone. This process, known as Winter’s Law, is illustrated by the following 
sets of correspondences: 7 
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• PBalt. *b < PIE *b h , with circumflex base: Lith. zembti ‘to cut, hew’, 3 sg. pres. 
zembia: PIE *gemb h - ‘to snatch, bite’, LIV 2 162; 

• PBalt. *b < PIE *b, with acute base: Lith. obuolvs ‘apple’, acc. sg. obuolj; Latv. 
abuols: PIE *h,eb5l; 

• PBalt. *d < PIE *d h , with short base: Lith. vesti ‘to lead’, 3 sg. pres, veda; Latv. xyd: 
PIE *wed h - ‘to lead’, LIV 2 659; 

• PBalt. *d < PIE *d, with acute base: sklesti ‘to dilute’, 3 sg. pres, skledzia, Lith. 
dial, skaidrus ‘clear’; Latv. skaidrs : PIE *sk h eyd- ‘to split, separate, tear’, LIV 2 
547-548; 

• PBalt. *g < PIE *g h and *g wh , with circumflex base: Lith. alga ‘wages, pay’, acc. sg. 
algq\ Latv. alga\ OPruss. gen. sg. algas: PIE *h,elg wh - ‘to earn, yield (revenue)’, LIV 2 
263; 

• PBalt. *g < PIE *g and *g w , with acute base: Latv. raugs ‘pupil (eye)’, raudzit ‘to see’: 
Gr. (Hesychius) pouyoq : upoaccmov; 

• PBalt. *z < PIE *g h , with circumflex base: Lith. liezti ‘to lick’, 3 sg. pres, liezia; Latv. 
(with o-grade) laizit. PIE *leyg h - ‘to lick’, LIV 2 404; 

• PBalt. *z < PIE *g, with acute base: Lith. melzti ‘to milk’, 3 sg. pres, melzia : PIE 
*h,melg- ‘to milk’, LIV 2 279. 


Centum/satsm in Baltic 

While the Baltic reflexes of PIE *k, *g, *g h are generally alveopalatal fricatives (a satsm 
development: Lith. simtas ‘hundred’ < PIE *kmtom), there are occasional examples in 
which they continue as pure velars (a centum treatment). There are interesting correla- 
tions with Slavic, in which one language family or the other shows the exceptional cen- 
tum development: Lith. seiva, Latv. saiva ‘bobbin’ : CS *koiua in Russ, ifeem; Lith. serti 
‘to feed’ : CS *korim> ‘fodder’; Lith. zvaizde (standard zvaigzde) ‘star’ : CS *gvezda; but 
with the opposite development in Lith. klausyti ‘to listen’, 3 sg. pres, klauso : OCS slvsati 
‘to hear’ (PIE *klews- ‘to hear, listen’, LIV 2 336). Phonetic motivations have been sought 
for such exceptions, but these would still not account for lexical doublets such as Lith. 
akmuo ‘stone’ : asmuo (standard asmenys, pi.) ‘blade’, Lith. kiemas (< *keimas) ‘yard’ : 
seima ‘family’, Lith. kaukti ‘to howl’ : saukti ‘to shout’. 


The RUK1 rule in Baltic 

Another source for Proto-Baltic *s is the historical process known as Pedersen’s Law or 
the ruki rule: the change of -s- to -s- after k, r, or the high vowels T, u, operating in Baltic 
(unlike Slavic) before consonants as well as vowels. In Baltic, the change is regular after 
-r-: Lith. virsus ‘top’ (cf. ORuss. Vbrx-b), Lith. pirstas ‘finger’ (cf. ORuss .pbrstb, without 
the change). Reliable examples with -k- are difficult to find, although the frequent appear- 
ance of secondary -ks- makes it likely that the process took place here as well (Stang 
1966: 96). Lithuanian generally preserves -s- after the high vowels: sausas ‘dry’ : OCS 
suxb; ausis ‘ear’ : OCS tao; miisos ‘moss’ : OCS nnxb; visas ‘all’ : WSlav. vs-, ORuss. 
(Novgorod) vbxv, viesulas ‘whirlwind’ : RussCS vixbr b, Lith. dial, trisii ‘three, loc.’ : 
OCS trbXb, OInd. trisu. But there are a few examples that in fact reflect the rule: ausra 
‘dawn’ (but dial, austra < *ausra), vetusas ‘old (arch.)’ : OCS vetbXb, maisas ‘bag’ : OCS 
mexb,jiise ‘fish soup’ : Russ. uxa. This lack of consistency may result from Baltic being 
on the periphery for the diffusion of this areal feature (Stang 1966: 99). 
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Later Baltic reflexes of the palatovelars 

Lithuanian alone among the Baltic languages preserves the reflexes s and z of the Indo- 
European palatovelars and, for s, the output of the RUKI rule. In Latvian and Old Prus- 
sian (as in Slavic), these have become s (merging with inherited s) and z: Lith. suo (PIE 
*kwb-) ‘dog’ : Latv. suns (PIE *kun-); Lith. simtas (PIE *kmto-) ‘hundred’ : Latv. stmts', 
Lith. zeme (PIE *d h g h em-) ‘earth’ : Latv. zeme, OPruss. semme (*zeme). But the alveo- 
palatal fricatives must once have been more widespread, since early Baltic borrowings 
into Finnish (and other Baltic Finnic languages) show the reflex of *s and *z, continuing 
in Finnish as h: Finn, hammas ‘tooth’ : Lith. zambas ‘sharp edge; sharp object’, Latv. 
ziiobs ‘tooth’ (PIE *gomb h -o-); Finn, heme ‘pea’ : Lith. zirnis, Latv. zirnis (PIE *grHno- 
‘grain’); Finn, lohi ‘salmon’ : Lith. lasis, lasisa, Latv. lasis (PIE *loks-os-). 


Sequences of consonant + *i 

Proto-Baltic sequences of consonant plus *i before a back vowel result in various types of 
palatal or dental assimilations in the Baltic languages, which in turn give rise to morph- 
ophonemic alternations at stem boundaries. Where there is no assimilation, Latvian 
preserves the original in the sequence, spelled </>: gulbja ‘swan, gen. sg.’, while 
Lithuanian develops a palatalized consonant (orthographically, <-Ci->), with secondary 
fronting of a low back vowel: siuti ['shLEi] ‘to sew’, laukia [ ld uk%] ‘waits’, labiau 
[la'b J eu] ‘more’, siaure ['Jlte ure:] ‘north’ (*siaur- < *seur- < *keh,w(e)r-, cf. CS *sever-). 
The i in sequences with a labial stop is preserved in word-initial position in Lithuanian: 
pjauti ['pjarut'i] ‘to cut, mow’, bjaurits [bjeu'rus] ‘ugly’. Latvian, like Slavic, intro- 
duces an epenthetic palatal / (spelled </>): p\aut, bjaurs ‘angry, evil’. 

Sequences of dental stop plus i, preserved in Old Prussian, perhaps as palatalized stops 
(cf. median ‘woods’ : Lith. medzias ‘tree’, Latv. mezs ‘woods’), develop in Lithuanian 
into the alveopalatal affricates [tp, dtjJ], orthographically <c(i)>, <dz(i)>: cia (*tia) ‘here’, 
medzias (*medias) ‘woods’; dziuti (*diu-) ‘to dry’. The corresponding reflexes in Latvian 
are the alveopalatal fricatives s andz: latviesa (*-tia) ‘Latvian, gen. sg.’; mezs (*medias) 
‘woods’ : Lith. medzias', zut (*diu-) ‘to dry’ : Lith. dziuti. The sequence *si before a back 
vowel yielded s in Latvian: silt (*sitit-) ‘sew’ : Lith. siuti, as it apparently did in Old Prus- 
sian: schuwikis ‘shoemaker’, cf. Lith. siuvikis ‘tailor’. Alveopalatal fricatives also result 
from *si and *zi combinations in which the s, z continue the Indo-European palatovelars: 
eza (*ezia < *ezia) ‘hedgehog, gen. sg.’ (nom. sg. ezis) : Lith. ezio (nom. sg. ezys). 

Proto-Baltic sequences of velar stop plus i before a back vowel, as well as velar stop 
before a front vowel, resulted in fronted (palatalized) velars in Lithuanian, but assibilated 
to dental affricates (orthographically, <c>, <dz>) in Latvian. Sequences of velar stop plus 
i: Latv. caune : Lith. kiaune ‘marten’, acu ‘eye, gen. pi.’ : Lith. akitp, rudzi ‘rye, nom. 
pi.’ : Lith. rugiai; velar stop before a front vowel: Latv. acs ‘eye’ : Lith. akis, Latv. lacis 
‘bear’ : Lith. lokvs, Latv. dzimt ‘to be born’ : Lith. girhti. A secondary palatalization of the 
new dental affricates can be found in masculine -w-stems, where a stem-final -c- or -dz- 
merged with a restored to produce a new c, dz: Idea (< *lacia) ‘bear, gen. sg.’ (nom. 
sg. lacs) '. Lith. lokio (nom. sg. lokvs). Note also Latv. caula ‘hull, husk; shell’ compared 
to Lith. kevalas ‘shell’; its historical development serves as a recapitulation of some of 
the sound changes presented here: *kevala > *cevala (assibilation of a velar stop before 
a front vowel) > *ceula (vowel syncopation and concomitant formation of a falling-tone 
diphthong) > *ciaula (*eu > *iau as a persistent rule in East Baltic, dating from the Bal- 
to-Slavic period) > caula (palatalization of ci to c). 
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Dissimilation in clusters of dental stops 

Sequences of dental stops continue in Baltic (as in Slavic) as fricative plus stop clusters: 
Lith. mes-ti (3 sg. pres, met-a), past pass. ptcp. mes-tas (*met-t-) ‘to throw’ = Latv. mes-t 
(3 sg. pres. mqt), past pass. ptcp. mes-ts ; Lith. ves-ti (3 sg. pres, ved-a), past pass. ptcp. ves- 
tas (*wed-t-) ‘to lead’ = Latv. ves-t (3 sg. pres, vqd), past pass. ptcp. ves-ts = OPruss. wes-t. 


Dissimilation in clusters of dental stop + l 

A dissimilation of the dental in the clusters -tl-, -dl- to a velar i-kl-, -gl-) is found in 
East Baltic and, in part, in Old Prussian: Lith. zenklas ‘sign’ (with -kla- < *-tla-), but 
OPruss. ebsentliuns (*-zen-tl-) ‘designated’; Lith. egle ‘spruce, fir’, Latv. egle, but 
OPruss. (Elbing Vocabulary) addle, where the dental is preserved. The Old Prussian of 
the Elbing Vocabulary is inconsistent here; despite addle ‘spruce, fir’, the dissimilatory 
change appears in clockis ‘bear’ (Lith. lokvs, Latv. lacis, CBalt. *tlok-). The unchanged 
sequence is preserved in the Old Prussian place name Tlokun=pelk ‘bear swamp’, the first 
element of which is the genitive plural of ‘bear’. 


MORPHOLOGY 
The noun 

Nominal inflection: case, number, gender 

The Baltic languages inherited from late Indo-European a system of cases for marking the 
syntactic and semantic role of a noun phrase in a sentence: nominative, genitive (continu- 
ing, as in Slavic, both genitive and ablative function), dative, accusative, and instrumen- 
tal, together with a vocative form of address. A locative case was also inherited (PIE *-oy, 
cf. adverbialized o-stems Lith. namie ‘at home’, OPruss. bTtai ‘in the evening’; a-stem 
rankoje < *rank-oi-en ‘in the hand’), but was rebuilt in East Baltic as part of a system of 
secondary local cases formed with postpositions added to existing case forms. In addi- 
tion to the inessive (old locative + *en; Stang 1966: 182), this system included an illa- 
tive (accusative + *n(a)), adessive (old locative + *p(i)), and allative (genitive + *p(i)). 
Besides the standard inessive (simple locative), certain of these, particularly the illative, 
are still found dialectally in Lithuanian, or continue as adverbs: laukaii ‘outside (direc- 
tional)’. It is difficult to establish in a reliable way the case system of Old Prussian, due 
to possible morphological interference from German in the texts; it undoubtedly agreed 
with East Baltic in marking at least a nominative, accusative, genitive, dative, and voca- 
tive, and possibly a locative (and, among personal pronouns, an instrumental). 

In the modem Baltic languages, the various cases are distinguished in two numbers, 
the singular and plural; a dual is attested for certain case forms in Lithuanian dialects and 
in older Lithuanian and Latvian texts. The noun has inherent masculine or feminine gen- 
der; the Balto-Slavic neuter merged in East Baltic with the masculine: Lith. sienas ‘hay’ 
(m.), Latv. siens (m.) : OCS seno (n.). The neuter, with a nominative-accusative a-stem 
ending -an, is attested in the Old Pmssian of the Elbing Vocabulary, for example, assaran 
(*azaran) Take’ (East Baltic masculine Lith. ezeras, Latv. qzqrs) and the phrase ructan 
dadan (spelled as a single word) ‘sour milk’, cf. Latv. rugts ‘bitter’; the «-stem alu ‘mead’ 
(: Lith. alus, Latv. alus ‘beer’) shows a zero ending. By the time of the catechisms, the 
Old Prussian neuter was in the process of disappearing (Petit 2010: 153). 
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Nominal stem types 

The endings marking the grammatical categories of case, number, and gender follow sev- 
eral inherited declensional patterns, based on stem type; these are most clearly preserved 
in Lithuanian and give the Lithuanian noun its characteristic archaic appearance. The 
following stem types are represented in Lithuanian; Latvian has a somewhat reduced and 
simplified system: 

• (pa-stems, which continue Indo-European (thematic) o-stems and include masculines 
and neuters (which merged in East Baltic with masculines): (masculine) Lith. vilkas 
‘wolf’, Latv. vilks < PIE *wlk w os; OPruss. deiws ~ deiwas (lx) ~ deiwis (EV) ‘God’; 
(original neuter) Lith. kaulas ‘bone’, cf. OPruss. (EV) caulan < PIE *keh 2 ulo-, Lith. 
biitas ‘lodgings’ : OPruss. buttan; ia- stem: Lith. svecias ‘guest’, Latv. svess ‘guest; 
strange, unknown’ < EBalt. *svetias; the Latvian consonantal stem zvgrs ‘beast’ has 
joined the a-stems, apparently on the basis of the genitive plural zv$ru (Gr. Oqpcov); 

• T-stems, which develop from the sequence -iia-: Lith. brolis ‘brother’, Latv. bralis 
(EBalt. *bralls, PBalt. *braliias); Lith. dagys ‘thistle’, Latv. dadzis ‘thistle, burdock’ 
(*dagiias; the original -iia- sequence is seen in the Estonian borrowing takijas ‘this- 
tle, burdock’); the -ys of Lithuanian represents a contraction of *-iias. This type also 
includes some masculine consonantal stems: Lith. memo, gen. sg. menesio, nom. pi. 
menesiai ‘moon, month’, Latv. menes(i)s, gen. sg. menesa, nom. pi. menesi ‘moon, 
month’ < PIE *meh[n(e)s); Latv. suns, gen. sg. suna, nom. pi. suni ‘dog’; Latv. udens, 
gen. sg. Mens, ace. sg. Meni, nom. pi. udeni ‘water’; Latv. akmens, gen. sg. akmens, 
ace. sg. akmeni, nom. pi. akmerii ‘stone’; 

• (pd-stems, continuing PIE *eh,-stems: Lith. rasa ‘dew’, Latv. rasa < PIE *roseh 2 ; id- 
stem: Lith. zinia ‘(piece of) news’, Latv. zina. This class also includes the Lithuanian 
relic forms marti, ace. sg. marciq ‘daughter-in-law’ and pad, ace. sg. paciq ‘wife’, 
and the feminine forms of the present active participles: dirbanti ‘working’, ace. sg. 
dirbanciq, which continue the type represented by Skr. dev! ‘goddess’; 

• e-stems, most of which represent a development from *-iia (Stang 1966: 201-204): 
saule ‘sun’, Latv. saule < CBalt. *sauliia, in Indo-European terms *seh 2 uliieh 2 ; zeme 
‘earth’, Latv. zeme, OPruss. semme< CBalt. *zemiia, comparable to Slav. *zemia. The 
development of *-iia to -e must have been early, since the latter is already reflected 
in borrowings into Finnish: kantele ‘a kind of stringed instrument’ : Lith. kaiikles 
(*kantles); 

• i-stems, continuing Indo-European /-stems, most of which are feminine (Lith. avis 
‘sheep’, Latv. (arch.) avs < PIE *h,ewis); Lith. ugnis (OLith., ELith. u ignis ) ‘fire’, 
Latv. uguns, now feminine, is attested as masculine in Old Lithuanian (cf. OCS ognn, 
m.; secondarily jo-stem ognn). The inherited /-stem declension has absorbed various 
consonantal stem types; the entry point for the consonantal stems was undoubtedly 
the accusative, in which the Balto-Slavic reflex of the syllabic nasal case marker in 
consonant stems (e.g., PIE *h 3 dont-m ‘tooth, ace. sg.’ > CBalt. *dant-in, Lith. dantj ) 
merged with the ending of an /-stem, for example, *avi-n ‘sheep’. Old inherited 
feminine consonantal stems include Lith. naktis, ace. sg. naktj ‘night’, Latv. nakts 
< PIE *nok w ts; Lith. zqsis, ace. sg. zqsj ‘goose’, Latv. ziioss < PIE *g h h 2 ens; Lith. 
sirdis, ace. sg. sirdj ‘heart’ < PIE *krd-; Lith. zuvis, ace. sg. ziivj ‘fish’ < CBalt. *zu 
(PIE *dg h uH), ace. sg. *zuuin; Lith. ausis, ace. sg. ausj, Latv. auss < PIE *h,ews; 
and Lith. dukte, ace. sg. diikterf ‘daughter’ < PIE *d h ugh,ter. Examples of masculine 
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consonantal stems are Lith. dantis, gen. sg. danties ‘tooth’ < PIE *h,dont(s); Lith. 
zveris, gen. sg. zveries ‘beast’, cf. Gr. 0r)p; Lith. suo, ace. sg. sun), Latv. suns < PBalt. 
*s(u)o, *sun-, PIE *ku > w on; Lith. vanduo, dial, vanduo, acc. sg. vandenj, Zemaitic 
Lith. unduo, unduo, Latv. udens, OPruss. wundan, unds < CBalt. *uandd, *unden-; 
and stems with the extended formant -men-, which in Baltic generalize a nom. sg. 
-md(n) and an oblique stem -men-: Lith. akmuo, acc. sg. akmen } ‘stone’, Latv. akmens 
< PIE *h 2 ekmon; Lith. piemud, acc. sg. piemen / ‘shepherd’, compare Gr. 3rotpf)v. In 
Latvian, where the /-stems are exclusively feminine, the masculine consonantal stems 
have mostly merged with the /-stems, sometimes preserving the original consonantal 
genitive singular: akmens (*akmen-es) ‘stone’, Mens (*unden-es) ‘water’. Vestiges 
of the Indo-European consonantal declension are preserved in Lithuanian (standard 
language, dialects, old texts) in gen. sg. dukters (~ dukteres), akmens (~ akmenes), 
suns (~ sunes ); nom. pi. (dial.) akmenes ‘stone’, dukteres ‘daughter’, zveres ‘beast’ 
(Gr. Oppeq), zuves ‘fish’ (Gr. i/Gua;), zqses ‘goose’ (Gr. xpve?); and gen. pi. dantq to 
dantis ‘tooth’ (Gr. odovxcov), naktq to naktis ‘night’ (Gr. vdktcov), dukterq to dukte 
‘daughter’, zuvq to zuvis ‘fish’ (Gr. i^Gucov), and zqsq to zqsis ‘goose’ (Gr. xflvcov). 
Latvian also preserves a number of such genitive plural forms dialectally (Endzelin 
1922: 318-319): ausu to auss (Lith. ausis) ‘ear’ (PIE *h,ews), ziiosu to zuoss (Lith. 
zqsis) ‘goose’, naktu (standard naksu, an /-stem) to nakts (Lith. naktis) ‘night’, zuvu to 
zuvs (standard zivs, gen. pi. zivju) ‘fish’; 

• u-stems, continuing Indo-European u-stems (Lith. suniis ‘son’ < PIE *suHnus; Lith. 
lediis ‘ice’, Latv. Iqdus, compare OCS ledrS)', most are masculine, but they include a 
few old neuters: Lith. dial, pekus ‘livestock’, OPruss. peckir, Lith. aids ‘beer’, Latv. 
alus, OPruss. alu. Latvian preserves, as pluralia tantum, a few old *u- (*uH-) stem 
feminines, such as dzirnus (also dzirnavas) ‘millstone’ (PIE *g w rh 2 nuHs; compare 
Slav. (OCS) zrunv). 

The more productive stem types have become strongly associated with a particular gender: 
(i)a- and f- (iia-) stems, together with u-stems, are masculine; (i)a- and e- ( iid -) 
stems, together with /-stems, are feminine. There is a strong tendency in the Lithuanian 
dialects for the few masculine /-stems, such as zveris, to be treated as feminine. As we 
have seen, in Latvian the old masculine /-stem zvqrs has joined the /-stem masculines. 


Sample o-stem case endings 

What follows is a brief sketch of individual case endings for o-stem (Baltic a-stem) 
nouns. PIE nom. sg. *-s, associated in Baltic with masculine gender, was extended in 
East Baltic to neuters, as these merged with the masculines: Lith. ezeras Take’ : OPruss. 
assaran (*azaran), with -n < PIE *-m. In East Baltic, as in Slavic, nouns in the geni- 
tive assumed the original ablative ending *-o-Hed, resulting in a circumflex *-a: Lith. 
vilko ‘wolf’, Latv. vilka. The Old Prussian genitive apparently preserves an original 
-s- formant: deiwas ‘of God’. Lithuanian dative -ui (circumflex) continues PIE *-o-Hei, 
through *-uoi < CBalt. *-oi. The accusative singular (and plural) are formed with the 
addition of -n (PIE *-m) to the theme vowel: Lith. vilkq. The instrumental adds a laryn- 
geal to the theme vowel, resulting in an acute CBalt. *-5: Lith. vilku. The Old Prussian 
adverbial bitai ‘in the evening’ preserves an old locative ending -ai (PIE *-o-y), also 
found as a fossilized form in the Lithuanian adverb namie ‘at home’. The regular ending 
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seen in Lith. vilke ‘in the wolf’ (Latvian has generalized the a-stem ending) seems to 
reflect an inessive -en added to the old locative form, with an unmotivated acute. The 
Baltic languages preserve, in the singular, an inherited vocative with the bare theme 
vowel -e\ OPruss. O Deiwe ‘O God!’, Tawa (~ Tawe) Noiison ‘Our Father’; Latv. tgv 
< *t§ve (modern tgvs) ‘father!’; Lith. Dieve ‘God!’, Teve must / ‘Our Father’, vilke ‘O 
wolf! ’; for personal names, this formation has been replaced in Lithuanian by the parti- 
cle -ai: Jonai! (nom. Jonas ‘John’). 

The nominative -accusative dual ending in (du) vilkii continues an Indo-European 
laryngeal added to the thematic stem, while the dative dual vilkam and instrumental 
vilkam go back to *-a-ma-. Nominative plural -ai (Lith. vilkai), also found in Old Prus- 
sian ( wijrai ‘men’), is undoubtedly borrowed from the pronominal stems. A neuter nom- 
inative plural in -a is attested in the Old Prussian Elbing Vocabulary: warto ‘door’ (cf. 
OCS vrata ‘gate, nom./acc. pi.’). The genitive plural ending of Lith. vilkq, and the Old 
Prussian place name Tlokunpelk ‘bear swamp’, represents a Proto-Baltic circumflex *-on 
(> EBalt. *-uon > -i/), continuing PIE *-o-om; the Catechism texts often show -an, which 
would reflect a short *-on. The earlier Lithuanian dative ending -mus, shortened to -ms 
( vilkams ), does not fully agree with OPruss. -mans ( waikammans ‘servant, dat. pi.’), but 
see Maziulis 2004: 43. The East Baltic accusative plural (Lith. vilkiis ) apparently rep- 
resents *-ons, with inorganic length and acute tone; Old Prussian has an expected -ans\ 
deiwans ‘gods’. The instrumental plural of Lith. vilkais represents a PIE *-o-Heys. The 
locative (inessive) plural seen in vilkuose is secondary; according to Stang 1966: 186, 
the original ending is preserved in adverbial numerals such as keturiese ‘four (together)’, 
dial, keturiesu (Zinkevicius 1966: 324), continuing BS1. *-oysu; cf. Slav. (OCS) vlbcexb 
‘wolf, loc. pi.’ Sample declensions of Baltic a-stem nouns in the three Baltic languages 
are given in table 11.4. 


TABLE 11.7 DECLENSION OF A-STEM NOUNS 



Lithuanian 

Latvian 

Old Prussian 

singular 




nom. 

vilkas ‘wolf’ (a.p. 4) 

vilks ‘wolf’ 

*grlks ‘sin’, cf. waix (*vaiks) ‘servant’ 

gen. 

vilko 

vilka 

grikas 

dat. 

vilkui 

vilkam 

griku 

acc. 

vilkq 

vilku 

grikan 

instr. 

vilkii 

= acc. 

? 

loc. 

vilke 

vilka 

*grlkai, cf. (adverbial) bitai ‘in the 




evening’ 

voc. 

vilke 

vilk(s) 

*grlke, cf. deiwe ‘God’ 

dual 




nom.- 

vilku 



acc. 




dat. 

vilkam 



instr. 

vilkam 



plural 




nom. 

vilkai 

vilki 

grikai 

gen. 

vilkq 

vilku 

grikan 

dat. 

vilkams 

vilkiem 

*grlkamans, cf. waikammans ‘servant’ 

acc. 

vilkus 

vilkus 

grikans 

instr. 

vilkais 

= dat. 

? 

loc. 

vilkuose 

vilkuos (sp.: vilkos) 

? 
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Nominal derivation 

Among the older derivational formations are deverbal agent nouns with an *-aia- suffix 
added to the infinitive stem: Lith. giedotojas ‘choir-boy’ (inf. giedoti ‘to sing, chant’, 
Latv. dziedatajs ‘singer’ (inf. dziedat ‘to sing’), with traces in Slavic: OCS ratajb ‘plow- 
man’ : OPrass. artovs, Lith. artojas (inf. arti ‘to plow’). Verbal noun formations dif- 
fer across Baltic: Lithuanian uses -imas, added to the preterit stem: piesimas ‘drawing’ 
( piesti ‘to draw’); the latter has a direct cognate in CS *pisi>mo ‘writing’ (Pol . pismo, Cz. 
pismo). In Latvian, verbal nouns are generally formed with the suffix -Sana (Mathiassen 
1997: 158): rakstTsana : rakstit ‘to write’, while in Lithuanian this suffix refers to the 
manner in which something is done: rasvsena ‘handwriting’ (i.e., how one writes). The 
corresponding suffix in Old Prussian is -sna\ rickausnan ‘government’. A widespread 
suffix, varying somewhat in form, denotes persons according to their occupation, place 
of residence, or some characteristic: Lith. darbininkas ‘worker’ {darbas ‘work’), Latv. 
darbinieks ‘employee, office-worker’ ( darbs ‘work’), OPruss. balgninix ‘saddler’ 
( balgnan ‘saddle’); the Slavic cognate -b nikb again demonstrates the closeness of the 
Baltic and Slavic language families. 


The adjective 

The grammatical categories of case, number, and gender (masculine or feminine), inher- 
ent in the noun, are reflected in an associated adjective. Thematic neuter adjectives are 
attested in the Old Prussian Elbing Vocabulary ( kirsnan ‘black’, cf. Lith. kifsnas ‘jet- 
black (horse)’, CS *ci>m'b ‘black’, and Skr. krsnah ‘black’) and in the Enchiridion: 
nawnan ‘new’; the third catechism also has the neuter «-stem (used as an adverb) poligu 
‘like’ and the /-stem arwi ‘true’. A vestigial adjectival neuter can be found in Lithuanian 
in predicate adjectives and certain pronominal forms (Petit 2010: 171-174). 


Declension types for simple adjectives 

The declensional paradigms of the adjective reflect those of the noun, with the substi- 
tution in Lithuanian of pronominal endings in the dative singular and plural masculine 
{-am, -iems), the locative singular masculine {-ante), and, for (7)a-stems, the nominative 
plural masculine (-/). The Latvian adjective follows the corresponding noun declension, 
which already incorporates pronominal endings in the dative singular and plural for mas- 
culines {-am, - iem ). The Baltic adjectival declensions are paired for gender on the basis 
of vowel height, yielding two major types for Lithuanian, a low-vowel type, in which 
a-stem masculines are paired with d-stem feminines {geras : gera ‘good’) and /a-stem 
masculines with jd-stem feminines {zalias : zalia ‘green’); and a high-vowel type, in 
which a-stem masculines are paired with /-stem feminines {saldiis : saldi ‘sweet’). The 
high-vowel type is distinctive only in certain case forms; otherwise, it merges with the 
ia/ia- type. It is quite productive in modem Lithuanian and includes borrowed words: 
abstraktiis, -i ‘abstract’. Latvian has only the low-vowel declensional type: a-stem mazs 
‘small, m.’ (Lith. mazas) paired with d-stem maza ‘small, f.’ (Lith. maza), and /a-stem 
za]s ‘green’ (Lith. zalias) paired with jd-stem za/a (Lith. zalia). The «//-stem type of 
Lithuanian (where it has already merged with the ia/ia- type in most case forms) has 
been replaced in Latvian by the ia! id-stem type: Lith. giliis, gili ‘deep’, but Latv. dzijs 
(*dzilias), dzi/a; Lith . platiis, plati ‘wide’, but Lat v.plass (*platias), plasa; Lith. saldiis, 
saldi ‘sweet’, but Latv. salds, salda (with generalized hard stem). 
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TABLE 11.8 DECLENSION OF AIA - STEM AND 

LITHUANIAN 

[///-STEM SIMPLE ADJECTIVES IN 


a/ a- stem: 

mazas, a ‘small’ (a.p. 4) 



singular 


plural 



masculine 

feminine 

masculine 

feminine 

nom. 

mazas 

maza 

mazi 

mazos 

gen. 

mdzo 

mazos 

mazq 

mazq 

dat. 

mazam, OLith. mazamui 

mazai 

mazlems, OLith. maziemus 

mazoms 

acc. 

mazq 

mazq 

maziis 

mazas 

instr. 

mazu 

maza 

mazais 

mazomis 

loc. 

mazame 

mazoje 

mazuose 

mazose 

u/i-stem: saldus, -i ‘sweet’ (a.p. 3) 


singular 


plural 



masculine 

feminine 

masculine 

feminine 

nom. 

saldus 

saldi 

saldus 

saldzios 

gen. 

saldaus 

saldzios 

saldziq 

saldziq 

dat. 

saldziam 

saldziai 

saldiems 

saldzioms 

acc. 

saldq 

saldziq 

saldzius 

saldzias 

instr. 

saldziu 

saldzia 

saldziais 

saldziomis 

loc. 

saldziame 

saldzioje 

saldziuose 

saldziose 


Declension types for compound (definite) adjectives 

The category of definiteness is marked in adjectives by the historical affixation of 
the third-person personal pronoun (in the appropriate case) to the simple (indefi- 
nite) form: Lith. indefinite naiijas, nauja ‘new’ : definite naujasis (*naujas+jis), 
naujoji (*nauja+jl); indefinite saldiis, saldi ‘sweet’ : definite saldiisis (*saldus+jis), 
saldzioji (*saldia+jl). The tendency to merge the Lithuanian «//-declension with 
the ia/ia- type, noted above, is extended still further in the definite adjective: thus 
nom. sg. f. saldzioji has replaced the reflex of *saldl+jl found in OLith. saldiii (read 
saldvji). The corresponding Latvian forms are less transparent: mazais ‘small’, maza. 
Old Prussian may show a comparable formation in pirmois (m.), pirmoi (f.) ‘first’. 
Old Lithuanian shows a peculiar adjectival formation in which pronominal -jis is 
added to a case form of a noun: danguig-iis ukinikas ‘the heavenly farmer’, where 
danguig (danguje) is the locative (inessive) of dangiis ‘heaven, sky’ (Schmalstieg 
1988 : 303 ). 

Lithuanian has generalized the mobile accentual paradigm for disyllabic adjectives; 
remnants of old initial stress are found in the nominative singular of a number of M-stcm 
adjectives with an acute base: aiskus, diski ‘clear’, lygus, lygi ‘flat, even’, but the definite 
forms aiskiisis, aiskidji; lygusis, lygidji follow the productive (mobile) pattern. Latvian 
did not generalize mobility in adjectives, and therefore preserves the original accentual 
distribution of acute bases. 
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TABLE 11.9 DECLENSION OF AA4-STEM AND C///-STEM DEFINITE ADJECTIVES IN 


LITHUANIAN 



a/ a- stem: mazas, 

a ‘small’ (a.p. 4) 





singular 


plural 



masculine 

feminine 

masculine 


feminine 

nom. 

mazasis 

mazoji 

mazieji 


mazosios 

gen. 

mazojo 

mazdsios 

mazqjq 


mazqjq 

dat. 

mazqjam 

mazajai 

maziesiems 


mazosioms 

acc. 

mazqji 

mazqjq 

maziuosius 


mazqsias 

instr. 

mazuoju 

mazqja 

mazaisias 


mazdsiomis 

loc. 

mazajame 

mazojoje 

mazuosiuose 


mazdsiose 



uli- stem: saldus, 

-i ‘sweet’ (a.p. 3) 





singular 


plural 



masculine 

feminine 

masculine 


feminine 

nom. 

saldusis 

saldzioji 

saldieji 


saldziosios 

gen. 

saldziojo 

saldzidsios 

saldziqjq 


saldziqjq 

dat. 

saldziajam 

saldziajai 

saldiesiems 


saldziosioms 

acc. 

saldqjj 

saldziqjq 

saldziuosius 


saldziqsias 

instr. 

saldziuoju 

saldziqja 

saldziaisiais 


saldziosiomis 

loc. 

saldziajame 

saldziojoje 

saldziuosiuose 


saldziosiose 


Numerals 

Cardinal and ordinal numbers 

Also agreeing in gender, number, and case with a nominal head, and therefore function- 
ing as adjectives, are the numbers from one to nine and their compounds. Lith. vlenas, -a 
‘one’ and Latv. viens, -a continue an East Baltic base *vein-, with an unexplained v- in 
comparison with SCr. in ‘someone, other’ (which also confirms the East Baltic acute); 
OPruss. ains, aina, ainan apparently continues an o-grade *oyn-, which may also under- 
lie the East Baltic forms. The corresponding ordinal is Lith. pirmas, a, Latv. pirmais 
(dial, pirmais), OPruss . pirmois. Lith. dii (m.), dvi (f.) ‘two’ continues a dual form (m.) 
*d(w)o-h,, (f.) *d(w)o-ih[; Latv. divi (m., from *duvi by regular sound change), divas 
(f.) adopts the masculine/feminine plural endings found in ‘four’ through ‘nine’; OPruss. 
dwai ‘two’ is also a plural form. The ordinal is Lith. aiitras, a, Latv. uotrais, OPruss. 
antars, antra, continuing an Indo-European form for ‘other’. Lith. trvs, Latv. tris ‘three’ 
represent an /-stem plural formation *triies < *treies, which does not distinguish gender. 
For the ordinal, East Baltic agrees with Slav. (OCS) tretii in having refashioned the stem 
as *tret-: Lith. trecias (*tretias), trecia, Latv. tresais, tresa ; the distinct OPruss. tins, tirtis, 
nom. sg. f. tirti may represent an earlier *trtyo-, or a remaking of a BS1. *trit(i)ios. 

Regular plural declensions on an adjectival pattern begin with Lith. keturi, ketu- 
rios ‘four’, Latv. cetri, cetras (in Indo-European terms, *k w etur-) ‘four’; Latv. cetri is 
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a replacement for earlier cetri (cf. the ordinal cqturtais) under Slavic influence. Like 
Slav. (OCS) ceU’brt-b, the Lithuanian and Old Prussian ordinals show a different syllab- 
ification of the -ur- sequence: Lith. ketvirtas, OPruss. kettwirts: PIE *k w etwr-t-. Lith. 
penki,penkios ‘five’, Lat v. pieci, piecas have as their base PIE *penk w e; Lith. sesi, sesios 
‘six’, Latv. sesi, sesas continue EBalt. *ses- (Latvian through *sesi-). A reduced-grade 
is found in the OPruss. ordinal (w)uszts, f. uschtai, Lith. dial, iisios ‘six-week lying-in 
period (after birth)’. These point to an unusual pre -Baltic alternation *sweks — *uks-. 
Lith. septyni, septvnios ‘seven’, Latv. septitji, septiijas represent a Baltic *septin < PIE 
*septm (the long -f- in the Lithuanian forms is secondary). The ordinals OPruss. septmas, 
OLith. sekmas (*sep(t)mas; modem septiiitas) are continuations of PIE *septmo-. The 
acuted -o- of Lith. astuoni, astuonios ‘eight’, Latv. astuoni, astuonas reflects the final 
laryngeal of PIE *h 3 ekteh 3 ; the related ordinal, OLith. asmas, OPruss. asmus (Modem 
Lith. astuntas, Latv. astuotais ), rebuilt on the pattern of ‘seventh’, directly corresponds 
to Slav. (OCS) osrrvb. Lith. devyni, devvnios ‘nine’, Latv. devini, deviyas (EBalt. *devin, 
after *septin ‘seven’) agree in their initial d- with Slav. (OCS) devgtbf the Old Pmssian 
ordinal newmts ‘ninth’ preserves the original nasal of PIE *hjnewn. 

Unlike ‘one’ through ‘nine’, Baltic ‘ten’ and its multiples, including the teens, function 
as nouns, governing (in general) the genitive case (unless the entire noun phrase is gov- 
erned by another case). Lith. desimtis and indeclinable desimt ‘ten’, Latv. desmit (dial. 
desimt), and OPruss. dessimpts continue a Proto-Baltic consonant stem * desimt (the base 
of which is PIE *dekm), which, like the majority of such stems, joined the /-stems; the 
Lith. gen. pi. desimtif and OLith. (Dauksa) gen. sg. deszimtes are vestiges of the conso- 
nantal declension. Lith. simtas ‘hundred’ and Latv. simts are direct continuations (except 
for the loss of neuter gender) of PIE *(d)kmt6m. Lith. tiikstantis (m. ja-stem, dial, and 
OLith. f. /-stem) ‘thousand’, Latv. tukstuotis (m. w-stem) are related in an unclear way 
to OPmss. (ace. pi.) tusimtons, Slav. (OCS) tvsqsti, tysqsti and Gmc (Goth.) pusundi ; the 
-k- of the East Baltic forms is secondary. 

‘Eleven’ through ‘nineteen’, which (like ‘ten’, etc.) have noun syntax, are formed 
according to different models in Lithuanian and Latvian. Latvian follows a Slavic-like 
model of ‘one-after-ten’, ‘two-after-ten’ (-pa-d(e)smit, cf. OCS ( dbva ) na desgte): 
vienpadsmit ‘eleven’ (‘one -after-ten’), divpadsmit ‘twelve’ (‘two-after-ten’), trispad- 
smit ‘thirteen’ (‘three-after-ten’). Lithuanian reflects, with its formant -lika, a “Ger- 
manic” model of “one left,” “two left,” seen in Goth, ainlif ‘eleven' , twalif‘ twelve’: 
Lith. vienuolika ‘eleven’ (‘one-left’), dvvlika ‘twelve’ (‘two-left’), tiylika ‘thirteen’ 
(‘three-left’), keturiolika ‘fourteen’ (‘four left’), etc. Lith. dvidesimt ‘twenty’ replaces 
an old dual phrase dvi desimti. The remaining multiples of ten, through ‘ninety’, are 
formed with the accusative plural feminine of the number together with an inde- 
clinable desimt. trisdesimt ‘thirty’, keturiasdesimt ‘forty’, penkiasdesimt ‘fifty’, and 
so on. 


The pronoun 

Personal pronouns 

(Standard) Lithuanian and Latvian differ in the initial vowel of the first-person sin- 
gular pronoun: Lith. as, dial, and OLith. es ‘I’, Latv. es; Old Prussian agrees with 
Lithuanian in showing mainly as, with a few instances of es. The final consonant 
of the Balto-Slavic etymon *ez- must have devoiced in Baltic before the operation 
of Winter’s Law, since the Baltic forms show a short vowel; compare Slav. (OCS) 
azb, with the reflex of lengthening. The oblique cases are formed on the stem *man-, 
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OPruss. *men- (with variants). The second-person singular, Lith. tii (shortened by 
Leskien’s Law), Latv. tu, Old Prussian tii, toil, points to an acuted *tu (PIE *tuH). 
In the first-person plural, there is a length difference between OPruss. mes, match- 
ing Lith. mes (with secondary lengthening) and Latv. mes, which shows a long acute 
vowel. The oblique cases retain an initial n- in Old Prussian: gen. pi. noiison, dat. 
pi. noiimans (comparable to Slav. (OCS) nasr,, nann), as opposed to Lith. miisif, 
mums, Latv. musu, mums. The second-person plural shows agreement across Bal- 
tic: Lith. jus, Latv. jus, OPruss. ious, pointing to CBalt. *ius. Lithuanian preserves 
an old dual form in miidu, mudvi (dialectally also vedu, vedvi ) ‘we two’ , jiidu, jiidvi 
‘you two’. The third-person pronouns differ across the Baltic languages. Lith. (m.) 
jis, (f.) ji, with correlates in High Latvian, continues an Indo-European deictic stem 
*i- (*is, *ih,) with initial j- generalized from a thematized oblique stem ia-. In a num- 
ber of Lithuanian dialects, anas ~ anas, -a, ‘yon’ serves as the third-person pronoun, 
matching in origin and function Slav. (OCS) om, ona. The Latvian pair vins, vtija is of 
uncertain origin; OPruss. tans, tanna ~ tenna undoubtedly reflects a conflation of the 
demonstrative *ta- and *anas (cf. also Pol., Cz. ten ‘that’). Alone among the Baltic lan- 
guages, Old Prussian shows an enclitic di-\ prowela-din ‘betrayed him’. An attributive 
(emphatic) pronoun is represented in Lith. pats (< pads), pad, Latv. pats, pasa (pad) 
‘oneself’; 9 the Old Prussian equivalent is subs, not found in East Baltic. 

TABLE 11.10 DECLENSION OF THE LITHUANIAN THIRD-PERSON PERSONAL PRO- 
NOUN JIS, JI 


masculine feminine 



singular 

plural 

singular 

plural 

nom. 

jis 

jie 

ji 

jos 

gen. 

jo 

m 

jos 

jq 

dat. 

jam 

jiems 

jai 

joms 

acc. 

ji 

juos 

jq 

jos 

instr. 

juo 

jais 

jit 

jomis 

loc. 

jame 

juose 

joje 

jose 


Clide pronouns 

A reflexive clitic particle, Lith. -s(i), Latv. -s, is joined to a verb to express middle voice: 
Lith. keliuos(i) ‘(I) get up’, Latv. celuos ‘(I) rise, get up’. In verb forms with a preverb 
or other prefix (ne-, te-, be-), the particle follows Wackemagel’s rule in Lithuanian (see 
below): atsikeliu ‘(I) get up’, inf. atsikeld; nesikeliu ‘(I) do not get up’, while in Latvian 
the particle behaves as in simplex forms: pieceluos ‘(I) get up’, inf. piecelties ‘to get 
up’. In origin, the reflexive particle is a clitic form of the third-person dative pronoun, 
found in Old Lithuanian texts in both dative and accusative functions. The old texts also 
preserve the pronominal clitics 1 sg. mi and 2 sg. ti, which often appear in second posi- 
tion: Pa-mi-rodik ‘show me’ (Modem Lith. parddyk man), pa-ti-rddisiu ‘(I) will show 
you’ (modem Lith. parddysiu tau). An example of a reflexive enclitic in Old Prussian is 
maitatun-sin ‘to nourish oneself’. 

Demonstrative pronouns 

The Baltic demonstrative pronouns reflect a system of person deixis: ‘this’ (near the 
speaker), ‘that’ (near the hearer), and ‘yon’ (beyond speaker and hearer). Lith. sis, si 
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(PBalt. *s!) ‘this’, Latv. sis, si, and OPruss. (m.) schis, like Slav. (OCS) Sb, si, continue an 
Indo-European deictic stem *ki-; the s- of the Latvian and Old Prussian forms represents 
a spread of *sia- from the oblique cases. Lith. tas, ta (PBalt. *ta) ‘that’ and Latv. tas, ta, 
like Slav. (OCS) tb, ta, represent generalizations of the oblique stem *to- of the Indo- 
European demonstrative pronoun *so, *seh 2 , *tod. The Old Prussian set (m.) stas, (f.) sta, 
stai (*stajl?), (n.) sta ‘this, the’ apparently represents *sita-, analogous to Lith. sitas ‘this’, 
a conflation of sis and tas; compare Lith. stai, abbreviated from sitai ‘here’. Lith. anas, 
-a ‘yon’, cognate with Slav. (OCS) onb, ona, serves dialectally (eastern Lithuanian and 
Zemaitic) as the third-person pronoun. 


Interrogative pronouns 

The interrogative pronouns in Baltic continue an Indo-European stem *k w o-: OPruss. 
(m., f.) kas, (n.) ka. The East Baltic forms do not distinguish ±human or ±animate: Lith. 
kas ‘who, what’, Latv. kas. The interrogative/relative Lith. kuris, kuri ‘which, that’, Latv. 
kurs, kura, apparently represents the addition of a pronominal jis to Lith. kur, Latv. kiir 
‘where’, used (as it is in dialects) as a relative pronoun. Lith. kat(a)ras, -a, Latv. katrs, 
katra ‘which of two’ continues a PIE *k w otero-. The correlative series t- (demonstrative) 
: k- (interrogative) : i- (relative), found in many Indo-European languages, is represented 
in Baltic by such forms as Lith. toks (toks), tokia ‘such a’ : koks (koks), kokia ‘what (sort 
of)’ : joks (joks), jokia ‘no, not a single (originally, ‘any (sort of)’)’, parallel to Slav. 
(OCS) takb, kakb, jakb. Latvian presents a somewhat different formation in tads, tada 
‘such a’; kads, kada ‘what (sort of)’. 


The verb 

Verbal inflection 

While the Baltic noun presents an archaic appearance, the verb has been radically refash- 
ioned and reduced to three tenses: present, past, and future. The present continues the 
Indo-European present, with the following basic stem types: an athematic type (see 
below), a thematic type (Lith. 1 pres. sg. lipii, 3 sg. lipa : lipti ‘to climb’), and “semi- 
thematic” types (Lith. /-stem 1 pres. sg. sedziu, 3 sg. sedi : sedeti ‘to sit’ and Lith. *a-stem 
1 pres. sg. prasau, 3 sg. praso : prasyti ‘to ask’). 10 The future, formed on the infinitive 
stem with a suffixed -si/s- (Lith. kelsiu, kelsi, kels ‘(I, you, he/she/they) will raise’, Latv. 
celsu, celsi, cels ; OPruss. postasei ‘(you) will become’), continues an Indo-Euro- 
pean desiderative. 11 The various Indo-European preterit formations have been replaced in 
Baltic by a simple innovative past. The function and forms of the Indo-European middle 
voice have been replaced by both an ablaut form (reduced grade) of a verbal root, and a 
reflexive construction: Lith. (trans.) kelti ‘to lift, raise’, (intr./mid.) kilti ‘to rise’, (reflex- 
ive) keltis ‘to get up (raise oneself)’ (the base here is PIE *kelH- ‘to rise (up), tower 
(over)’, LIV 2 349). Non-finite verb forms include the infinitive (the dative/locative of 
a pre -Baltic verbal noun in - ti -): Lith. -ti, Latv. -t, OPruss. -t, -twei; cf. CS - ti ), vestiges 
of a supine (the accusative of a pre -Baltic verbal noun in -tu-)\ Lith. -tip Latv. dial, -tu, 
OPruss. -tun, -ton (serving as infinitive), cf. CS -tv, and various participles, which com- 
bine with forms of ‘to be’ to build a system of relative tenses. In addition to the indicative, 
a number of moods are distinguished, including an imperative, conditional, and optative 
(permissive); these are presented below. 
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Athematic presents 

Although the only productive stem type in East Baltic is now the thematic (continuing 
an Indo-European stem with -e/o- formant), athematic verbs experienced a period of late 
productivity in Lithuanian, expanding beyond their original scope. Athematic presents 
are well attested in texts from the sixteenth and seventeenth centuries (see Petit 2010: 
211-212) and as relic forms in dialects. Old Lithuanian ‘to be’ and ‘to give’ (with regu- 
larized spellings) serve as examples: ‘to be’ 1 sg. esmi (*es-mie), 2 sg. esi (*es-sie), 3 sg. 
est(i); 1 pi. esme, 2 pi. este; 1 du. esva, 2 du. esta; ‘to give’ 1 sg. duomi, 2 sg. duosi, 3 sg. 
duost(i) (= OPruss. dast ); 1 pi. duome, 2 pi. duoste; 1 du. duova, 2 du. duosta. A trace of 
the original athematic ending can be found in Latv. 1 sg. gsmu, Lith. dial, esmii (standard 
Lith. thematic esii ) ‘(I) am’. Best represented among the Old Prussian athematic verbs is 
‘to be’: 1 sg. asmai (also asmu), 2 sg. asset (with variant spellings), 3 sg. ast; 1 pi. asmai, 
2 pi. astai. The Lithuanian first- and second-person endings -mi and -si are Leskien’s Law 
shortenings of *-mie, *sie (cf. OLith. essiegu, with emphatic particle -gu), corresponding 
to the -mai, -sai of Old Prussian. These may represent a conflation of Indo-European 
*-mi, *-si and the (originally middle-voice) *-ai of the 1 sg. perf. (OCS vede), as inherited 
perfects were absorbed by the athematics in Baltic. 


The thematic verb type 

Thematic verbs use the same set of person markers for all three tenses. The first-person 
singular marker in, for example, Lith. keliii ‘(I) lift, raise’, (refl.) keliuos(i) ‘(I) get up’, 
Latv. ce}u, (refl.) ce}uds ‘(I) rise, get up’ directly reflects a PIE *-oH. The marker of the 
second-person singular (Lith. keli Tift, raise’, kelies(i) ‘get up’; Latv. cel (*celi), cedes 
‘rise, get up’) is an EBalt. *-ei of uncertain origin. The Old Prussian material, due to the 
circumstances of its transmission, is not clear; Stang (1966: 408) suggests that forms 
such as giwassi, gfwasi ‘(you) live’ are comparable to Slav. (OCS) zivesi and represent a 
conflation of the athematic second-person singular ending -si with the *-ei of East Baltic. 
The endings of the first and second persons plural, Lith. keliame, keliate, (refl.) keliames, 
keliates\ Latv. celam, celat; (refl.) celamies, celaties, reflect EBalt. *-me, *-te, the acuted 
length of which is perhaps by analogy to the singular forms. A distinctively Baltic inno- 
vation is the loss of a number distinction (singular/dual/plural) in the third person; here 
thematic verbs show a bare stem in -a for all persons: Lith. kelia Tift, raise’, kelias(i) 
‘get up’, Latv. cel, cejas ‘rise, get up’; past tense: Lith. kele, keles(i); Latv. cela, celas. 
Old Prussian occasionally distinguishes a singular for both thematic and athematic verbs, 
present and past, by means of a suffix -ts (cf. tas ‘that’): astits (*esti-ts) ‘is’. 


The “semi-thematic ” verb type 

In addition to the athematic and thematic stem types (the latter with a number of sub- 
types), Baltic also preserves stem types (termed “halbthematisch” by Stang (1966: 319)) 
in which an unsuffixed present tense stem is associated with an extended infinitive stem. 
In the first, a present stem with an -i- theme vowel (an analogous type in Slavic has a 
long -F-) is paired with an infinitive containing an -e- formant. These often have stative 
meaning: Lith. 1 sg. sedziu (inf. sedeti ‘to sit’), 2 sg. sedi, 3 sg. sedi ; 1 pi. sedime, 2 pi. 
sedite; compare OCS sezdg, sedisb : sedeti. This type is also characterized by reduced- 
grade root vocalism: Lith. 3 pres, mini, inf. mined ‘to mention, remember’, compare 
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OCS mbnit'b, nibiieti ‘to think’. In another “semi-thematic” type, an o-grade present tense 
with an -a- stem formant is paired with an infinitive in -F-; these have iterative meaning: 
Lith. gano : ganyti ‘to pasture (cattle)’ (originally ‘to drive (cattle)’), Latv. gana, gamt\ 
Lith. praso : prasvti ‘to ask’, Latv. prasa : prasit, and correspond to a Slavic formation 
preserving a present tense marker -F-: OCS gonitb : goniti ‘to chase, persecute’, prositb 
: prositi ‘to ask’. 


The Baltic preterit 

The Baltic preterit is marked by the addition of an *-e or *-a suffix to a verb stem, which 
generally agrees in ablaut grade with the infinitive stem; the thematic endings were added 
directly to this preterit formant. Since the third person has no ending, it can be considered 
the unmarked form. An example of a preterit with the *-a formant is Lith. lipti ‘to climb’: 
Lith. 3 lipo (*lip-a); 1 sg. lipau represents *lipa + *-uo; 2 sg. lipai, *lipa + *-ie; 1 pi. 
lipome, *lipa + *-me; and 2 pi. lipote, *lipa + *-te. The *-! formant is found in Lith. kelti 
‘to lift, raise’: Lith. 3 kele (*kele); 1 sg. keliau is then *kele + *-uo (with -eu to -iau); 2 
sg. kelei, *kele + *-ie; 1 pi. keleme, *kele + *-me; and 2 pi. kelete, *kele + *-te. Latvian 
has generalized the *-a formant, but *-! is still found dialectally and in texts from the 
sixteenth and early seventeenth centuries. The earlier *-e is reflected in the timbre of an 
e/e root vowel, which preserves a higher articulation before former <?: corresponding to 
Lith. kele ‘lifted, raised’, vede Ted’, nese ‘brought’, Latvian has cela (*cele), veda, nesa, 
with /e/ rather than /as/; compare sgda ‘sat’, with /re/, representing *sed-a, Lith. sedo. The 
limited Old Prussian material suggests that the *-e and *-a formants were found here as 
well: *-e: yrnmi[ts] (*ime-ts) ‘took’, *-a: pivwela[din] ‘betrayed (him)’. 


Baltic diathesis and ablaut 

The distribution of *-a and *-e in Lithuanian depends for the most part on stem type and 
root vocalism, but is ultimately connected with a system of diathesis that characterized an 
earlier stage of Baltic: *-a is associated with intransitive verbs and *-e with transitive. 12 
One of the striking features of the Baltic verbal system is the use of ablaut to mark the 
relation of transitive/intransitive. In a once highly productive type of diathetic marking, 
a verbal base with transitive meaning came to be characterized by a w-present (a sub- 
type of the thematic present) with e-grade in the root and an -e formant for the preterit; 
the e-grade was extended by analogy to the infinitive. A correlated base with intransi- 
tive (often inchoative) meaning was characterized by reduced grade of the root, with a 
nasal-infixed a-present and an a-preterite. The infixed present tense stem is always char- 
acterized by circumflex tone, regardless of the tone of the infinitive. 13 Thus, the e-grade 
transitive Lith. kelti ‘to lift, raise’ is correlated with a reduced grade intransitive kilti 
(*kl-) ‘to rise’. The present tense of the latter is marked by a nasal infix, and the tone 
becomes circumflex: kvla (i.e., kjla < *ki-n-l-a); the preterit shows the a-formant: kilo. In 
an environment where a nasal infix is not possible, such as an obstruent stem with long 
root vocalism, an -st- suffix is added to the present tense stem: Lith. tolti ‘to move away’: 
tolsta, tolo. 


Mood 

The Old Prussian imperative 2 sg. immais ‘take’, 2 pi. imaiti (: Tmt ‘to take’) continues an 
Indo-European optative in -oi-, also reflected in the Lithuanian permissive, formed with 
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the proclitic particle to-: tesakai ‘may he/she/they say’, tedarbie (< *-ai) ‘may he/she/ 
they work’. A more common formation in Modem Lithuanian is to- plus the third-person 
present: tesako, tedirba. Old Prussian is unique in forming a permissive with the particle 
-sei (pareysey ‘may X come’), which may be connected with the sigmatic future (Stang 
1966: 442). The imperative is formed in Lithuanian by adding the particle -k(ite) to the 
infinitive stem: elk, eikite ‘go!’ (inf. eiti ‘to go’). The Latvian imperative is identical to 
the second-person present; the plural is formed by adding -iet to the singular: saki ‘say! ’, 
sakiet (inf. sacit ‘to say’). The conditional mood of East Baltic derives from a -bi- (‘be’) 
element added to the supine, most clearly seen in forms such as Lith. 1 pi. butumbime 
‘we would be’, 2 pi. butumbite ‘you would be’. Most of the present-day forms are unan- 
alyzable (e.g., Lith. 1 sg. buciau ‘(I) would be’, butq ‘he/she/they would be’). In Latvian, 
a single form identical with the old supine has been generalized for all persons and num- 
bers: cqltu ‘X would raise’. Characteristic of both Lithuanian and Latvian is the use of a 
participial form - in Lithuanian an active participle in the nominative case, in Latvian an 
indeclinable form derived from the present active participle - to mark reported speech or 
indirectly perceived action (evidential mood): Latv. Vir/s dzivuojuot (sp.: dzfvojot) Riga 
‘He is said to live in Riga’, Lith. Jis gyvenqs Vilniuje ‘He is said to live in Vilnius’. Pecu- 
liar to Latvian is the use of the debitive (expressing obligation), formed with the particle 
ja- and the third-person present, with the logical subject in the dative: man jabrauc (‘to- 
me must-go’) ‘I must go’; the debitive of ‘to be’ is formed with the infinitive: man jabut 
‘I have to be’. 


Participles 

Lithuanian and Latvian present an elaborate system of participles and gerunds. Among 
the more frequent participles (which decline like adjectives) are the present active, con- 
tinuing a PIE -nt- participle: Lith. rasqs, stem rasant-, inf. rasyti ‘to write’; OLith. (nom. 
sg. f. def.) sqntiii ‘being’; Latv. rakstuoss : rakstit ; OPrass. dilants ‘working’; past active, 
reflecting a PIE perfect participle in -ues-, CBalt. *-uens ~ *-us-: Lith. (pa)rasgs, stem 
(pa)rasius-; Latv. (uz)rakstfjis, stem (uz)rakstTjus-, OPruss. dauns ‘having given’; pres- 
ent passive: Lith. rasomas ; Latv. rakstams ; OPruss. poklausunanas ‘being heard’; and 
past passive, reflecting a PIE verbal adjective in -to-: 14 Lith. (pa)rasvtas, Latv. (uz)rak- 
stTts; OPruss. dats ‘given’. The past active participle is used, together with a finite form of 
the verb ‘to be’, to fonn a system of perfect tenses: Lith. as esii (pa)ras^s (m.), (pa)rasiusi 
(f.) ‘I have written’, as buvaii (pa)rasgs (m.), (pa)rasiusi (f.) ‘I had written’; Latv. es 
gsmu (uz)rakstfjis (m.), (uz)rakstfjusi (fl), es biju (uz)rakstljis (m.), (uz)rakstijusi (fl). Pas- 
sive constructions are formed with the copula and the corresponding passive participle: 
Lith. knvga biivo rasoma ‘the book was being written’, knvga biivo parasyta ‘the book 
was/had been written’. Old Prussian has a passive construction using wTrst ‘becomes’: 
laikuts wlrst ‘is held’. 

Both Lithuanian and Latvian preserve the reflex of an earlier dative absolute in gerund 
constructions, which serve as a dependent clause when the subject differs from that of 
the main clause. The subject of the gerund appears in the dative; the gerund itself is a 
frozen form of an active participle in the dative (still found in the oldest Lithuanian texts): 
Lith. saulei (dat. sg.) tekant (pres, gerund) ‘as the sun is/was rising’ (‘to-the-sun rising’), 
gimus (past gerund) vaikui (dat. sg.) ‘when the child had been bom’; Latv. saulei (dat. 
sg.) lecuot (pres, gerund) ‘at sunrise’, ziemai (dat. sg.) tuvuojuoties (pres, gerund, refl.) 
‘as winter is/was approaching’. When the subjects of the main clause and dependent 
clause are identical (and the verbal action is simultaneous), a special active participle, 
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formed in Lithuanian with -damas, in Latvian with -dams, is used: Lith. skaitydamas, 
Latv. lasTdams ‘(while) reading’. 


Aspect 

Aspect in Baltic is not as fully grammaticalized as it is in Slavic. While imperfective 
simplex verbs may be perfectivized through the addition of a lexically empty prefix (Lith. 
rasyti ‘to write, imperfi’ : parasvti (perfect.), Latv. rafafiY (imperf.) : uzrakstit (perfect.)), 
the aspectual meaning of verbs formed with a lexical prefix, for example, Lith. aprasvti 
‘to describe’, Latv. aprakstit, is typically contextual. In Latvian, certain verbs with spa- 
tial prefixes and perfective meaning have acquired corresponding phrasal imperfectives 
consisting of the simplex verb and a spatial adverb: iziet (perfect.) / iet ara (imperf.) ‘to 
go out’, niionemt (perfect.) / neiiit nuost (imperf.) ‘to take off’. Perfective verbs used in 
the present tense have non-progressive meaning (like the simple aspect in English): Lith. 
Saule pateka (perfect.) rvtuose ir nusileidzia (perfect.) vakaruose, Latv. Saule pace\as 
austrumuos un niioriet rietumuds ‘The sun rises in the east and sets in the west’. 


Verbal derivation 

Among the more widespread verb types with a derivational suffix in both present tense 
and infinitive/preterit forms is *-e-ia (inf. *-e-ti), forming deadjectival verbs: Lith. senas 
‘old’ : seneti ‘to grow old’, 3 pres, seneja; Latv. v<?cs : vecet, 3 pres. vece\ cf. Slav. (OCS) 
stan : stared, 3 pres, starejetv, this suffix is also used to form causatives in Latvian: 
audzet ‘to cultivate, grow (trans.)’, 3 pres, audze : augt ‘to grow (intr.)’. Another caus- 
ative type is found in - in-ala -: Lith. auginti, 3 pres, augina ‘to raise (children)’ : augti 
‘to grow’; Latv. audzinat ‘to bring up, nurture’, 3 pres, audzina : augt ‘to grow’, OPruss. 
auginnons ‘brought up, raised’, (po)augints ‘brought up’. A third widespread type con- 
sists of denominal verbs in -auia (inf. -auti), cognate with Slav. -iijQ : -ova-: Lith. keliauti 
‘to travel’, 3 pres, keliauja, 3 past keliavo : kelias ‘road’, OPruss. grikaut ‘to confess’ 
(‘acknowledge one’s sins’) : grTk- ‘sin’. 


SYNTAX 
On case usage 

The nominative case, in addition to its basic functions as a marker of the grammatical 
subject and predicate noun, also marks the second argument in Latvian debitive con- 
structions, where the logical subject is in the dative: Man (dat. sg.) japerk maize (nom. 
sg.) ‘I have to buy bread’. The nominative may mark the direct object of an infinitive 
in Baltic impersonal constructions: Lith. Langai (nom. pi.) reikia uzdaiyti (‘to close’) 
‘The windows need to be closed’ (Schmalstieg 1988: 145-147), Latv. (from older texts) 
zirgs (nom. sg.) vajadzes mazgat ‘the horse will need to be bathed’ (Endzelin 1922: 409). 
Characteristic of East Baltic is the use of a preposed adnominal genitive, often in place 
of a relational adjective: lietuviif (gen. sg.) kalba ‘Lithuanian language’ (‘of-Lithuanians 
language’), latviesu valoda ‘Latvian language’; Lith. aukso (gen. sg.) ziedas, Latv. zelta 
(gen. sg.) gredzgns ‘a gold ring’ ( auksaslzelts ‘gold’), a usage that already appears 
in early translations of Polish texts into Lithuanian: dangaus (heaven, gen. sg.) karalvste 
‘heavenly kingdom’ (Pol. krolewstwo niebieskie). The genitive also marks agent or cause 
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in passive voice, a function inherited from Proto-Indo-European (Schmalstieg 1988: 180): 
Lith. vejo (gen. sg. agent) nulauzta berzo (adnominal gen. sg.) saka ‘a birch-tree branch 
broken off by the wind’, Latv. veja nuolauzts bqrza zars. In Lithuanian, a partitive gen- 
itive may be found in the role of subject or object: Miske vra vilkq (gen. pi.) ‘There are 
wolves in the woods’, Atvaziavo svecty (gen. pi.) ‘(Some) guests arrived’, Radau klaidq 
(gen. pi.) ‘(I) found (some) mistakes’. Arguments in negative existential constructions 
are in the genitive: Lith. Nerd zuvies (gen. sg.) ‘There is no fish’, Latv. Nav zivs, as are 
direct objects in negative sentences: As neturiu net brolio (gen. sg.), net sesers (gen. sg.) 
‘I have neither a brother nor a sister’, OLith. Ne turek kitu Diewu (gen. pi.) prieg manes 
‘Do not have other gods before me’ (Schmalstieg 1988: 173); Latvian uses the accusative 
here: Es needu maizi (ace. sg.) ‘I do not eat bread’. The object of the now-rare supine is in 
the genitive: Lith. Senis parejo piety (gen. pi.) valgyty ‘The old man returned to eat din- 
ner’ (pietiis ‘dinner, nom. pi.’), a usage that has been extended to infinitive constructions 
replacing the supine: Senis parejo piety valgyti. The dative, alongside its basic function 
denoting indirect object, is found in impersonal constructions: Lith. Man (dat. 1 sg.) 
reikia pagalbos (gen. sg.), Latv. Man (dat. 1 sg.) vajag palidzibu (ace. sg.) ‘I need help’; 
Lith. Man (dat. 1 sg.) salta ‘I am cold’ (with a historically neuter adjective), Latv. Man 
salst (inf. salt ‘to freeze’). For ‘have’, Latvian uses a “mihi est” construction with the 
dative, while Lithuanian has “habeo” with accusative: Latv. Man (dat. 1 sg.) ir gramata 
(nom. sg.) ‘I have a book’ (negated: man nav gramatas (gen. sg.) ‘I do not have a book’); 
Lith. As (nom. 1 sg.) turiu knvgq (ace. sg.). The dative is also used for the logical subject 
in Latvian debitive constructions: Man (dat. 1 sg.) japerk gramata (nom. sg.) ‘I have to 
buy a book’. The basic function of the accusative is to mark the direct object of a verb 
(Lithuanian switches to a genitive object in a negative sentence). The accusative often 
marks temporal and spatial expressions; the Lithuanian and Latvian words for ‘today’ 
are adverbs derived from the accusative for ‘this day’: Lith. siaiidien, Latv. suodien. In 
Lithuanian, the instrumental, in addition to denoting ‘by means of’ (Lith. rasvti piestukii 
‘to write with a pencil’), marks a predicate complement with verbs denoting a change of 
state: Jis tapo zurnalo (gen. sg.) mety (gen. pi.) zmogumi (instr. sg.) ‘He became the mag- 
azine’s man of the year’; modem Latvian uses the nominative here. In Latvian, the role 
of instrumental has been assumed by the preposition ar ‘with’, used with the accusative 
form in the singular ( raksttt ar zunuli ‘to write with a pencil’) and the dative form in the 
plural. The locative (inessive) is used without a preposition in both Lithuanian ( Vilniuje 
‘in Vilnius’) and Latvian (Riga ‘in Riga’); the Latvian locative has also assumed illative 
meaning, represented in modem Lithuanian by / + accusative: Vujs ienaca istaba (Lith. 
Jis jejo j kambarj) ‘He entered the room’. 


Basic word order 15 

The relative order of the major syntactic constituents of a sentence in the modem Baltic 
languages is determined by information structure - whether the constituents function as 
theme or rheme. In a neutral declarative sentence consisting of S and V, S is typically the 
theme and V the rheme, with resultant SV word order: Lith. Kaunas tebemiegojo ‘The 
village was still sleeping’, Latv. Janis laca ‘John is reading’; the V may be a nominal 
predicate, with the copula not overtly expressed in the present tense: Lith. Jis studeiitas, 
Latv. VU)S students ‘He is a student’. In a themeless sentence, in which both constituents 
function as rheme, the unmarked word order is VS: Lith. Isauso lytas ‘Morning dawned’, 
Gvveno senelis ir senele ‘There (once) lived an old man and old woman’, Latv. DzTvuoja 
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tevs ‘There lived a father’. In sentences consisting of three basic constituents, the prag- 
matically neutral and statistically most prevalent word order is SVO: Lith. Jis myli savo 
motinq, Latv. Vips mil savu mati ‘He loves his mother’. In Lithuanian, although the object 
may be highlighted by inversion (thus SOV), SVO and SOV are often in free variation 
(Ambrazas et al. 1985: 676), with SOV more characteristic of dialects, folklore, and 
formulaic speech: Pirmi gaidziai velniq [O] baido [V] ‘The first roosters scare away the 
devil’. SOV is also typical for sentences with object pronouns: As jj [O] pamaciaii ‘I saw 
him’, and is the unmarked order, alongside SVO, if the object is the rheme: Dede lap % 
[O] nusove [V] ‘Uncle shot a fox’, in answer to the question Kq nusove dede ? ‘What did 
uncle shoot?’ If S is the rheme, the most natural order is OVS: Kas nusove lapg? ‘Who 
shot the fox?’ Lapp [O] nusove dede [S] ‘ Uncle shot the fox’. Other word order per- 
mutations are possible, accompanied by expressive intonation. For non-context-bound 
sentences, in which all three major constituents function as the rheme, S and V are com- 
monly inverted (cf. VS for corresponding two-component sentences), thus both VSO: 
Pakviete [V] zvirblis [S] visits pauksteliiis [O] ‘A sparrow invited all the birds’, and OVS: 
Priesus [O] skvre [V] iipe [S] ‘A river separated the foes’. 


Phrasal word order 

Adpositions in the modern Baltic languages precede the governed noun (prepositions), 
in harmony with a now-neutral SVO basic word order. Lithuanian preserves a few post- 
positions (Ambrazas et al. 1985: 685): jo deka ‘thanks to him’, as does Latvian: vina del 
‘because of him’. A vestige of head-final position is found in the East Baltic local cases, 
formed with postpositions (see above). Other dependents generally precede the head: the 
neutral position of an adjective is before the noun it modifies, and an adnominal genitive 
also precedes its head noun, in harmony with an earlier SOV basic sentence structure 
assumed for Indo-European, but at odds with the prevailing SVO structure of the mod- 
em Baltic languages: Lith. tevo (gen. sg.) zodziai, Latv. tijva (gen. sg.) vardi ‘father’s 
words’ ( tevas/tpvs ‘father’). Partitive genitives, on the other hand, are postpositional: 
Lith. biitelis alaus ‘a bottle of beer’, Latv. pudele alus (contrast alaus biitelis, alus pudele 
‘beer bottle’). 


Wackernagel’s Law in Baltic 

Vestiges of Wackemagel’s Law of clitic placement can be found in several formations in 
East Baltic (see especially Petit 2010: 260-307). These include tmesis with a reflexive 
particle between a preverb and verbal root: Lith. at-si-kelti ‘to get up’, the Latvian of 
traditional folk songs ( daipas ): Iz-sa gauzi rauddjuo-s (with reflexive doubly marked; the 
corresponding form in modem Latvian would be iz-rauddjuo-s ) ‘I wept bitterly ’ and dialec- 
tally (Endzelin 1922: 703-708), and in Old Lithuanian with clitic pronouns or expressive 
particles: pa-mi-sakyk ‘tell me’, pa-gu-zjstat ‘you know (emphatic)’. In Old Lithuanian, 
we find as a relic the placement of the inherited relative pronominal *yo- between a prefix 
and root in definite adjectives and participles: pa-io-prasta, modem pa-prasto-jo ‘ordi- 
nary, gen. sg ,pra-iis-pulqs, modem pra-puolqs-is ‘having perished’; compare the Slavic 
superlative formant *na-jt-, Russ. Hau-. In Lithuanian, the emphatic particle -gi- occurs as 
an enclitic with adverbs and conjunctions: kufgi ‘where (emphatic)’, taigi ‘so, therefore’, 
afgi ‘emphatic question particle’, irgi {if ‘and’ + gi, lexicalized as ‘also’), and between a 
preposition and following noun, yielding extended forms such as nudg ( nuo ‘from’ + gi), 
ing (in [modem /] ‘into, to’ + gi), izg (is ‘out of, from’ + gi), prieg (prie ‘at’ + gi), which 
were particularly widespread in Old Lithuanian texts. 
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FURTHER READING 

Two traditional handbooks of the historical and comparative phonology and morphology 
of the Baltic languages are Endzellns 1971, useful for derivational suffixes, and Stang 
1966, with a section on the historical prosody of the Baltic languages. The lengthy intro- 
duction to Arkadiev 2015 (1-109) presents an up-to-date overview of the field of Baltic 
linguistics. For a synchronic and diachronic sketch of Lithuanian accentuation, see Leh- 
feldt 2001: 103-129. A number of individual topics, such as the Baltic neuter, are treated 
in depth in Petit 2010. Kabelka 1982, Dini 2000, and Sabaliauskas 2002 emphasize cul- 
tural and historical issues connected with the development of the Baltic languages. The 
first etymological dictionary of Baltic, Derksen 2015, includes in its introduction an over- 
view of the Baltic languages and their dialects, with special attention to Baltic prosodic 
phenomena in comparison with Slavic. The classic etymological dictionary of Lithua- 
nian, with information on other Baltic languages, is Fraenkel 1962-65. Smoczyhski 2016 
emphasizes Baltic ablaut relations, with corresponding Slavic material. An etymological 
dictionary of Old Lithuanian (through 1700) is now available (Hock 2015). Karulis 1992 
is a traditional etymological dictionary of Latvian; etymological and dialectal data can 
also be found in the classic dictionary of Latvian, Mulenbachs-Endzellns (Mulenbachs 
1923-32), available online. Maziulis 1981 is an annotated edition of all the Old Prus- 
sian texts. Etymological dictionaries of Old Prussian include Maziulis 1988-97 and (for 
verb forms) Smoczyhski 2005. Schmalstieg 1974 offers a grammar of Old Prussian, and 
Schmalstieg 1976 a review of literature on Old Prussian language, history, and culture. 
Endzelin 1922 is an exhaustive descriptive and historical grammar of Latvian and its 
dialects. For an overview of the phonology and morphology of the Latvian dialects in 
comparison with the standard language, see Gaters 1977. Mathiassen 1996 and 1997 are 
concise introductory grammars of Latvian and Lithuanian. For a more extensive pre- 
sentation of Latvian grammar, see Fennell and Gelsen 1980 and, with diachronic notes, 
Forssman 2001. 


NOTES 

1 The classic study, based on the evidence of hydronyms, is Toporov and Trubacev 
1962. 

2 The *e, *5 stage is apparently preserved in western Zemaitic: pens ‘milk’, dona 
‘bread’, compared with the ie, uo of standard Lithuanian pienas (Latv. piens), duona. 

3 For certain stem classes of the Latvian verb, final shortening of long vowels and 
diphthongs and apocope of short vowels leads to homonymy between the first-person 
singular present and past tense forms on the one hand, and the second-person singular 
and third person in the present tense on the other hand. Thus, the Lithuanian distinc- 
tion augu ‘(I) grow’ : augau ‘(I) grew’ ( augti ‘to grow’), risu ‘(I) bind, tie’ : risau ‘(I) 
bound, tied’ (risti ‘to bind’) is lost in Latv. augu, risu, and the forms distinguished in 
Lith. risi ‘(you) bind, tie’ : risa ‘(he/she/they) bind, tie’ merge in Latvian as ris. 

4 Saussure’s Law originally functioned as an automatic phonetic rule, operating on any 
instance of stressed short or circumflex followed by acute. However, with the loss of 
the conditioning environment (phonemic tone on unstressed syllables), together with 
the effects of Leskien’s Law, the process has become a morphologically conditioned 
rule. This is most obvious in derivation, where the effects of analogical leveling are 
already apparent in the earliest texts. 

5 Latvian broken tone is described by Lehiste (1972: 313) as “a change in the mode 
of phonation approximately in the middle of the syllable nucleus [. . .] realized as 
laryngealization or as a glottal stop.” 
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6 The choice of *s, *z for Proto-Baltic is motivated by their subsequent development 
as h in early Baltic borrowings into Baltic Finnic: Finn, heina ‘hay’ < CBalt. *selna-, 
cf. Lith. sienas. 

7 According to Kortlandt (1977: 325-326), the original glottalic feature of the plain 
voiced stops, together with the inherited laryngeal (merging the PIE laryngeals), was 
reinterpreted as a prosodic feature of the syllable, continuing to the present in Latvian 
and Zemaitic (Lithuanian) broken tone acute. 

8 For a phonetic account of this change, see Flamp 2006. 

9 A related noun is found in Lith. patis ‘husband’, pad ‘wife’, Latv. pats, pad, cf. OInd. 
patih ‘master, husband’, Gr. hook; ‘husband’; and Lith. viespats ‘lord’, OPruss. 
waispattin ‘wife, mistress of the house (ace. sg. f.)’. Except for the full grade of 
the vies-/wais- element, the Baltic forms are identical to OInd. vispatih ‘master of 
the house’; cf. also OLith. viespatni ‘mistress of the house’ and OInd. patm ‘mis- 
tress’. Old Lithuanian shows traces of a consonantal stem: nom. pi. viespates, gen. 
pi. viespatif (Fraenkel 1962-65, vol. 2: 1245). 

10 Note that the latter type is not distinguished formally from an *-a preterit: 1 sg. prasau, 
3 sg. praso (prasvti ‘to ask’) is present tense; radau, rado ( rasti ‘to find’) is past. 
Apresent in *-a will typically have a preterit in *-e: 1 sg.prasiau, 3 sg. prase, and thus 
the distinction between present and preterit is maintained within the same verb. 

11 The first-person future forms Lith. duosiu, Latv. duosu ‘(I) will give’ are essentially 
the thematic equivalents of OInd. dasvami ‘(I) will give’. A Slavic vestige of this 
formation is found in the RussCS future participle of bvti ‘to be’, bvsgsteje, bvsgsteje 
(= Lith. fut. act. ptcp. busiant - : bud ‘to be’) and OCz . probvsucny ‘useful’. 

12 The origin of the *-e and *-a formants remains unclear. Stang (1966: 388) compares 
the Baltic a- and e-preterits to Slavic imperfect formations: Lith. vede Ted’ and OCS 
vedeaxb ‘(I) led’, Lith. rinko ‘gathered’ and OCS zbdaaxb ‘(I) waited’; see also Petit 
2010: 249-254 and Derksen 2011: 208. 

13 This morphological mechanism accounts for a number of apparent exceptions to 
Winter’s Law in Baltic, as an expected acute syllable is replaced by a non-acute 
(circumflex or short). For example, the short root vowel of Lith. dubiis ‘deep, hol- 
low’, repeatedly presented in the literature as an exception to Winter’s Law, is in fact 
associated with the vocalism of intransitive diibti ( dumba , diibo) ‘to become hollow, 
sunken’, an ablaut-based refashioning of the Winter’s Law acute found in the Latvian 
cognate dudbjs ‘deep’ and the Lithuanian transitive diiobti (3 pres, duobia) ‘to hollow 
out’ (see Young 2011). 

14 Baltic lacks the -(e)no- formant found for the past passive participle in Slavic and 
Germanic, although Endzellns (1971: 97) notes that Latv. dial, vedene ‘daughter-in- 
law’ (to vest, 3 pres. vgd ‘to lead; take a wife’) and the like may contain a vestige of 
this formant; cf. OCS vedem>, past pass. ptcp. to vesti ‘to lead’. 

15 The following account of Lithuanian word order, together with examples, is based on 
Ambrazas et al. 1985: 672-691. 
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CHAPTER 3 


SLAVIC 


Marc L. Greenberg 


INTRODUCTION 

The Slavic language family constitutes a widely distributed genetic grouping of languages 
spoken today from Central Europe to the Pacific, represented by standard languages that 
are traditionally divided into three branches: West - Czech, Slovak; Polish; Upper and 
Lower Sorbian; South - Slovene; Croatian, Bosnian, Serbian, Montenegrin; Macedonian, 
Bulgarian; and East - Belarusian, Ukrainian, Russian. The standard languages reflect 
ethnic and cultural distinctions that largely crystallized in the 19th century and as such 
are not ideal reflections of the genetic development of Slavic linguistic variation from 
the proto-language. For example, standard Croatian, Bosnian, Serbian, and Montenegrin 
are based on a single genetic dialect, Stokavian, but now are stylized in their officially 
valorized forms to fit the national identity projects of four post- Yugoslav states. Rus- 
sian has a considerable lexical component of South Slavic origin, owing to its diglossic 
origins in the medieval period. Other Slavic language varieties - sometimes referred to 
as Slavic micro-languages - that have to a greater or lesser extent become codified but 
lack robust (or any) institutional support, exemplify further variegation, e.g., Kashubian 
(a West Slavic language spoken in the environs of Gdansk), Rusyn (Carpathian variety of 
East Slavic with some West Slavic features), Silesian (transitional variety of West Slavic 
between the Czecho-Slovak sub-branch and Polish), Prekmurje Slovene (divergent vari- 
ety of Slovene), and Kajkavian Croatian (non-standardized variety of Croatian with 
affinity to Slovene). Attested but extinct Slavic language varieties, for which descriptive 
information is available, include Polabian (f mid-18 c.; westernmost Slavic language, 
along the Elbe/Laba river in today’s Germany), Pomeranian (f medieval period; along 
the Baltic at the mouths of the Oder and Vistula rivers), and Slovincian (f early 20 c.; 
northwestern Poland, where it was perhaps a variety of Kashubian). Another variety (or 
varieties) of Slavic existed in today’s Austria and Hungary, which, although unattested, 
is “Pannonian” Slavic. 

Textual attestation of Slavic begins in the early second millennium A.D. with the 
Freising Folia (ca. 1000, written in Carolingian [Latin] script in a language with close 
affinity to what would later become Slovene) and the texts of the Cyrillo-Methodian 
tradition proceeding from the mid 19th c., though with extant manuscripts surviving 
from the 1 1th — 12th cc. (Canonical “Old Church Slav[on]ic”). The latter are written in the 
Glagolitic alphabet, which was designed, based in part on existing alphabets, to reflect the 
sound structure of Slavic as it was spoken in the 9th c. around Thessaloniki; or its succes- 
sor, the Cyrillic alphabet, which was based on contemporary Greek uncial script, mapping 
closely the (roughly) letter-to-phoneme correspondences of Glagolitic. An important and 
still growing body of texts are the Novgorod Birchbark Letters, which attest to the ver- 
nacular language (i.e., not influenced by the language of the church) of northern Russia 
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MAP 11.2 MAP OF SLAVIC LANGUAGES 

Titular/state languages are in all caps. Other living varieties'. (1) Lower Sorbian, (2) Upper Sorbian, (3) 
Kashubian, (4) Rusyn, (5) Silesian, (6) Prekmurje Slovene, (7) Kajkavian Croatian, (8) Cakavian Croa- 
tian. Extinct varieties: (a) Polabian, (b) Pomeranian, (c) Slovincian, (d) Pannonian. 


in the 12th 1 5th cc. and, by virtue of the dialect’s peripheral position, illuminate archaic 
features of medieval Slavic. The language variation found in contemporary standard lan- 
guages and dialects along with OCS serve as material for the comparative reconstruction 
of prehistoric stages of Slavic. In the exposition, focus will be on OCS forms, supple- 
mented by material from living languages and dialects, where relevant. 
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Periodization 

Slavic is a satem language (cf. OCS VbSb ‘village’, OInd. vis- ‘dwelling’, Alb. vis ‘local- 
ity’ || Lat. vicus, Gr. o!ko<;, Goth, weihs) and its closest congener the Baltic branch of 
Indo-European (see the chapter on Balto-Slavic). Periodization of Slavic is necessarily 
schematic before the period of the earliest attestations, which coincide with the beginning 
of disintegration into distinct dialect groups that can be more or less mapped onto the 
contemporary standard (“daughter”) languages. In this chapter we will operate with the 
following terminological designations: 

Proto-Balto-Slavic - The emergent dialects whose innovations are distinct from 
Indo-European but are shared with dialects that were to become Baltic and Slavic. 

Proto-Slavic - The emergent dialects whose innovations are distinct from Baltic but 
are shared with dialects that were to become Slavic. The reconstructions of PBS1. and 
PS1. are schematic and may be in some instances anachronistic because the time depth 
precludes precise dating. The representations in these system are given in small caps to 
make them visually distinct from CS1., which is the system underlying attested forms in 
living Slavic languages. Mutatis mutandis, Proto-Baltic refers to the Baltic forms paral- 
leling PS1. 

The term Common Slavic will be used in this chapter to refer to forms that underlie 
the earliest attested and contemporary Slavic languages as well as to the representa- 
tions of these forms that are most familiar to scholars of Slavic, representing the vowel 
qualities presumed to be reflected in OCS (i.e., i, y, u, e, a\ f, q\ e, o, b, r>; see chart 
below). 1 

The Proto-Balto-Slavic period is contested in the literature with regard to the relation- 
ship between the two parts, i.e., whether Baltic and Slavic constituted a unified speech 
community, underwent common and parallel innovations because of proximity and/or 
bilingualism, or converged anew after a period of divergence. The facts cannot be inter- 
preted unequivocally in favor of any one of these scenarios, so perhaps the most prudent, 
if unsatisfyingly equivocal, view is to consider the Proto-Balto-Slavic period one of a 
closer-knit interaction than with any other dialects emerging from Indo-European. 


Emergence of Slavic as a historical speech community, migrations 

The Late Common Slavic period (ca. 500-1000 A.D.) coincides with the moment when 
a Slavic speech community appears relatively clearly in the historical record (mentioned 
by the Byzantines in 527; see Schenker 1995: 15ff.) as it expands from its core area and 
comes into contact with other speech communities. How the expansion proceeded has 
engendered considerable debate, ranging from a traditional view that sees the westward 
spread as owing to migration (e.g., Andersen 1996, 1999, 2003, Doluxanov 2000, Green- 
berg 2010) to a diffusion model that, in its fully articulated version, views the speech 
community as having developed a homogenized lingua franca used by both Avar elites 
and confederates, including the Slavs, in the Avar Khaganate (Lunt 1984—1985). For fur- 
ther discussion see Nichols 1993, Curta 2004, Snoj and Greenberg 2012. 

On a micro-level the settlement pattern of Slavic communities centered on the zupa 
(< *gewpeh 2 ; Snoj 2003: s.v.) ‘a traditional district organized around a tribe’, which 
Pleterski describes as the building block of a “fractal society” of independent or loosely 
connected tribal units (2013: 10). These units were also the units of migration, which is 
reflected in the geographical dislocation and distal association of dialect features (see, for 
example, Andersen 1996, 1999, Schallert & Greenberg 2007). 



522 MARC L. GREENBERG 


In turn, the issue of whence the migrations originate has been the subject of contro- 
versy, though the clusters of Slavic hydronyms, in proximity with Baltic ones, found in 
the Dniester and Dnepr river systems of today’s Ukraine provide a good anchoring point 
(Trubacev 1 968), after which the Slavic speakers most likely spread by way of and across 
the Danube into central and southern Europe. 


Disintegration of Common Slavic and language contact 

The second millennium A.D., often referred to as the time of “disintegration” of Common 
Slavic (Pol. ivzpad, Russ, raspad), was a time of significant divergence among the Slavic 
languages owing to increasing geographical spread, linguistic drift, contact with Slavic 
and non-Slavic languages, and the rise and fall of local, confessional, and national iden- 
tities. Structural effects that might be mentioned are the preservation of case marking in 
East Slavic and enrichment with additional case distinctions in Russian, brought about by 
contact with Finnic, viz. the partitive genitive and secondary (“concrete”) locative cases in 
standard Russian, which may be seen as just the tip of the iceberg with regard to conver- 
gent developments that have taken place among Baltic, Finnic, and northern Russian dia- 
lects. Central European Slavic languages, including the living West Slavic languages and 
the western South Slavic languages (especially Slovene and Croatian), have convergence 
features in morphosyntactic and lexical domains that are due to centuries-long contact 
with German, e.g., the use of prefixed imperfective verbs in narrated sequences of events 
(see Dickey 2011), or the development of a definite article, parallel to German die/ der Idas 
and Hungarian a(z), in Czech (Doudleby dialect) t_i tahouni se pouzivali po posekdhi obili 
... a gdis se strnist’e spatne posekalo, to tam bili t_i kozi ‘the draft horses were used after 
harvesting the crops . . . and when the stubble was poorly cut, then came the goats’ (Holub 
2014: 183). The eastern South Slavic languages, Macedonian and Bulgarian, have under- 
gone significant grammatical change as part of the Balkan convergence area. The dramatic 
nature of the structural change is heightened by the fact that in the earliest texts, OCS (in 
focus in this chapter) reflects the language state not long after it was relatively unified 
(“Common Slavic”), and the epicenter of Balkan features is in today’s Macedonia, the 
home territory of the first Slavic language planners, Constantine and Methodius. 


PHONOLOGY 

Vowels 

TABLE 11.11 PROTO-INDO-EUROPEAN TO LATE COMMON SLAVIC VOWELS 

PIE PS1. LCS1. 


*ey 

*oy 

*ay 

*u 

*ow 

*aw 

*ew 


long vowels and diphthongs /tense/ [long] vowels 

*1 / 

*EY 

*AY 

*u y 

*AW U 
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PIE 

PS1. 

LCS1. 



long vowels and diphthongs 

/tense/ [long] vowels 

*e 

*6 

e 

*oy 

*AY 


*ay 



*5 

*A 

a 

*a 



*en, ’em 

*EN 

? 

*m, *n 

*IM, *IN 


*on, *om 

*AN 

9 

*m, *n 

*UM, *UN 


*er 

*ER 

er 

*or 

*AR 

or 

*ar 



*el 

*EL 

el 

*ol 

*AL 

ol 

*al 




*1R, *UR 

r || hr, hr 

*1 

*IL, *UL 

1 1| bl, bl 


short vowels 

/lax/ [short] vowels 

*i 

*1 

b 

*u 

*u 

b 

*e 

*E 

e 

*0 

*A 

o 


Front 

Back 

High 

I/i 

u/u 

Low 

e/e 

A/a 


Monophthongization of diphthongs 

The transition from the PS1. to the CS1. stage, which underlies the attested and living 
systems of Slavic languages, is marked by the reshaping of syllables on the principle 
of “rising sonority” (see also below with regard to consonantal changes). In this set of 
developments, diphthongs monophthongized, and the functional load was transferred 
from quantitative to qualitative vowel contrasts. The equivalences are given in the chart 
above. The Late Common Slavic vowel system, which is close to that underlying OCS, 
was structured thus: 

TABLE 11.12 LATE COMMON SLAVIC VOWELS 

front back 



tense 


lax 

tense 

lax 

tense 

high 


i 


y 


u 

high-mid 







central 



i> 


T> 


mid 



e 


0 


low 

5 

e 


a 
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The monophthongization being a syllable-level phenomenon, diphthongs were treated 
differently in closed (tautosyllabic position) vs. open syllables (heterosyllabic), which 
gave rise to new alternations, e.g., *plew-/plow- > PS1. *plaw-tey, *pla|u-e-xI > OCS 
pluti, plovesi ‘flow, swim’ inf., 2 sg.; PIE *s w e-kruh 2 - > PS1. *swekru, *swekru|w-e-s 
> OCS svekiy, sveknve ‘husband’s mother’ nom. sg., gen. sg. (cf. Sin. kri, krvi ); PIE 
*ken-/kn- > PS1. *zA-k(e)n-tEi, zA-ki|n-e-xI > OCS zacqti, zacbnesi ‘begin’ inf., 2 sg. With 
regard to relative chronology, the monophthongization followed the first palatalization 
of velars and preceded the second and third palatalizations. Moreover, the change of PIE 
*ew > CS1. *(j)u antedates deiotation following consonants, e.g., PIE *lewd h eyes > PS1. 
*lyawd-iy-e > OCS ljudije ; PIE *kewmo- > PS1. *syawm-a-s > OCS sunn, (also spelled 
sjunib) (see also below). Note, however, that heterosyllabic *ew merges with *aw, e.g., 
PIE *newo- > OCS novb ‘new’. 


Syllabic synharmony and palatalization 

The opposition of back-front was fundamental to the system, as is evident from the fact 
that (at first) consonants followed by front vowels were sub-phonemically palatalized. 
Palatalizations of velars and changes in clusters of Cj into palatal consonants (“deio- 
tation”) introduced a systematic opposition between plain and palatal consonants (see 
further below). On the one hand, there was a tendency to adjust the tonality of vowels so 
that palatal consonants were followed by front vowels (“intrasyllabic harmony”), e.g., 
PIE *moryo- > PS1. *mar-y-a-(n) > OCS morje. The beginning of phonemic palataliza- 
tion is perhaps attributable to the reinterpretation of PS1 *E (CS1. *e) as *a after palatal 
consonants, which disrupted the pattern of intrasyllabic harmony, e.g., PIE *g wh er- > PS1. 
*gEr-a-s > CS1. *zan> (Russ., Sin., BCMS zar). This points to the view that CS1. was 
sub-phonemically diphthongized [ y as], and subsequently raised to a high-mid diphthong 
throughout most of the Slavic territory, as is also indirectly recoverable from later reflexes 
of *e, cf. OCS reka, Slk. rieka, Croat, rijeka. The onglide portion of the diphthong was 
evidently reinterpreted as a feature of the preceding consonant. Palatalization remained a 
fundamental element of the phonological system of Russian and Belarusian, whereas in 
Ukrainian and the West Slavic languages palatalization has been positionally eliminated 
and, in South Slavic in all but Bulg., eliminated altogether, save for the contrasts that 
arose with the loss of post-consonantal j (see below). 


Elimination of diphthongs with r, 1 

Sequences of vowel + r, l were eliminated in the LCS1. period and can be dated to the 
second half of the first millennium A.D., as evidenced by borrowings of toponyms assim- 
ilated during the migrations, e.g., Finn. Alvatti — » Russ. Lovat’ (river name), Finn. *aal- 
dokas — > Russ. Ladoga (lake name), Lat. Albona — » Croat. Labin (toponym), Lat. Arba 
— > Croat. Rab (island name). The process ended by the 9th c., as borrowings related to 
Slavic Christianization show up without metathesis, e.g., Lat. altare — » OCS olbtan 
‘altar’. Word-internal developments show dialect differentiation, where the open-syllable 
principle is realized differently for West and South Slavic, which show metathesis and 
lengthening of the vowel, and East Slavic, where the vowel is duplicated on either side 
of the consonant. The development was not carried through fully in Polabian. Examples: 
PIE *(h,)wolko — > PS1. wals-a-s > Plb. vlas, Pol., ULSorb. wlos, Cz., Slk. vlas, Sin. 
las, Mac., Bulg. vlas, OCS vlas-b || BRuss. volas, Ukr., Russ, volos; PIE *g h ord h o- > PS1. 
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*gard-a-s > Plb. gord, Pol. Bialogard (toponym in N. Poland) || Pol. grod, Cz. hrad, 
USorb. hrod, Sin., BCMS, Mac., Bulg., grad, OCS gradt || BRuss. horad, Ukr. horod, 
Russ, gorod ‘city’; PIE *derg h no- > *PS1. dern-a-s > Plb. dren, Cz. dfin, Slk. drieh. Sin. 
dren, Croat, drijen, Mac. dren, Bulg. drjan || BRuss. dzeran, Ukr., Russ, deren ‘cornus 
mas’. 


Contraction, jerfall 

Among the final processes that marked the restructuring of Slavic are the loss of intervo- 
calic j (“contraction”) and the “fall of the jers” (or “Havlik’s Law”), the alternating loss of 
the short lax vowels, b, 7>. These processes began ca. the 10th c. A.D. in West and South 
Slavic and progressed eastward, where contraction was limited in East Slavic and jer 
fall progressed well into the 12th c. Contraction resulted in increased frequency of long 
vowels in those areas where quantity distinctions remained phonemic, cf. CS1. *moldaja 
‘young’ f. sg. def. > Cz., Slk. mlada || Sin., BCMS mlada || Russ., Ukr. molodaja; CS1. 
*delajesi ‘you make, do’ > Cz. delas || Sin. delas || Russ, delaes [d’elajis], 

Havlik’s Law identifies a process whereby word-final jers are lost universally and, by 
a process of compensatory lengthening, a jer in a syllable following a lost jer lengthens 
and merges with an existing vowel (with dialectally conditioned results). A jer in a syl- 
lable preceding a non-jer vowel elides. The change had wide-ranging effects on syllable 
structure, vowel systems, and word prosody, among the chief ones being the end of the 
centuries-long development of open syllables. Examples: CS1. ’"k'bto ‘who’ > Pol. kto, 
Croat, tko, Russ, kto, BRuss., Ukr. xto; CS1. *vi.dovbCb, *vbdovbca ‘widower’ nom., 
acc./gen. sg. > Pol. wdowiec, wdowca, Cz. vdovec, vdovce. Sin vdovec, vdovca, BCMS 
udovac, udovca, Mac. vdovec, Russ, vdovec, vdovca, Ukr. vdovec’, vdovcja ; CS1. (with 
falling, “circumflex” stress) *'di>m> ‘day’ > Pol. dzien, Cz. den, Slk. den. Sin., BCMS dan 
(long falling stress), Mac., Bulg. den, Russ., Ukr. den ’, BRuss. dzen CS1. (with rising, 
“neo-acute” stress) *pt's'b ‘dog’ > Pol. pies, Cz., Slk. pes. Sin. pes [pas] (short falling 
stress), BCMS pas, Mac., Bulg. pes, Russ, pes, BRuss. p’os, Ukr. pes. 


Suprasegmentals 

The development of Slavic word prosody persists as one of the most dynamic areas of 
inquiry, and considerable debate surrounds its details. A basic account recognizes that 
PS1. developed a paradigmatic accent system with three patterns, which since Stang 1957 
are referred to as accent paradigms (a.p.) A, B, and C. A.p. A is characterized by sta- 
ble stem stress throughout the paradigm and is characterized by the “old acute” accent, 
which developed on long syllables typically of laryngeal origin. A.p. B has consistent 
stress on the ending alternating with “neo-acute” accent in the first syllable before the 
ending. The mobile a.p. C displays alternations between ending-final stress (or stress on 
the endings in acute -marked grammatical morphemes) in some forms and a falling tone 
(“circumflex”) on absolute initial syllables. In PS1. (and PBalt.) each syllable possessed 
a suprasegmental marking (or lacked it), which by LCS1. had resolved to a single stress 
for each phonological word. 

A.p. A and B originate in PIE stem-stressed (barytone) paradigms and were differen- 
tiated by laryngeal (non-apophonic) length in stems of type A. After the application of 
Hirt’s Law, (“old acute”) stress was retracted or, in the case of two consecutive laryn- 
gealized syllables, assigned to the first of them, e.g., PIE *korh 2 w-eh, > PS1. *karw-a~ 
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> CS1. korv-a > Sin. h-ava, Russ., Ukr. korova ‘cow’, cf. PIE *g w en-eh, > PS1. *gen-a 

> CS1. *zen-a > Russ, zena ‘wife, woman’. Non-acute stems underwent a forward shift 
of stress (Dybo’s Law), which was subsequently retracted in internal syllables (Stang’s 
Law), which in turn gave rise to the “neo-acute” rising-stress profile of a.p. C. The Slavic 
mobile type arose in PIE end-stressed and mobile paradigms (Meillet’s Law) with analog- 
ical loss of the laryngeal (“old-acute”) marking where applicable. The Slavic paradigms 
correspond to Lithuanian accent paradigms 1 (= A), 2 (= B), 3, 4 (= C) as in the following 
chart. Different scholars have treated the origin of the mobile pattern in Baltic and Slavic 
differently, with the two major schools - the Moscow School, led by Vladimir A. Dybo, 
and the Leiden School, led by Frederik Kortlandt - viewing the pattern as archaic (Mos- 
cow) vs. innovative (Leiden) with respect to Vedic and Greek. The exposition here cannot 
do justice to the complexities of the developments, for which reason the interested reader 
is referred to the critique in the appendix to Lehfeldt 2001 and to Kapovic 2015 (esp. 
195ff.) for the current state of the art, as well as the literature cited therein. 


TABLE 11.13 PIE, BALTIC, AND SLAVIC ACCENT PARADIGM CORRESPONDENCES 


Vedic, Greek, Germanic 

Baltic 

Slavic 

barytone 

1 - Lith. stem-stressed with acute 

A - stem-stressed with “old-acute’ 


tone; glottalized (“broken”) tone 

stress; secondary traces of 


in Lith. Lowland (Zemaitian) 
dials, and Latvian 

glottal stop or glottalization 


2 - Lith. stem-stressed non-acute 

B - ending-stressed alternating 


tone with end stress on endings 

with neo-acute on stem-final 
syllable 

oxytone 

3 - Lith. mobile-stressed with 

C - mobile stressed, alternating 


“acute” tone in stem-stressed 

between word-final stress and 


forms; glottalized tone in Lith. 

initial (recessive) “circumflex” 


Lowland dials, and Latvian 

stress 


4 - Lith. mobile-stressed with 



“circumflex” tone in stem- 
stressed forms 



“Old acute” stress is often marked with a double acute (a) to distinguish it from the 
“neo-acute” (long: a, short: e) in order to avoid specifying the nature of the specific 
suprasegmental properties in question. The cover terms may be read as empty labels 
for suprasegmental contrasts. However, in this author’s view the laryngeal origin of the 
old-acute stress type can be read as “globalized”; indirect evidence of its persistence in 
Slavic as either a globalized (creaky-voiced) vowel or a glottal stop up through LCS1. is 
found in South Slavic pitch-accent systems: In Slovene in non-final stressed syllables, the 
tone contour is low in a phonetically long syllable (Sin. brata ‘brother’ gen. sg.), while 
in final or monosyllabic words the tone is high in contrastively short syllables (Sin. brat 
‘brother’ nom. sg., vs. long falling in “circumflex” brat ‘to go read’ sup.). In BCMS the 
reflex is short falling stress, e.g., brat, brata. To wit, creaky-voiced syllables are cor- 
related with low pitch and vowel lengthening, while glottal stops are correlated with high 
pitch and vowel shortening. In other words, on this view, which builds on Kortlandt’s 
view of the persistence of laryngeals in Baltic and Slavic (see Kortlandt 1985) creaky 
voice developed from the “old-acute”-marked syllables in non-final syllables in Sin., into 
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glottal stops in final or monosyllables and, in BCMS, in any old-acute-stressed syllable, 
and were subsequently reinterpreted upon the loss of the glottal feature as either long 
rising tone or short falling stressed syllables, respectively (Greenberg 2007, Holub & 
Greenberg 2013). In accord with tradition the sign for the long neo-acute (a) is used here, 
where it means a long rising tone, as is found, for example, in Kajkavian and Cakavian 
Croat, susa ‘drought’ (< LCS1. susa < PS1. *saux-y-a). The grave accent (e) signifies the 
neo-acute in short syllables, e.g., LCS1. kdh> ‘stake’ (< PS1. *kal-as), *vol’a ‘will’ (noun) 
(< PS1. *val-y-a), as well as stress in endings that is not of laryngeal origin, e.g., *kola 
‘stake’ gen. sg. In the latter case, the signification recognizes that in examples where 
length has been preserved or extended to the stressed syllable, the surface forms in tonal 
dialects display rising pitch, e.g., Cakavian Croat, noge ‘of a foot’ gen. sg. In the case 
of the “circumflex,” the signs borrowed from Serbo-Croatian dialectology for the falling 
tone are typically used (long: a, short: e). Here, instead, for reconstructed Slavic forms 
the ictus mark ('C V) is used to emphasize the default (phonemically unstressed) nature of 
the inherited Proto-Slavic circumflex, which is a fixed (falling-contour) stress in the first 
syllable of the phonological word, e.g., CS1. *'nog9 ‘foot’ ace. sg., *'na nogq ‘onto (the) 
foot’. This configuration is preserved as such in, e.g., BCMS nogu, na nogu, as well as 
reflected in the stress pattern of Russ, nogu, na nogu (see Jakobson 1963: 71-72). 

TABLE 11.14 ACCENTUAL PARADIGMS 

Slavic a.p. PIE Baltic Slavic 


A 


B 


C 


*breh 2 ter > Ved. bhratar-, 
Gr. (ppaiip ‘brother’ 
*d h uh 2 mo- > Ved. dhuma-, 
Gr. llfiLioc/snioke' 
*seh,-ye/o- ‘sow’ 

*d h worom > Ved. dvaram 
‘courtyard’ 

*wob h seh 2 > OHG waspa 
‘wasp’ 

*g wh en-, Ved. hdnti, Gr. 

0Etvco‘hit, drive (trans.)’ 
*suHnus, Ved. sunti- ‘son’ 
*bh 2 yom, Gr. miov ‘egg’ 
*g h eym-eh 2 , Ved. hima 

*mer- ‘die’, Ved. mdrate 
‘dies’ 


Lith. brolis (a.p. 1), Latv. brdlis 

Lith. sett, seju 
Lith. dvaras (a.p. 2) 

Lith. vapsd (a.p. 2 > 4) 

Lith. ginti, genu 
Lith. sums (a.p. 3) 

Lith. ziema (a.p. 4) 

Lith. mirti, mirstu 


CS1. *bratrb, gen. sg. *bratra > 
Sin. brat, gen. sg. brata 
CS1. *dymb, *dyma > Sin. dim, 
dima 

CS1. *sejati, sejo. Croat, s'ijati, 
sijem 

CS1. *dvorb, gen. sg. *dvora > 
Sin. dvor, dvora; Croat. Cak. 
dial, dvor, dvora 
CS1. *(v)osa, *(v)os() > Sin. 

6s a, oso 

CS1. *gr>nati, *zen 9 , *zenetb > 
Sin. gnciti, zenem, zene 
CS1. *'sym,, Sin. sin 
CS1. *'aje, Croat, dial .jaje 
CS1. *'zima, *'zimg, Russ, zimd, 
zimu. Sin. zima nom. sg., na 
zimo ‘in winter’ (acc.) 

CS1. *merti, *'mbr(j, *mbretb> 
ORuss. umru, umretb 


Winter ’s Law 

In Baltic and Slavic an additional suprasegmental peculiarity, noted relatively recently 
(Winter 1978), is the appearance of acute-marked length in otherwise short-vowel sylla- 
bles preceding a PIE voiced unaspirated stop, e.g., PIE *sed- ‘sit’ > PBalt., PS1. sed-e-tey 
‘sit’ inf. > Lith. sedeti, OCS sedeti (vs. Lat. sedere). Kortlandt (1978) has claimed that 
this feature corroborates Gamkrelidze and Ivanov’s reconstruction of the PIE consonant 
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system as consisting in plain : glottalized : aspirated rather than the traditional plain 
: voiced : voiced aspirated (Gamkrelidze & Ivanov 1973; see also Vermeer 1984: 335, 
Kortlandt 1985), in which it is assumed that the glottalic feature for the consonant is 
transferred to the preceding vowel. Winter’s Law is also correlated with the appearance 
of acute stress in the relevant syllable, suggesting that, if the glottalized interpretation 
of the old plain voiced series is correct, then the glottalization was transferred to the 
preceding vowel in Baltic and Slavic. In the chart below, the LCS1. reflexes include later 
changes marked “cond.” for conditioned changes, explained forthwith. 


Consonants 


TABLE 11.15 PROTO-INDO-EUROPEAN TO LATE 


COMMON SLAVIC CONSONANTS 


PIE 

PS1. 

LCSl. 

*p 


P 

*b 

* B 

b 

*b h 

*t 

4* -p 

t, cond. f 

*d 

*D 

d, cond. d’ 

*d h 

*k 

*K 

k , cond. c, c 

*k w 

*g 

*G 

g, cond. dz > z, dz 

*gw 

*g h 

*gwh 

*g 

*Z 

z, cond. z 

*g h 

*k 

*S 

s, cond. x (> s, s) 

*s 

*M, 

m 

*n 

*N 

n, cond. n 


*R, *L 

r, cond. r; /, cond. /’ 

*y, *w 

*Y, *W 

7> 


Elimination of geminates, dissimilation *t/d > before *t 

Inherited geminate fricative clusters were eliminated, e.g., PIE *skoy > PS1. *sAi-ni 

> OCS senn ‘shadow’, PIE *tuh,s-(d)kmt- > PS1. *tusint-y-I > OCS tysgsti ‘thousand’. 
Clusters of two dentals stops become fricative + stop. This change is late PIE, as it is 
shared by Baltic as well as Indo-Aryan, Germanic, Celtic, Latin, and Albanian, e.g., PIE 
*h,ed — > PS1. *yes-tey > OCS jasti ‘to eat’; PIE *k w itti- > PS1. *kist-i > OCS cbstb 
‘honor’. A small number of examples point to the Slavic merger of PIE *pt > PS1. *tt > 
st, e.g., ORuss./OCS strvi ‘father’s brother’, ORuss. Stribogb ‘Slavic pagan god name’ 
(both derived from PIE *ph,tr- ‘father’); similarly ORuss. nestera ‘niece’ (< PIE *nept-), 
though PS1. *tt > t is also observed, e.g., OCS netii ‘nephew’. 


RUKI change 

PIE *s changes to x in PS1. following the segments r, u, k, i, a change that pre -dates 
the merger of the fricative reflex of PIE *k in Slavic with PS1. *s, e.g., PIE *wrs- > PS1. 
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*wirx- > ORuss. vbrx'b, but PIE *pork- PS1. *parsen(t)- > OCS pras f ‘pig’. The change 
parallels a similar development in Indo-Iranian (s) and Baltic ( s ), as well as, possibly, 
Armenian (p. 206, 495, 438). Further examples: PIE *snus-eh 2 > PS1. *snuxA > OCS 
sirbxa ‘daughter-in-law’ (cf. OInd. snusa ); PIE *ksewb h — » PS1. xOBA > Cz. chyba ‘mis- 
take’, Ukr. xvbaty ‘to wobble 1 (cf. OInd. ksubhyati ‘to shake, vacillate’); PIE *maysa- > 
PS1. *mayx- > OCS mex'b ‘bladder, bellows’ (cf. Lith. maisas ‘bag’). The change fails 
to occur before an obstruent, giving rise to alternations as in OCS rexb ‘I said’ aor. vs. 
resta ‘they both said’ aor. < PS1. *rek-s-am, *rek-s-tA. Word-initial x- developed from 
the generalization of the A-reflex from prefixed forms (as well as initial *sk- > *ks-), 
e.g., OCS (pri)xoditi ‘to walk’ (‘to approach’) <— PS1. *prey-sad-I-tey (derived from PIE 
*-sed-/-sod-/-sd- ‘sit’). 


Loss of final consonants 

Slavic underwent a series of changes often referred to as the “law of rising sonority” or the 
“law of open syllables,” which eliminated syllables culminating in a segment other than 
a vowel. The changes developed from the elimination of consonants in word-final posi- 
tion, likely in the first centuries A.D., to the elimination of liquid diphthongs in the last 
centuries of the first millennium (the type PS1. *gard-as > OCS gradb || ORuss. gorodb 
‘fortification, city’). These changes included the monophthongization of diphthongs and 
the development of prothetic consonants, as well as entailing structural changes in gram- 
matical endings. These issues will be discussed below in the appropriate sections. Here 
we will focus on some of the early changes in Slavic. The earliest change is loss of final 
*-r, which is shared with Baltic, e.g., PIE *meh 2 ter — > PS1. *mAtI, > OCS mati ‘mother’, 
PBalt. *mAte > Lith. mote ‘wife’. 


Cluster simplification 

In addition to the adjustments in clusters mentioned above, remaining clusters and those 
arising in word formation were simplified, the result of which is the elimination of all 
clusters aside from fricative + stop, cf. PIE *yeh 2 - ‘travel’ + *-sd- ‘sit’ (0-grade) — > 
PS1. *yezd-I-tey ‘to ride’ > OCS jazditi ‘travel’, BCMS jezditi ‘to ride a horse’ [Green- 
berg and Dickey 2006]), though syllable onsets could also end in a sonorant, e.g., PIE 
*h,eyskreh 2 > PS1. yIskr-A > OCS iskra ‘spark’. Examples: CC > CC - PIE *wob h hseh 2 
> *PS1. as-A > OCS osa ‘wasp’; PIE *h,rouwd h so- > PS1. *raws- > OCS rush ‘red’; 
h[dsnt-sni- > PS1. *densn-i > OCS *d§sm> ‘gums’; PIE *lowksneh 2 > PS1. *lawn-A > 
OCS luna ‘moon’; *d h g h emi ‘earth’ loc. sg. — » *zem-y-A > OCS z emlja ‘earth’; PIE 
*d h olb h — > PS1. *dal-ta > OCS dlato, Croat, dlijeto ‘chisel’; CC > CC - PIE *ob- 
+ *h,welk-/ h,wolk- ‘drag’— >■ PS1. *ab-alk- > OCS oblakr, ‘cloud’; PIE *nok w ts — > 
*nakt-i- > OCS nostb ‘night’; PIE *wronk-eh 2 > PS1. *rank-A> OCS *rqka ‘hand’. 

The cluster *sr developed excrescent -t- in conformity with the cluster constraint 
adduced above, an innovation also found in Baltic, Albanian, Germanic, and Celtic, e.g., 
PIE *srew-/srow- ‘flow’ — > PS1. *straw-g-A, *straw-m-0, -m-en- > OCS struga ‘flow’, 
Strumien ‘hydronym in Poland’, cf. Eng. stream. The constraint remained productive into 
the historical period, e.g., OCS Izdrail’b ‘Israel’; Vbzdrydati ‘to break out crying’ perfect., 
cf. rydati ‘to cry’. 

A particular case is -gt-/-kt- before I/i (and later PS1. y), which merged with reflexes 
of CS1. *f, *d h ugh 2 ter- > PS1. *dukt-I> OCS dbsti ‘daughter’, Russ, doc’, OCz. dci, for 
which reason */kt/ was likely realized as *[t] (see Vermeer 2014: 217 and below). 
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TABLE 11.16 CLUSTER SIMPLIFICATION 

cc > ec cc> or 

PS, TS, KS > S (X) VR-, VL- > R, L 

TK, DO > K, O BV > V 

KT > T [T] 

PT > ST, T 

TM, TN, DM, DN > M, N 


First palatalization of velars 

Among the first Slavic innovations exclusive of Baltic is the palatalization of velars *k, 
g, x before front vowels, which occurred after the RUKI change and before the monoph- 
thongization of diphthongs. This places the change in the first half of the first millen- 
nium A.D. Borrowings from Slavic into Greek show that it was completed by the 7th 
c. A.D. (e.g., Cbrnica > Tospvian ‘toponym in Messenia’, *Struzija > Expound ‘top- 
onym in Phoeis’, *Vtrsi>ci> > Bepoixoi ‘toponym in Achaia’). The reflexes are uniform 
in Slavic: PS1. *k, g, x > CS1. *c, z, s. It is generally considered that *z had originally 
paralleled the development of *c by having an (allophonic) affricate stage dz , which is 
observed only indirectly in clusters of *zg before a front vowel, e.g., PS1. *drozg-iy-A > 
CS1. *drozdzi>ja > OCS drozdija (with other vocalic affixes) Ukr. drozdzi, Prekmurje Sin. 
drozdze ‘yeast’. Accordingly, *sk also resulted in a combination of fricative + affricate, 
e.g., PIE *h,eys-ske- > PS1. *aysk-e-mas > Sin. iscemo ‘seek’ 1 pi. 


TABLE 11.17 FIRST VELAR PALATALIZATION 


PIE 

PSl. 

OCS 

*kelH- 

*KEL-A 

celo ‘forehead’ 

*pek w - 

*PEK-E-XI 

pecesi ‘bake’ 2 sg. 

*g wh iHsl- 

*GIL-A 

zila ‘vein, nerve’ 

*mag h - 

*MAQ-E-XI 

mozesi ‘can’ 2 sg. 

*kel- > Gmc *x e l ma-z —> 

*XELM- 

slemb ‘helmet’ 

*d h wes — ► 

*DAWX-Y-A 

dusa ‘soul’ 


Prothesis 

Prothetic consonants in initial syllables where the inherited onset was vocalic were 
inserted, though differing results occur in the daughter languages owing to later devel- 
opments. Examples: PIE *h,nh,men- > PS1. *inmen- > CS1. *jbm§ > OCS im§, OCz . jme 
‘name’; PIE *h 2 eskreh 2 > PS1. *Aiskr-A > CS1. *jiskra > OCS isb-a, Cz .jisb-a ‘spark’; 
PIE *dng h uh 2 — > PS1. *enzuk-a-s > *j^zyki. > OCS jfzykb ‘tongue, language; heathen’; 
PIE *n ‘in’ > PS1. UN > CS1. *vt, > OCS vb ‘in’; PIE *upso-ko- > PS1. *usak-a-s > CS1. 
*vysokT> > OCS vvsok-b ‘high, tall’ nom. sg. m. indef. 


Second and third palatalization of velars 

If the monophthongization of diphthongs marks the beginning of Common Slavic, the 
second and third palatalization of velars and “deiotation” give rise to isoglosses cutting 
across the Slavic speech areas, marking the disintegration of Common Slavic. Tradition- 
ally, the development has been broken into two, presumably chronologically distinct, 
changes. However, the second and third palatalizations were likely a single process with 
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two sets of conditioning environments: the “second” palatalization is triggered by the rise 
of new front vowels arising after the monophthongization of diphthongs (see above) and 
the “third” by a preceding front vowel. 


TABLE 11.18 SECOND AND THIRD VELAR PALATALIZATIONS 


PIE 

PSl. 

Attestations 

Conditioning environment 

*k w oyn-eh 2 

*KAYN-A 

OCS cena ‘price’ 

*K before front V (“2 pal.”) 

*k w oyto- 

*KWAYT-A-S 

OCS cvet’b ‘flower’, ‘color’ || 

Pol. kwiat, Cz. kvet 


*wlk w o- 

*WILK-AYXU 

ORuss. vblcexb ‘wolves’ loc. pi. 


*g h oylo- 

*GAYL-A 

OCS dzelo ‘very’, Sin. zelo 


*g h woy-sd- 2 

*GWAYZD-A 

OCS dzvezda ‘star’ Pol. gwiazda, 
Cz. hvezda 


*ksoy-ro- 

*XAYR-A 

*GRAYX-AY 

OCS serb ‘blue-gray’ | 

Pol. szary, Cz. sery 

OCS gresi | Pol. grzeszy, 

OCz. hriesi 


*meh,s- 

*mes(e)nk-a- 

OCS mes§cb ‘moon’ 

*K after front V (“3 pal.”) 

*stig h o- 

*STIG-A 

OCS stbdza ‘path’ 


*wiso- 

*WIX-A 

OCS vbSb ‘all’|| Pol. wszystek, Cz. 
vsechen 



The diatopic variation found in the palatalization reflexes reveals a dynamic period 
of change, where the differing rates of expansion of the innovations result in vary- 
ing relative chronologies (see esp. Andersen 1969, Vermeer 2014). Alongside the sec- 
ond and third palatalizations, the lenition of CS1. *g > y was under way in much of 
central Slavic, ultimately affecting Cz., USorb., Slovak, NW Sin., part of Cakavian 
Croat., BRuss., Ukr. and southern Russ. These are classic examples of center-periphery 
phenomena. For example, in the Novgorod-Pskov area (peripheral), as attested in the 
Birchbark Letters, e.g., kele ‘whole’ nom. sg. m., xeri ‘of a gray cloth’ gen. sg. f. 
(cf. modern Russ, celvj, servj), indicates that the second palatalization failed to reach 
this area. In West Slavic (central) the results of the palatalization of *k, g are identical 
to the results of the deiotation of *t, d (see below). If it is assumed that the second and 
third palatalizations resulted in [f, d’] (palatal stops) in WS1., then the change of *x in 
the second and third palatalization to s follows the systemic phonemicization of the 
series as palatals (rather than alveo-palatals, as elsewhere) (Vermeer 2014: 189). This 
scenario is confirmed by the fact that in West Slavic only the reflexes of the second 
palatalization before v have been phonologized as k, g (Pol. kwiat ‘flower’, gwiazda 
‘star’). Had they been affricated, as elsewhere, depalatalization as stops would have 
been unexpected. 


Loss of post-consonantal) (“deiotation”) 

The last major CS1. change was the simplification of clusters of C + j, which in all cases 
resulted in the elimination of j (whence the traditional term “deiotation”). Dental stops 
were first palatalized and then, in another wave of innovation, assibilated. The early phase 
is attested in the Freising Folia, where we find the spelling imoki (2x) (< CS1. *ji>m 9 t’bji> 




532 MARC L. GREENBERG 


‘having’ pres. act. ptcp. nom. m. sg.); cf. in the same text v ueki (vs> veky ‘for all time’ ace. 
pi.). Dental fricatives merged with palatal fricatives from the first palatalization of velars 
(wherever an affricated realization of *dz < *G + front vowel had not been retained). After 
labials, j became l’, and the sonorants n, /, r developed palatal counterparts. Variation is 
significant, as can be seen in the following chart. 


TABLE 11.19 ‘DEIOTATION’ REFLEXES 



PIE 

CSl. 

West 

South 

East 

*tj > *t’ 

*kwoyt-o- 
‘ bright’ 

*svet’a ‘candle’ 

Pol. swieca, Cz. 
svice, US orb. 

sweca 

Sin. sveca, 

BCMS svijeca, 
Mac. sveka, 
Bulg. svesta 

BRuss. svjaca, 

Ukr., svica, 

Russ, sveca 

*dj > *d’ 

*h!ud h - ‘red’ 

*rtd’a ‘rust’ 

Pol. rdza, Cz. 
rez, Slk. hrdza, 
LSorb. rza 

Sin. rja, BCMS 
rda, Mac. rga, 
Bulg. rbzda 

Russ, rzavcina, 
BRuss., Ukr. rza 

*sj > *s 

^nek-/ 
hjnok- 
‘ carry, take’ 

*nosa 

‘something 
carried, worn’ 

Pol. nosze, Cz. 

nuse 

Sin. nosa 

Russ., BRuss. nosa 

*zj > *z 

*h 2 eg- ‘goat’ 

*koza ‘skin, 
leather’ 

Cz. kuze, Slk. 
koza, ULSorb. 
koza 

Sin. koza, BCMS 
koza, Mac., 
Bulg. koza 

Russ, koza 

*stj > *sf 

*polh 2 - ‘flat’ 

*plast’b ‘flat 
garment’ later 
‘raincoat’ 

Pol. plaszcz , Cz. 
plasf, LSorb. 
plasc 

Sin. plasc, 

BCMS plast, 
Mac., Bulg. 
plast 

Russ., BRuss., Ukr. 
plasc 

*zdj > *zd’ 

*dus-dyu- 
‘bad sky’ 

*dtzd’b ‘rain’ 

Pol. deszcz, Cz. 
desf, Slk. 
dazd\ ULSorb. 
desc 

Sin. dez, gen. 
sg. dezja. Sin. 
Prekmurje dial. 
desc, dezdza, 
Stok. arch. 
dazd, Mac. 
dozd, Bulg. 
ddzd 

Russ, dozd’, BRuss. 
dozdz, Ukr. dose 

*pj > *pl’ 

*plew- ‘spit’ 

*pl’uj 9 , *pl’uva- 
‘spit’ 1 sg., inf. 
stem 

Pol., pluj§, Cz. 
pliji , Slk. 
pi ’ujem 

Sin., BCMS 
pljujem, Mac. 
plue, Bulg. 
pljuja 

Russ, plevat ’, 
BRuss. pljavac ’, 
Ukr. pljuvaty 

*bj > *bl’ 

*b h lewH- 

*bl’uj 9 , *bl’uva- 

Pol. bluj§ , Cz. 

Sin., BCMS 

Russ., Ukr., bljuju. 


‘ overflow’ 

‘vomit’ 1 sg., 
inf. stem 

bliji, Slk. 
bl ’ujem, 
ULSorb. bluju 

bljujem, Mac. 
blue, Bulg. 
balva 

BRuss. bljavac ’ 

*mj > *ml' 

*d h g h emi 
‘earth’ loc. 
sg- 

*zeml’a ‘earth’ 

Pol. ziemia, Cz. 
zeme, ULSorb. 
zemja 

Sin., BCMS 
zemlja, Mac, 
Bulg. zemja 

Russ., Ukr. zemlja, 
BRuss. zjamlja 

*nj > *n’ 

(Gr. Kodcovia 

*dyn’a ‘melon, 

Pol. dynia, Cz. 

Sin. dinja, BCMS 

Russ., BRuss., Ukr. 


‘quince’) 

pumpkin’ 

dyne, ULSorb. 
dynja 

dinja, Mac., 
Bulg. dinja 

dynja 

*lj > *r 

*welh r 

‘choose’ 

*vol’a ‘will’ 

Pol., ULSorb. 
wola, Cz. vule, 
Slk void 

Sin., BCMS, 

Mac., Bulg. 
volja 

Russ., BRuss., Ukr. 
volja 

*ij > * r - 

*g h er-/*g h oro- 

*zor’a ‘dawn, 

Pol. zorza, Cz. 

Sin. zarja. 

Russ., Ukr. zarja. 


‘ shine’ 

dusk’ 

zore, Slk. zora, 
USorb. zerja, 
LSorb. zorja 

BCMS, Mac. 
zora, Bulg. 
zarja 

BRuss. zara 
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Deiotation had also affected velars, with the results being identical to the first palataliza- 
tion, e.g., PS1. *dawx-y-A > OCS dusa ‘soul’. The lack of the reflexes of the j > / 'change after 
labials in West Slavic and eastern South Slavic is a result of generalization of the non-iotated 
forms at the morpheme boundary. Toponyms reveal that the change had taken place through- 
out Slavic, e.g., Pol. Grnbla, Lublin , Cz. Liblin, Bulg. Koprivlen, Popovi Dnvlja. 

PS1. *w became v throughout Slavic, though there is a tendency to preserve the variant 
[w] in syllable -final position in some areas, e.g., BRuss. low. Sin. lov [low] < CS1. *lov'b 
‘hunt, catch’. 

TABLE 11.20 COMMON SLAVIC CONSONANT SYSTEM 


labial 


dental 


palatal 


velar 


stops p 

fricatives / 

affricates 

nasals m 

tap/trill 

liquid 

glides v 


b t 

s 
c 
n 
r 
l 


d 


z 


d 


z 


k g 

X (7) 


n’ 


r 


j 


MORPHOLOGY 

Slavic inflectional morphology continues the synthetic inflection type inherited from PIE, 
conserving case systems in the nominal system and the general patterns of inflection in 
the verbal system, albeit transformed by the syllable restructuring that affected the shape 
of endings. Of particular note is the development of the verbal aspect system in Slavic, 
whereby systematic distinctions of perfective (“completed,” “viewed as a whole”) vs. 
iMPERFECTivE hold for most verbs, e.g., OCS kupiti ‘buy’ perfect. : kupovati ‘buy’ impf. 
Sub-aspectual and lexical distinctions are accomplished by means of prefixation and suf- 
fixation, e.g., (PIE *k (w, eg <h) - >) OCS cezngti, cezng ‘disappear’ inf., 1 sg., imperf. (CS1. 
*jbz-cezngti) > istezngti ‘disappear’ perfect.; istezati ‘idem’ imperf. Within the verbs of 
motion in unprefixed verbs additional aspectual distinctions hold, often subsumed under 
the labels determinate (“unidirectional”) vs. indeterminate (“manner of motion,” “mul- 
tidirectional,” “iterative”), e.g., OCS iti, nesti ‘go’, ‘carry’ det., imperf.; xoditi, nositi 
‘walk’, ‘carry’ indet., imperf. 

Case marking is well preserved in all but eastern South Slavic - Mac., Bulg., and 
southern Serbian dialects, which were affected by contact phenomena in the Balkan con- 
vergence area. The inherited past tense systems were reduced in West and East Slavic as 
well as in Sin. and, to a more limited extent, BCMS, to a single construction based on the 
perfect (aux. of ‘be’ + /-participle). Mac. and Bulg. preserve only simple and coordinated 
tenses, but they also expanded to include evidentiality, “witnessed” vs. “non-witnessed,” 
e.g., Mac. Toj bese vo Skopje ‘he was in Skopje’ vs. Toj bil vo Skopje ‘he was supposedly/ 
evidently in Skopje’ (Comrie & Corbett 1993: 272; though the synchronic category has 
been questioned, see Kempgen et al. 2009: 262-268). 

Nominal morphology 

Nouns 

As in other PIE dialects nominal morphology has become restructured. In Slavic the pro- 
ductive declension patterns have fused PIE theme vowels with case and number markers 
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to make up the characteristic endings, while those in turn have been reshaped by phonetic 
changes in word-final position (see above regarding sound change). Root nouns of the 
type Lat. n ox, noctis ‘night’ have generalized the non-nom. sg. case forms and reinter- 
preted the paradigm as an /-stem, e.g., OCS nostb ‘night’ (<— PS1. *nakt-i-m ace. sg.). 
As a consequence of the rising sonority developments and the fusion of the vowels with 
grammatical markers, Slavic nominal morphology clearly distinguishes between lexical 
stems, which always end in a consonant, and grammatical endings, which consist of or 
begin with a vowel. 

Consonantal stem types have largely merged with vocalic stem types in the daughter 
languages, although, curiously, Sin. has innovated by creating deverbative nouns from 
the relic M-stem class, e.g., osveziti ‘to refresh’ — > osvezitev ‘refreshment’, perhaps as an 
extension of the tendency already observable in OCS to create deverbal nouns of the type 
prel’uby, prel’ubwe ‘adultery’ nom., gen. sg. <— prelubiti (< pre- ‘trans’ + I’ubiti ‘love’), 
as well as conflation with deverbatives of the -twa type, cf. Sin., BCMS, Bulg. bitva 
‘battle’ <— CS1. *biti ‘to beat’. Of note is also the Slavic innovation of an ent- stem class 
denoting offspring, e.g., PIE *d h eh r ‘suckle’ — » OCS detg, detete ‘child’ nom., gen. sg., 
PIE *pork — * OCS prasq,prasgte ‘piglet’ nom., gen. sg. 

The nominal system of Slavic conserves all PIE cases but the ablative - which is 
thought to have merged with the genitive in the o-stem - as well as three genders (fi, m., 
n.) and three numbers (sg., pi, du.). OCS and most of the daughter languages, with the 
exception of Mac. and Bulg., preserve six syntactic cases (nom., ace., gen., dat., loc., 
instr.) and a voc. form. OCS, Sin., and LSorb. retain dual number. With regard to form 
classes OCS displays the consonantal and vocalic stem classes of PIE, though some reor- 
ganization of the classes is already evident. The daughter languages have continued to 
reduce the stem classes, generally with the o- (jo-), a- (jo), and /-stems emerging as the 
target paradigms, subsuming the unproductive consonantal stems. Grammatical gender 
predominantly aligns as follows: o- (jo-) = m., n.; a- (ja) = fi; /- = fem; conflation of case 
forms occurs (most > least): du. > pi. > sg.; instr., loc., dat. > gen. > nom., ace. 


Vocalic stems 

The productive o-stem masculines include older layers of inherited nouns, e.g., PIE 
*sth 2 -lo- > OCS stolb ‘chair’, PIE *memso- > OCS mgso ‘meat’, as well as o-grade 
deverbatives of the type OCS vozt> ‘wagon’ (cf. vezg ‘I convey’), plotb ‘fence’ (cf .piety 
‘I weave’), and numerous derivatives with suffixes with the o-stem shape, e.g., OCS 
ostrogb ‘palisade’ <— CS1. *ostr- ‘sharp’. The M-stem masculines generally merged in 
individual Late Common Slavic dialects with the o-stems, though in some of the daughter 
languages the endings have become part of the masculine declension type, sometimes 
with innovative functions, e.g., marking animacy in Cz. Zbyhk-ovi ‘to Zbynek’ dat. sg. 
(vs. stolu ‘chair’ dat. sg.) and partitivity in Russ, caju ‘some tea’ (vs. cvet caja ‘the color 
of the tea’), the additional case meaning also being an effect of the Finnic substratum in 
northern Russian. 

Innovative -jo- and -ya-stcm declensions developed in Slavic, built on the o- and 
a-stems, respectively. These differ mostly in the adjustment of intrasyllabic harmony, 
e.g., y o-stem *mar-i-am > *mor-e ‘sea’ nom., acc. sg. Inherited yo-stems are few, but 
the class remained productive and includes deverbatives such as OCS vozdb ‘leader’ 
(cf. voditi ‘to lead’), as well as newer borrowings, e.g., OCS kral'b (from Germ. Karl 
‘Charlemagne’). Innovative palatal stems, such as those that arose through the second 
palatalization of velars, also decline as yo-stems, e.g., OCS pengckb ‘coin’ (from Germ. 
penning). The ya-stem type subsumed the relics of the *-ih 2 feminines, still attested by a 
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small number of nouns in OCS, e.g., PIE *aldih 2 > OCS ladii ‘boat’, PIE *peh,w — » OCS 
pustyn'ii ‘desert’ as well as in the f. nom. sg. form of the pres. act. ptcp., e.g., OCS nesgsti 
‘carrying’ (< PS1. *nesant-y-I). 


TABLE 11.21 

VOCALIC STEMS 


class 

PIE 

OCS 

o-stem 

*g h ord h o- 

gradb ‘fortified place, city’ m. 


*moyth 2 to- 

mesto ‘place’ n. 

7 ' 0 -stem 

*mangyo- 

mpzb ‘man, husband’ m. 


*polh 2 yo- 

pole ‘field’ n. 

w-stem 

*med h u- 

medb ‘honey’ 

i- stem 

*nih 2 ti- 

nitb ‘thread’ 

o-stem 

*g w eneh 2 

zena ‘woman, wife’ 

70 - stem 

*d h g h emy-eh 2 

zemlja ‘earth’ 


TABLE 11.22 

CONSONANTAL STEMS 


class 

PIE 

OCS 

n- stem 

*seh,men- 

sem§, gen. semene ‘seed’ 

r-stem 

*d h ugh 2 ter- 

dbsti, dbstere ‘daughter’ 

5-stem 

*klewos- 

slovo, slovese ‘word’ 

nt-stem 

*ag w no- 

agn§, agnete ‘lamb’ 

u (uw)-stem 

*swekruh 2 - 

svekry, svekrbve ‘husband’s mother’ 


Nominal declension 

In the following chart the presumed PIE input is compared with the attested OCS endings 
(CS1., where b occurs before j, which yields ij in OCS), which are presumed to be similar 
to those of LCS1. Wherever the arrow — > is used, a non-linear development is presumed 
to have occurred. The trajectories of the endings, as well as the PIE input, are a subject 
of much debate, and the detail needed to explain them far exceeds the scope of this chap- 
ter. An emblematic issue is the origin of the -r> ending in the nom. and ace. sg. cases of 
the o-stems. The expected outcome, following the regular phonetic development, loss 
of final consonants, would have been nom. sg. *-os > -o. (This outcome is in fact repre- 
sented regionally in proper nouns, where the ending may have earlier been interpreted as 
vocative or hypochoristic, BCMS Branko, Vlado [<— Branislav, Vladimir]', Ukr. Danylo, 
Petro. In the Finnic substratum areas of Pskov-Novgorod, as attested in the Birchbark 
letters, the vocative ending -e was generalized, on which see Vermeer 1994.) Such an 
outcome threatened merger with the neuter o-stems, as well as neutralizing the distinc- 
tion between agent and patient roles. Among alternatives - other solutions have been 
considered - was to select the u-stem endings in -b, which did not solve the problem of 
agent-patient distinctions. This problem was repaired by the development of the category 
of animacy, whereby genitive case marking was introduced for animate referents in the 
accusative position, e.g., OCS vbsi bo eko proroka imgtb Ioana (Matthew 21:26, Codex 
Marianus) ‘all thus as a prophet- anim. take John-ANIM. ’ The input of the instr. sg. and dat., 
instr. pi. and du. forms assume a PIE dialect input of *-m- (as opposed to *-b h -), common 
to Germanic, Baltic, and Slavic. The schema below shows the expected PIE input on the 
left and CS1. on the right (with PS1. for illustration in some instances), where > indicates 
a direct phonetic continuation and — » indicates a post-PIE rearrangement of the material 
(PIE input, with some adjustments, based on Sihler 1995: 248). 
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TABLE 11.23 VOCALIC STEM ENDINGS 


o-stems 

o-stems /-stems 

o-stems 

singular 

nom. 

*-OS — > *U > -b 

*-us > -b *-is > -b 

*-eh, > -a 

(n.) 

*-om > o 



acc. 

*-om — * *u > -b 

*-um > -b *-im > -b 

*-eh,m > -o 

gen. 

*-oh 2( e)t (abl.) > -a 

*-ows > -u *-eys > -i 

*-eh,os — > *-u > -y 

dat. 

*-oy —> *aw > -u 

*-ewey > -ovi *-eyey > *-ey > -i 

*-eh,ey > *-ay > -e 

instr. 

*-oh, — > *AMI > 

*-umi > -bmb *-imi > -bmb 

*-eh,mi —* *-ayam 


-omb 


> -0)9 

loc. 

*-oy > -e 

*-ew > -u *-ey > -i 

*-eh,y > *-ay > -e 

voc. 

*-e > -e 

*-ew > -u *-ey > -i 

*-h 2 > -o 

plural 

nom. 

*-oy > -i 

*-ewes > -ove *-eyes > -bje 

= acc. pi.? >y 

nom., acc. n. 

*-eh 2 > -a 



acc. 

*-oms > -y 

*-ums > -y *-ims > -i 

*-eh 2 ms > -y 

gen. 

*-om — > *-u > -b 

*-owom > -ovb *-eyom > -bjb 

*-eh 2 o/om > -b 

dat. 

*-omus — ► -omb 

*-umus — > -bmb *-imus > -bmb 

*-eh 2 mus > -amb 

instr. 

*-oys — > *-u > -y 

*-umTs > -bmi *-imT > -bmi 

*-eh 2 miHs > -ami 

loc. 

*-oysu > -exb 

*-USU > -bXb *-isU > -bXb 

*-eh 2 su > -axb 

dual 

nom., acc. 

*-eh 2 > -a 

*-uh 2 > -y *-ih 2 > -i 

*-eh 2 y > -e 

nom., acc. n. 

*-h 2 y > -e 



gen., loc. 

*-h 2 ow > -u 

*-owh 2 ow> -ovu *-eyh 2 ow > -bju 

*-eh 2 ow > -u 

dat., instr. 

*-omoH > -oma 

*-umoH > -bma *-imoH > -bma 

*-eh 2 moH > -ama 

TABLE 11.24 

CONSONANTAL STEM ENDINGS 


singular 

nom. 


*-s (m.), *-o > -0 (implies Slavic stem changes) 

acc. 


*-m > -0/b 


gen. 


*-es > -e 


dat. 


*-ey > -i 


instr. 


*? > *-i/emi — > -h/emb (m.); *? > *-iyam > -bjg (f.) 

loc. 


*-i — > -He 


voc. 


= nom. sg. 


plural 

nom. 


*-es > -e\ -i, -a from o-stems 


acc. 


*-ms > -t; -i, -a, -y from o-, o-stems 

gen. 


*-om > -b from o-stems? 


dat. 


*-(i)mus > -bmb 


instr. 


*-(i)mi > -bmi; -y from o-stems 

loc. 


*-(C)sU — > -bXb/ -ixb/ -exb 


dual 

nom., acc. 


*-ih, > -i (r-stems, dbnb); -a, ■ 

-e from o-, o-stems 

gen., loc. 


*-ow > -u 


dat., instr. 


*-imoH > -bma 
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Adjectives 

Adjectival declension derives from nominal declensions, where agreement in gender, 
number, and case follows the pattern of the 0 -, 70 -stems for masculine and neuter and the 
d-,y'd-stems for feminine, PIE *new-os, *new-om, *new-eh, > OCS nov-b, nov-o, nov-a 
‘new’ nom. sg. m., n., f.; Gmc *tewd — » PS1. *tewd-y- > CS1. *t’ud'- > OCS stuzd-b, 
stuzd-e, stuzd-a ‘foreign’ (with variant base forms tuzd-, stuzd-, perhaps as a function of 
dissimilation). Of particular note is the development of an articulated form of the adjec- 
tive with *j- pronouns (see below), which mark definiteness, an innovation shared with 
Baltic, e.g., novb-jb, novo-je, nova-ja; nova-jego gen. sg. m./n., novu-jemu dat. sg. m./n., 
etc. Cf. naujas ‘new’ m. sg. indef. vs. naujasis def. (< naujas +jis). 

Comparative forms are produced with -jbj-l-bs- or innovative -ej-bs- (shortened in the 
nom. sg. m. and n. by word-final consonant elision) and follow the j- declension types, 
e.g., OCS bol'i, bol'e, bol'bsa ‘bigger’; OCS novei, nove(j)e, noveisa ‘newer’. 

Participial forms also belong to the adjectival declensions, e.g., present active partici- 
ple: OCS nesv (m., n.), nesgsti (< PS1. *nesa-nt-s, *nesa-nt-m, *nesant-y-I) ‘carrying’; 
past active participle nesb (m., n.), nesbsi (< PS1. *nes-us-s, nes-us-m, *nes-us-y-I) ‘hav- 
ing carried’; past passive participle nesem, neseno, nesena (< PS1. *nes-en-as, -a, -a) 
‘having been carried’; present active participle nesonib, nesomo, nesoma (< PS1. *nes- 
am-as, -a, -A) ‘being carried’. The resultative “/-participle” occurs only in the nom. forms, 
e.g., neslb, neslo, nesla, which occur in compound tenses and mood constructions. 


Pronouns 

Personal pronouns in Slavic conserve much of the PIE pattern, distinguishing six cases 
as in the noun. Here the OCS examples stand (nearly) for all of Slavic. PIE *h,eg,* 
h,egHom > Cz .ja, Pol., Slk., ULSorb., BCMS, BRuss., Ukr., Russ.ya || OCS azb jSAn.jaz 
‘I’, Mac. jas, Bulg. az (with prothetic j- and lengthening by Winter’s Law; on the hetero- 
geneous inheritances see Kapovic 2009); PIE *h,mene > OCS mene, mg ‘me’ - full and 
clitic forms; PIE *tuH > OCS tv ‘you’ 2 sg. nom.; in OCS tebe, tg 2 sg. ace., the expected 
stem from PIE *tewe does not occur but is replaced by the stem from the dat. sg., *teb h - 
(OCS tebe 2 sg. dat. sg.); the nom. (and ace.) forms of the 1 and 2 pi. are evidently 
reshaped on the model of ty: OCS my ‘we’ nom., ny ‘us’ ace., vy ‘you’ (pi. and honorific) 
nom., ace. The ace., gen. forms generalized the lengthened vowel root and must have 
been extended by a suffix, PIE *nos, *wos — » PS1. *nAs-a(m), *wAs-a(m) > OCS nasb, 
vasb. The homophonic loc. pi. forms OCS nasb, vasb provide the direct continuation of 
the loc. pi. ending in -su, which has been generalized in the noun and adjective paradigms 
as -Xb (following the RUKI change). The reflexive pronoun follows the pattern of the 
3 sg.: OCS sg ace., sebe gen., sebe, si dat. full and clitic, sebe loc., sobojg instr. Third 
person pronouns, as well as corresponding relatives, are built from the anaphoric pronoun 
PIE *yo- apart from the nom. forms, e.g., OCS jb ace. sg. m ,,je ace. sg. n ,,jg ace. sg. 
f .,jego gen. sg. m., n .;jejg gen. sg. f ,;jemu dat. sg. m., n.Jejb dat. sg. f. In prepositional 
constructions, initial j is replaced by n' (< *nj) as a result of the reanalysis of -n in prep- 
ositions, *vun ‘in’, *sun ‘with’ (< PIE *h,n, *kon) as n-, e.g., otb n'ego ‘from him, it’, 
kb n'ejb ‘toward her’, sb n'inib ‘with him’. In OCS the nom. fonns from anaphoric *yo- 
occur only in relativizers compounded with the particle (focus marker) ze (the o-grade 
of which appears in the gen. sg. m. and n. pronominal and adjectival ending: jego), e.g., 
cbto ze vidisi socbcb ibze estb \>b ocese bratra toego a brwno eze_ estb vb ocese h’oenib 
ne cujesi ‘what you-see a mote -ace. -sg.-m. that-nom.-sg.-m. is in the eye of brother yours 
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but the log-acc.sg.-n. that-nom.sg.-n . is in eye yours not you-feel’ (Luke 6:41, Codices 
Zographensis, Marianus). In the daughter languages the usual 3 pers. nom. pronouns 
are built from CS1. *on-, e.g., OCS onb, ona, ono; oni, ony, ona m., f., n. sg., pi. (< PIE 
*h 2 en-), though Mac. and Bulg. use forms built from PIE *to-, e.g., Bulg. toj, tja, to m., f., 
n. sg. nom. In other Slavic languages, pronouns built on this fonn usually serve a distal, 
switch-reference, or resumptive function. 


TABLE 11.25 OCS 3 PERS. PRONOUN 



sg. 

pi. 

du. 

nom. 

*jb m., *ja f., *je n. 

*ji m., *j^ f., *ja n. 

*ja m., *ji f., n. 

acc. 

jh m. Jp f „je n. 

j? m„ f ;ja n. 

= nom. 

gen. 

jego m„ n.;/e/g f. 

ixb 

jeju 

dat. 

jemu m., n.; jejb f. 

imb 

ima 

loc. 

n'emb m., n.; n'ejb f. 

ixb 

jeju 

instr. 

imb m. n.; jejg f. 

imi 

ima 


OCS and the Slavic daughter languages preserve other 3 pers. pronouns marked for 
proximity in time or space, e.g., OCS Sb, si, se ‘this here’ (< PIE *ki-); BCMS ovaj, ova, 
ovo ‘this’ (< PIE *h 2 ew-) - contrasted with onaj, ona, ono ‘that’, cf. Sin. one ‘what- 
chamacallit’ < CS1. *ono + *je). The deictic s- particle also plays a role in the formation 
of adverbs of time, OCS dbnbSb, Cz., Slk. dnes, BCMS danas || Russ, segodnja ‘today’ 
(< CS1. *dtm> + st || *sego + dBne); OCS si nosti, BCMS sinoc, Mac. sinoka ‘last night’ 
(< CS1. *si not’i). 

Interrogative pronouns in Slavic display both conservative and innovative features, as 
is evident from the following table. The nom. forms of both the animate and inanimate 
have been extended with the element *to(-s, -d), though variant forms also occur, e.g., 
Pol. nikt. Sin. nihce ‘nobody’ < CS1. *nik'bt'b(-ze); the littoral (Cakavian) dialect of Croa- 
tian has the bare fonn of the inanimate ca (< CS1. *cb). Sin. and Croatian Kajkavian dial. 
kaj ‘what’ is formed from a different inherited element, PIE *k w eh 2 , extended with the 
anaphoric element CS1. *jb < PIE *yo- (Snoj 2003: s.v.). The same element is found as 
a subordinating conjunction in Bulg., Pol. dial., as well as in Prekmurje Sin. ka, where it 
contrasts with irrealis da (< PIE *doh,), the (otherwise indicative) prevailing subordinat- 
ing conjunction in western South Slavic (see Greenberg 2011). 


TABLE 11.26 INTERROGATIVE PRONOUNS 



PIE 

OCS 


Cz. 

Sin. 

Russ. 

nom. 

*k w o- ‘who’ 

*k w id ‘what’ kbto 

CbtO 

kdo 

CO 

kdo 

kaj 

kto 

cto 

acc. 

*k w om 

*k w im 

= gen. 

= nom. 

= gen. 

= nom. 

= gen. 

= nom. 

= gen. 

= nom. 

gen. 

*k w oso 

*k w eso 

kogo 

ceso 

koho 

ceho 

koga 

cesa 

kogo 

cego 

dat. 

*k w osmoy 

*k w esmey 

komu 

cesomu 

komu 

cemu 

komu 

cemu 

komu 

cemu 

loc. 

*k w osmi 

*k w esmi 

komb 

cemb 

kom 

cem 

kom 

cem 

kom 

cem 

instr. 

*k w Vh r 

*k w Vh,- 

kymb 

cimb 

kym 

cim 

kom 

dim 

kem 

cem 


Further pronominal bases include PIE *eyno- > OCS im> ‘other’, PIE *eysth,o- > OCS 
istb ‘same’, PIE *wik- > OCS VbSb ‘all’. 
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Indeclinables 

Adverbs of time, place, manner, and direction were created by combining pronominal 
elements with deictic particles. 


TABLE 11.27 OCS ADVERBS OF PLACE, TIME, MANNER, DIRECTION 


PS1. *-d h e ‘where’ 

PSl. *-da, -kuda ‘when’ 

PSl. *-(a)ka ‘how’ 

PSl. *-ama ‘direction' 

*k- kbde ‘where’ 

kbda, kogda ‘when’ 

kako ‘how’ 

kamo ‘whither’ 

*s- sbde ‘here’ 

sbda ‘now’ 

sice (< *s!ka) ‘in this way’ 

semo ‘hither’ 

*t- 

togda ‘then’ 

tako ‘thus’ 

tamo ‘there’ 

*ov- ovbde ‘there’ 

ovogda ‘then’ 


ovamo ‘thither’ 

*on- om>de ‘over there’ 

onbda ‘at that time’ 


onamo ‘yonder’ 

*j- jbde(ze) ‘at that place’ jeda ‘at the time when’ 

jako ‘thus’ 


*vbs- vbSbde ‘everywhere’ 

vbsegda ‘always’ 

vbsako ‘in all ways’ 



Other adverbs include OCS tu ‘here’ (< PIE *tow), mvnogo ‘much’ (< PIE *mnog h o-). 
Productive adverbial formants include -o, -e, -bskv, e.g., OCS dobro, dobre, both ‘well’; 
slovenbskv ‘Slavic’. 


Verbal morphology 

The Slavic verb materially reflects a simplified picture of Indo-European verbal morphol- 
ogy, though, as mentioned above, Slavic has innovated in its morphologized aspectual 
system by means of prefixation and suffixation as well as by extending the ablaut pattern. 
Only one example of PIE reduplication persists in the verb, and this is in the small group 
of athematic verbs, i.e., OCS dadgtb ‘they give’ (PS1. *dAden-t- <— PIE *deh r , cf. Skr. 
dadami ‘I give’). The remaining athematic verbs in Slavic are (PIE *hjes- >) OCS jesmb, 
jesi, sgtb 1 sg., 2 sg., 3 pi. (inf. byti) ‘be’; (PIE *weyd- >) verm 1 sg. (inf. vedeti) ‘know’ 
(an isolated relic form descended from the PIE intransitive conjugation is found in OCS, 
OCz., ORuss. vede, on which see Ivanov 1981: 68); (PIE *ed — * PS1. ed-mi >) OCS 
emb/jamb (inf. esti,jasti). The verb imeti, 1 sg. imamb ‘to have’ (< CS1. *ji>mamb < PS1. 
*im-a-mi <— PIE *m- ‘grasp, take’), also belongs here - it is related to an e/o-thematic 
verb fti,jbmg ‘to take’ inf., 1 sg. 

The present tense endings, as in the nominal declension, have been reorganized. The 
1 sg. ending of the thematic type in -g cannot have been original but was built from the 
inherited PIE *-oh 2 , and the -m marker was extended from the athematics after the loss 
of final consonants. The extension of the athematic 1 sg. -m made a second comeback in 
the historical period with the rebuilding of the present tense markers in West Slavic and 
western South Slavic, e.g., Slk. nesiem. Sin. nesem; Pol. gram, Cz. hram, Sin., BCMS 
igram || Russ, igraju, Ukr. hraju, Bulg. igraja, Plb .jaigrojq (< CS1. *jbgraj 9 ) ‘play’ 1 sg. 
(see further Janda 1996). The 3 sg. and pi. forms in OCS -tb are of secondary origin, most 
likely from the 3 sg. pronoun in /-. East Slavic, however, preserves the original athematic 
ending in -ti, e.g., ORuss. jestb ‘is’, Russ, as/’, BRuss. esc’, albeit with considerable 
dialectal variation (see Miller 1988 for details). A zero 3 sg. ending (inherited from PIE) 
is also amply attested, e.g., Cz. nese, Slk. nesie, Sin. nese, Ukr. nese. Similarly, 1 pi. 
endings show variation through generalization of the ablaut variants -mehno, most likely 
in response to competition from the 1 sg. marker in -m, cf. USorb. svm - smv, Cz.jsem ‘I 
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am’ - jsme ‘ 
Bulg. sam - 

TABLE 11.28 

we are’, Slk. som - sme, Sin. sem - smo, BCMS sam - 

sme. 

INFLECTION OF NON-PAST FINITE VERB FORMS 

- smo, Mac. sum - sme. 


athematic 

thematic 

1 sg. 

damb (< PS1. *dad-mi) 

nesg (< *nes-a-m) 

2 sg. 

dasi (< *dad-si) 

nesesi (< *nes-e-xI) 

3 sg. 

dastb (< *DAD-T-) 

nesetb (< *nes-e-) 

1 Pi 

dumb (< *DAD M-) 

nesernb (< *nes-e-m-) 

2 pi. 

daste (< *dad-te) 

nesete (< *nes-e-te) 

3 pi. 

dadqt-b (< *DAD-EN-T-) 

nesQti > (< *NES-A-NT-) 

1 du. 

dasta (< *dad-ta) 

neseta (< *nes-e-t-a) 

2 du. 

3 du. 

daste (< *dad-te) 

nesete (< *nes-e-te) 


Traditionally, OCS verbs are grouped into five classes, following the classification by 
Leskien 1969 (1871), which corresponds to layers of archaism vs. innovation, and are 
indexed by their characteristic theme ( e/o , nelo, etc.). Two basic stems are accounted for, 
represented by the infinitive, which normally contains a (semantically empty) suffix, e.g., 
LCS1. *btr-a-ti ‘to take’, and a present or non-past-tense stem *ber-e- ‘takes’. Sub-types 
are determined by the suffix or lack of it (in which case the stem-final segment is relevant) 
in the infinitive stem. In the table below the OCS forms are given in the inf., 1 sg., 3 sg. 
pres, tense (the listing of types here is not comprehensive). 


TABLE 11.29 VERB CLASSES, FOLLOWING LESKIEN 


class 

theme 

PIE 

PS1. 

OCS 

I.A 

elo 

*h,nel£- 

*NES-TEY 

nesti ‘carry’ det. 




*NES-A-M 

nesg 




*NES-E- 

nesetb 

1.B 


*b h er- 

*BIR-A-TEY 

bbrati ‘take’ 




*BER-A-M 

berg 




*BER-E- 

beretb 

II 

nelo 

*steh 2 - 

*STA-TEY 

stati ‘stand up’, ‘begin to’ 




*STA-NA-M 

stang 




*STA-NE- 

stanetb 



*kos- 

*kas-nu(n)-t-ey 

kosngti ‘touch’ 




*KAS-NA-M 

kosng 




*KAS-NE- 

kosnetb 

III. 1.A 

jeio 

*gnoh 3 - 

*ZNA-TEY 

znati ‘know’ 




*ZNA-YA-M 

znajg 




*ZNA-YE- 

znajetb 

III. 1.B 


*kes- 

*KES-A-TEY 

cesati ‘gather (fruit)’ 




*KES-YA-M 

cesg 




*KES-YE- 

cesetb 

III.2.B 


*(s)newH- 

*AB-SNAW-A-TEY 

osnovati ‘found’ 




*AB-SNAW-YA-M 

osnujg 




*AB-SNAW-YE- 

osnujetb 

IV.A 

T (< PIE *-eye-) 

*h,nok- 

*NAS-EY-TEY 

nositi ‘carry’ indet. 




*NAS-Y-A-M 

nosg 




*NAS- l(Y)E- 

nositt 
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class theme PIE PS1. OCS 

*Sed- *SED-E-TEY 


IV. B 

IV.B 

V athematic 

V 


*b h oyH- 

*SED-Y-A-M 

*sEd-i(y)e- 

*bay-E-tEy 

*b h uh 2 - 

*BAY-A-M 

*BAY-EY- 

*BU-TEY 

*h,es- 

*ES-MI 

*woyd- 

*ES-TI 

*WAYD-E-TEY 


*WAYD-MI 

*WAYD-TI 


sedeti ‘sit’ stative 

sezdg 

seditb 

bojati sg ‘be afraid’ 

bojQ sq 

bojitb sg 

byti ‘to be’ 

esmb 

estb 

vedeti ‘know, be aware’ 

vemb 

vestb 


Form classes and compound tenses/moods of the verb, as attested in OCS, include the 
following: 


TABLE 11.30 SLAVIC VERB FORM CLASSES 


infinitive 

nesti ‘to carry’ 

supine 

nestb ‘(to go) carry’ 

present tense 

nesg, nesesi ‘carry’ 1 sg., 2 sg. 

aorist 

nexb, nese ‘carried’ 1 sg., 2, 3 sg. 

imperfect 

neseaxb , nesease ‘was carrying’ 1 sg., 2, 3 sg. 

imperative, optative 

nesi, nesete ‘carry’ 2, 3 sg., 2 pi. 

present active participle 

nesy, nesgsti ‘carrying’ nom. sg. m./n., nom. sg. f. 

past active participle 

nesb, nesbsi ‘having carried’ nom. sg. m./n., nom. sg. f. 

present passive participle 

nesomb, nesomo, nesoma ‘being carried’ m., n., f. sg. 

past passive participle 

nesenb, neseno, nesena ‘having been carried’ m., n., f. sg. 

perfect 

estb neslb ‘has carried’ m. sg. 

pluperfect 

be neslb , bease neslb ‘had carried, had been carrying’ 

conditional 

bi neslb ‘would carry’ 2, 3 sg. 

future anterior 

bgdetb neslb ‘will have carried’ 3 sg. 


Ablaut, tense, and aspect 

Slavic tenses and their corresponding form classes have become simplified from what 
was a much more complex earlier system in late PIE, with partially differing results in 
Baltic and Slavic. Slavic and Baltic inherited e-grade thematic verbs (PIE *wed h - > OCS 
vedetu ‘leads’ det., Lith. veda; *g w eyh 3 - OCS ziti, zivg, Cz. zit, ziji ‘live’ inf., 1 sg.) as well 
as derived o-grade causatives, iteratives, and deverbal nouns (OCS voditi ‘to lead’ indet., 
Lith. vadyti ‘idem’, OCS vozdb ‘leader’, vojevoda ‘warlord’; PIE *g w oyh 3 - > ORuss. goiti 
‘revive’, Cz. hojit ‘heal’, Sin., BCMS gojiti ‘raise [a child]’). On the other hand, Slavic 
possesses sets of synthetic past tense forms (“aorist” and “imperfect”) built from PIE *-s, 
which is lacking in Baltic, whereas Baltic possesses an 3-future lacking in Slavic (with 
the exception of the relic pres. act. ptcp. form OCS bysgst-l bysQst- [< PS1. *bu-s-y-e/ 
ant-y-] ‘that which is to be’, replaced by bgdgst- ‘idem’, contrasting with sgst- ‘being’; 
see Vaillant 1966 [=1950-77]); see also Darden 1990, Andersen 2013. Lacking a separate 
synthetic future form, the primary Slavic future is expressed by the same form as for the 
present tense, which could be termed “prospective/actual” (following Andersen 2013), 
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where in the daughter languages perfective verbs in the non-past tend to be understood 
as future; in ESI. perfective verbs in the non-past forms are exclusively understood as 
future. Compound future tenses, in addition to the future anterior attested in OCS bgdetb 
nesla ‘will have carried’, which has become the generalized future construction in Sin. 
and Kajkavian Croat. (Sin. bo nesla ‘will carry’) and one of the future constructions in 
Pol. bqdq niosla or bgdg niesc (both with imperf. verbs only), developed in the daughter 
languages, e.g., BCMS pisat ce ‘will write’, Bulg. ste pisa ‘will write’ (both auxiliaries 
from CS1. *(xo)t’eti ‘want’), Ukr. budu pvsati ‘I shall write’ or pysatymu (< CS1. *pisati + 
*ji>m<j) ‘idem’ (both with imperf. only). 

Slavic extended the pattern of ablaut as a means of (partially) articulating inno- 
vative aspectual morphology, in effect reinvigorating the vrddhi grade, to produce 
imperfective partners for new perfective lexical items produced through prefixa- 
tion, e.g., (PIE *leg h - ‘lie [down]’ >) OCS lesti, Iqgg ‘lie down’ inf., 1 sg. perfect., 
poloziti ‘place something standing’ perfect polagati, polagajg ‘idem’ imperf. (PS1. 

*-leg-/-lag > -lag-). The innovation builds also on the excrescent vowels that 

developed from syllabic sonorants, e.g., OCS Sbbbrati ‘collect’ perfect., sbbirati 
‘idem’ imperf. (PIE *b h r- > PS1. *bir — > *b!r-). The rise of the aspectual system of 
Slavic as such is still being debated, with significant differences of opinion. Recent 
work indicates that the system of verbal aspect developed through heterogeneous 
developments that coalesced into a general schema opposing perfective and imper- 
fective aspects. Among the processes were, in addition to the ablauting pattern just 
mentioned, (1) the merger of synthetic imperfect and aorist into a single preterite in 
PS1., which resulted in the attested OCS aorist, and the rise of a new agglutinative 
synthetic preterite, the imperfect (see above) (see Andersen 2013); (2) prefixation 
of simplex verbs, creating new lexical meanings, aktionsarten, as well as purely 
aspectual distinctions, “ preverbe vide ” in the perfective aspect (e.g., OCS ubiti 
‘kill’ inf. perfect. < PIE *ow- ‘ablative’ + *b h eyH- ‘beat’; OCS uspeti ‘succeed’ inf. 
perfect. < PIE *ow- + *sp h eh[- ‘flourish’) (see Dickey 2005 and forthcoming); and 
(3) suffixation, allowing the articulation of imperfective partners for lexicalized 
prefixed perfectives (OCS ubijati, ubivati ‘kill’ inf. imperf.; uspevati ‘succeed’ inf. 
imperf.). The general meanings of the perfect. : imperf. opposition in the daughter 
languages differ geographically, with two clear-cut sub-types: “eastern” (ESI. + 
Bulg.) and “western” (Cz., Slk., ULSorb. + Slovene), as well as two transitional 
types, Pol. (trending toward “eastern”) and BCMS (trending toward “western”). In 
each type the perfect, is marked for a general meaning, whereas the other imperf. 
is unmarked as such: the “eastern” type conveys “temporal definiteness” (e.g., 
succession of events) and the “western” type “totality” (for details, see Dickey 
2000). 


WORD FORMATION 

Slavic lexis has considerably innovated new configurations of PIE material, including 
‘hand’ (Lith. ranka ) < PIE *wronk-eh, <— *wrenk- ‘grasp’; OCS noga ‘foot, leg’ (Lith. 
naga ‘hoof’) <— PIE *h,nog h o- ‘(toe)nail’ + collective suffix *-eh 2 , cf. OCS nogbtb 
‘(finger-/toe)naiT (Lith. nagutis ‘idem’). Slavic word formation makes considerable use 
of prefixation and suffixation, as well as, in regard to the verb, extensions of ablaut pat- 
terns, as was mentioned above in the discussion of the verb. 
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TABLE 11.31 DEVERBATIVES FROM CSL. *VEDETI ‘TO KNOW’. 
*RYBA ‘FISH’ 

, DERIVATIVES OF CSL. 

CS1. 

Attestation 

Meaning 

*ved-a 

Cz. veda 

‘science, knowledge’ 

*ne-ved’-a 

OCS nevezda 

‘idiot’ 

*ne-ved’-i> 

OCS nevezdb 

‘idiot’ 

*ne-ved’-bstvo 

OCS nevezbstvo 

‘ignorance’ 

*ne-ved-om-T>-jb 

Cz. nevedomy 

‘unknown’ 

*ne-ved-bm> 

Sin. neveden , BCMS arch. nev(j)edan 

‘ignorant’, ‘arrogant’ 

*ryb-ar’-b 

OCS rybar’b 

‘fisherman’ 

*ryb-ar’-i-ti 

Cz. rybarit, Sin. ribariti 

‘to fish’ 

*ryb-ic-a (< PS1. *rub-Ik-a) 

OCS rybica 

‘small fish’ 

*ryb-i>n-T>-jb 

Russ, rybnoj 

‘pertaining to fish’ 

*ryb-i>n-ik-'b 

Cz. rybnik 

‘pond’ 

*ryb-i>n-ic-a 

Pol. Rybnica, Sin. Ribnica 

toponym 


Examples of PIE compounding persist in relic form, e.g., OCS gospodb ‘the Lord’, 
‘lord’ (< PIE *g h osti- + *pot-, if it is in fact cognate with Lat. hospes, hospitis - Slavic d 
is unexpected as a reflex of PIE *t), or the famous taboo word for ‘bear’, OCS medvedb 
(< PIE *med h u- ‘honey’ + *ed- ‘eat’), cf. o-grade USorb. sknadz, Sin. strnad , Ukr. 
strenadka ‘yellowhammer ( Emberiza citrinella )’ (< CS1. *stri>nadi> < PIE *kri(m)n(o)- 
‘grain’ + *h,od- ‘eat(er)’) (Snoj 1992: 198-199). Slavic given names preserve further 
endocentric compounds that were productive in the medieval pre-Christian era, e.g., 
ORuss. Svdtoslavb (<— PIE *kwen-to- ‘sacred, strong, good’ + *kle/ow- ‘famed’), OCS 
Vladimin (<— PIE *wel-d h e- ‘rule, have strength’ + *meyH-ro- ‘good, pleasing’). 


SYNTAX 

The syntax of Slavic languages is treated less in the historical than in the synchronic 
spectrum of Slavic linguistics than are phonology and morphophonemics, which have 
received the majority of attention. In contrast, however, studies of Slavic synchronic 
syntax have burgeoned in the last several decades. Here we will focus on some highlights 
with a particular focus on developments relating the Indo-European heritage of Slavic. 

Generally, Slavic simple clause structure tends to be SVO, though word order is “free,” 
presumably owing to case marking. Much attention in the structural literature, particu- 
larly the Prague School, has focused on the functional sentence perspective in Slavic 
languages. So, for example, in contemporary Bulgarian (a language that has reduced 
case distinctions down to three) one may produce a natural three-element sentence such 
as SVO Ivan nameri knigata ‘Ivan found book-the’ also as VSO, VOS, SOV, OVS, and 
OSV, all responding to the questions Kakw stana? ‘What happened?’ or Kakvo pravi 
Ivan? ‘What is Ivan doing’, where the focus of the sentence changes depending on the 
sentence intonation, the element in question being given emphatic intonation (examples 
from Kempgen et al. 2009: 656-657). A traditional Prague School analysis (without 
regard to sentence intonation) assumes that the focus (“new information”) is the last sen- 
tence element, a pattern that generally holds in written Slavic languages. 

From an Indo-European perspective one may note the persistence of Wackernagel’s 
Law in OCS and modern Slavic languages, which assigns unstressed particles to the 
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second position in the clause following the first accented element oni ze_ Vbzlozisg. 
rgce na nt i gsg Jb ‘they [ focus particle ] placed hands on him and took him’ (Codex 
Marianus, Mark 14:46), cf. Cze zda se nine ‘it seems to me’, Sin. zdi se mi ‘idem’. 
East Slavic is exceptional in that it has fused the reflexive particle sja (< CS1. *s§) to 
the right of the verb (e.g., Russ, kusat’ ‘to bite’ vs. kusat’sja ‘to bite one another, be 
characterized by biting’), and the trend toward fusion can be seen in the chronologi- 
cal progression of texts in the Novgorod Birchbark corpus: a nvne sa druzina po ma 
porucila (no. 109, late 12th c.) ‘and now [reflJ retinue about me vouched’, a jazo 
sa klaneju (no. 344, early 14th c.) ‘and I refl . bow’ (salutation formula), a cemu sa 
gnevaesi? (no. 605, 12th c.) ‘but why refl . anger-[2 sg.]’ vs. a dzo tobe klanajusa 
(no. 147, 13th c.) ‘and I to-you bow- REFL .’. a to sci dijalosb sedne vo Veliki dnb ‘and 
this refl. done-REFL. today on Great (Easter) day’ (no. 154, 15th c.), with the latter 
example reflecting a rare simultaneous instantiation of both the older and clitic place- 
ment (Zaliznjak 2004: 188-189). 

Significant variation is found in compound sentences in the Slavic languages, which 
underwent major shifts in the historical period. Canonical OCS attests to a strategy of 
clause combining employing participles in the dative case for subordination (the “dative 
absolute”), which subsequently disappeared from most of Slavic save for southern Rus- 
sian dialects (see Andersen 1970), e.g., OCS vb pgtoe ze na desgte leto vladycbstva tiverbja 
k’esarja obladajgstu pontbskumu pilatu ijudeejg, i cetvbrtovlastbvujgstu galileejg irodu 
. . . bvstb glagolb bozii kb ioanu zaxarinu svnu ‘in fifth-[defi] year of reign Tiberius-[GEN.] 
Caesar-[GEN.] ruling-[pRES. act. ptcp. dat. sg. m.] Pontius-[DAT.] Pilate-[DAT.] and ruling- 
as-tetrarch-[pRES. act. ptcp. dat. sg. m.] Galilee Herod-[DAT.] . . . was-[AOR. 3 sg.] word- 
[nom. sg. m.] God’s ’to John Zakharias’ son - ‘In the fifteenth year of the reign of Tiberius 
Caesar, Pontius Pilate being governor of Judea, and Herod being tetrarch of Galilee, . . . 
the Word of God came unto John the son of Zacharias’ (Luke 3:1-2, Codex Zographen- 
sis); nibnogo sgstu narodu i ne imgstenib ceso esti . . . isusb glagola ‘being-[pRES.ACT. 
ptcp.-dat.pl] many people and [they] not having-[pRES. act.ptcp.-dat.pl] what- [gen. sg.] 
to-eat . . . Jesus spoke’ (Mark 8:1, Vaillant 1950-1977: 90). Variation in the material of 
the general (realis) subordinating conjunctions in Slavic languages reflects the degree 
to which subordination has been reshaped in Slavic languages, e.g., East Slavic cto 
(< CS1. *ci>to ‘what’ < PIE *k w i- + *to-), West Slavic ze (< CS1. *ze ‘focus marker’ < PIE 
*g wh e), Sin., BCMS da (< PIE *doh 2 Tative particle’, cf. Germ, zu, Eng. to), Prekmurje 
Sin., Medimurje Kajkavian ka (< *k w eh 2 , cf. Lat. qua ‘in what manner’), Mac. deka 
(< PIE *d h e ‘there’ + *k w eh 2 ), Bulg. ce (< PS1. * ke < PIE *k w e ‘and’) and deto. A similar 
pattern of variation occurs with relative markers. OCS mainly employed a compound 
of *j- pronouns + the ze marker (e.g., xlebb bo ibze azb danib plbtb moe estb ioze azb 
danib za zivotb Vbsego mira ‘bread thus rel.-acc.-sg.-m. I give-l-sg. flesh my-f.-sg.-nom. 
is-2-sg. REL.-Acc.SG.-F. I give-l-sg. for life whole-gen.-sg.-m. world’; John 6:51, Codex 
Zographensis). Already in the Freising Folia an emerging phonological change z > r, 
which spread throughout South Slavic, reshaped the marker, setting up the conditions by 
which the inherited relative adjective *kbter- (< PIE *k w otero- ‘which of two’) could be 
reanalyzed as kbte- + r, where the last element became a generalized relativizer. The -r 
element subsequently became a productive formant for building a range of relativizers in 
Sin. and dialects of BCMS associated with the Roman rite (Kajkavian, Cakavian) (e.g., 
Sin. kar ‘that which’, kadar ‘ when[ever] ’, kdor ‘who[ever] ’) and was replaced in dialects 
associated with the Byzantine rite (see Greenberg 1999 for details). 

A further step in clause combining is the loss of the infinitive in the Balkan Slavic lan- 
guages, Mac. and Bulg., an areal feature characteristic of the Balkan languages (Albanian, 
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Aromanian, Greek, Romanian) that has also spread to Serbian and Montenegrin. This 
feature, as well as others discussed in this chapter, may be observed in the following frag- 
ment from Jaroslav Hasek’s Osudy dobreho vojaka Svejka za svetove valky (The adven- 
tures of the good soldier Schweik during the world war) in the original Czech and several 
translations. The fragment translates “ ‘Which Ferdinand [did they kill], Mrs. Muller?” 
asked Schweik, not ceasing to massage his knees, ‘I know two Ferdinands.’” 


West Slavic 

- Ktorego [1] F erdynanda, pani Miillerowo? - zapvtal [2] Szwejk, nie przestajqc [3] 
masowac [4] kolan. - Ja znam dwoch Ferdynandow. (Polish) 

“Kteiyho Ferdinanda, paid Miillerovd?” otazal se Svejk, neprestavaje si masirovat 
kolena, "ja znam dva F erdinandy. ” (Czech) 

“Ktoreho Ferdinanda, pani Miillerovd? ” opvtal sa Svejk, neprestavajuc si masirovat' 
kolena. “Ja poznam dvoch Ferdinandov. ” (Slovak) 


Western South Slavic 

“Katerega Ferdinanda, gospa Miillerjeva? ” je vprasal Svejk, ne da bi si nehal mazati 
kolena, “poznam dva Ferdinanda [5], ” (Slovene) 

- Kojega Ferdinanda, gospoda Miilerova? - odazvao se Svejk ne prestajuci masirati 

koljena — ja poznam dva Ferdinanda. (Croatian) 

- Koga to Ferdinanda, gospodo Miler? - upita Svejk, ne prekidajuci masiranje 

kolena.- Ja 

poznajem dva Ferdinanda. (Serbian) 


Eastern (Balkan) South Slavic 

Koj Ferdinand, gospogo Miler? - zaprasa Svejk, ne prestanuvajki da si gi masira 
kolenata [6], - poznavam dva Ferdinanda. (Macedonian) 

Koj Ferdinand, gospozo Mjulerova? —zapita Svejk, bez da prekbsne da raztirava kole- 
nite si. —Az [7] znaja dvama Ferdinandovci. (Bulgarian) 


East Slavic 

Jakoho ce Ferdinanda, pani Mjullerova? - spvtav Svejk, ne perestajucy roztyratv 
kolina. - Ja znaju dvox Ferdinanda. (Ukrainian) 

Jakoha F erdynanda, Pani Mjuler? - spvtaw Svejk, ne perastajucv masirovac 'kaleni. - 
Ja vedaju dvux Ferdvnandaw. (Belarusian) 

Kakogo Ferdinanda, pani Mjullerova? - sprosil Svejk, ne perestavaja massirovat’ 
koleni. - Ja znaju dvux Ferdinandov. (Russian) 


Non-Slavic Balkan 

— Pe care Ferdinand, doamna Miillerovd? intreba Svejk continuind sa-?i maseze 
genunchiul. Eu cunosc doi. (Romanian) 

— Cilin Ferdinand, zonja Mvler? - pveti Shvejku duke ferkuar gjunjet. - Une njoh dv 
Ferdinande. (Albanian) 
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1 With the exception of Mac., Bulg., which no longer mark cases in nouns and 
adjectives, the m. sg. acc. animate forms in Slavic take the marking of the 
genitive. 

2 Past tense ‘he asked’ is rendered with the aorist (simple past) 3 sg. in Mac., 
Bulg., which preserve both simple and coordinated past tenses, and the relic of 
the perfect in the rest of the Slavic languages, which have only one past tense 
construction. The simple past 3 sg. is used in Romanian ( intreba ) and Albanian 
( pveti ). Sin. preserves the full construction with clitic auxiliary je ‘be’ 3 sg. (cf. 
OCS jest'b ) + the /-participle. Auxiliary clitics are obligatory in non-3-PERS. past 
tense in all but the East Slavic languages, which use the unauxiliated /-participle 
for all persons and numbers. The 3 sg. auxiliary has been eliminated in West 
Slavic (though it persisted well into the historical period), while in BCMS it is 
normally dropped in reflexive constructions ( odazvao se ‘he responded’, but 
would have been pitao je ‘he asked’, had the translator chosen to use this verb). 

3 With the exception of Sin. and Bulg. the first verb in the subordinate clause, ‘not 
stopping’, is rendered with a pres. act. adv. ptcp. (PS1. *-ant-y-). The option would 
have been available to both of these languages as well but was not chosen by the 
translator, though in Sin. the participle is deprecated with all but a few lexical items. 

4 Except for Mac., Bulg., the Slavic translations use the inf. form of ‘to massage’. In 
Mac. the equivalent construction contains da comp., si refl.-dat. (i.e., ‘one’s own’), 
gi them.-Acc.-PL. (agreeing with an object marked for definiteness), masira ‘mas- 
sage’ 3-sg.-ind. In this regard, Mac. takes the infinitival phrase as the “Wackema- 
gel” unit, placing all of the clitics after the participle. In Bulg. the construction is 
similar, though the possessive marker si is moved to the right of the object kolenite 
si ‘knees-the refl.-dat.’ Romanian is similar to the Mac. construction, though it 
lacks an equivalent to the reduplicated gi definite-object marker. Albanian has 
developed an innovative infinitival construction duke ferkuar, though a finite verb 
would also have been possible, pa ndaluarse ferkuari gjunjet ‘without continuing 
subordinating conjunction rub-3-SG. knees’ (see chapter on Albanian). 

5 Sin. retains a distinct dual form, which also corresponds to dual markers in 
nominal and verbal inflection. In East Slavic the dual has expanded to include 
groups of three and four (“paucals”), marked only in nominal phrases. 

6 Mac. and Bulg. mark definiteness with the form of the inherited dem. pron. CS1. 
*tb (agreeing in gender and number with its head), marked to the right of the 
first element of the nominal phrase. Parallel constructions are found in Roma- 
nian and Albanian. 

7 Sin., Mac., and Bulg. continue the full form of the 1 sg. pronoun ( jaz , jas, 
az, respectively), though in PRO-drop languages (West and South Slavic) the 
personal pronouns are not obligatory except for discursive strategies such as 
reference switching and emphasis. 


FURTHER READING 

Students of Slavic languages can get a good overview of Slavic from handbooks, includ- 
ing the English-language reference work Comrie and Corbett 1993, which is arranged 
according to standard languages; a more concise Russian-language work, Moldovan 
2005, is structured similarly. An English and German handbook, Kempgen et al. 2009, is 
arranged thematically but also includes sketches of standard languages. A single-volume 
overview of Slavic philology suitable for advanced undergraduate students is Schenker 
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1995. Several English-language grammars of Slavic languages, written by specialists 
and intended for general linguistic typologists and comparativists, are available freely on 
the Internet on the SEELRC Network (http://www.seelrc.org/projects/grammars.ptml). 
An annotated bibliography of the literature up to the early 1980s on the reconstruction 
of Common Slavic is available in Birnbaum 1979 and Bimbaum and Merrill 1983. The 
long-term multinational project on Slavic diatopy, the Common Slavic Linguistic Atlas, 
published in various places in book format, is now available in electronic format (see the 
Obsceslavjanskij lingvisticeskij atlas). The etymology of Slavic languages as a whole 
is covered in the as yet unfinished series begun under the editorship of the late O. N. 
Trubacev (1974—); Kopecny 1973, 1980 remain an invaluable source for the etymology 
of pronouns, particles, and grammatical elements. Single-volume etymological dictionar- 
ies that take into account recent research include Gluhak 1993 (which traces words back 
to Nostratic), Snoj 2003, and Derksen 2008. Kopecny 1981 provides a single- volume list- 
ing of Slavic comparanda for the ca. 2000 words of CS1. core vocabulary. Among the his- 
torical grammars of Slavic one could view as classic the works of Arumaa (1964—1985) 
and Vaillant (1950-1977), the latter remaining the most comprehensive treatment of 
Slavic comparative grammar to date. With regard to historical phonology, although dated 
in some respects (particularly with regard to the development of Slavic word prosody), 
Shevelov 1964 remains extremely valuable for the sheer amount of documentation for 
Slavic in a single volume. Shevelov also founded a series of (English-language) historical 
phonologies of Slavic languages that is still in progress, of which volumes on Belarusian, 
Macedonian, Polish, Slovak, Slovene, Sorbian, and Ukrainian have appeared to date. 
Of special note for the reconstruction of Slavic accent would be the watershed works of 
Stang 1957 and Illic-Svityc 1963, the latter of which is also available in English transla- 
tion (1979). These works have renewed energy in research on Baltic and Slavic accen- 
tology with particular regard to its Indo-European origins, notable monographs of which 
range from Kortlandt 1 975, Garde 1 976, Dybo 1981 and 2000 to the works of the newest 
generation of scholars, e.g., Olander 2009, Pronk 2012, and Kapovic 2015. Numerous 
handbooks of OCS exist in Slavic and other European languages, most of which are to a 
greater or lesser extent derivative of the classic textbook by Leskien (1969 [1871]); Lunt 
2001 is currently the best OCS reference in English. The authoritative OCS dictionary is 
the multi-volume Kurz et al. 1958-1997, while Sadnik and Aitzetmuller 1955 and Cejtlin 
et al. 1994 are one-volume works. Sadnik and Aitzetmuller also contains a reverse dic- 
tionary of OCS words. Outside of canonical OCS, medieval attestations of Slavic include 
the Novgorod Birchbark Letters, which are available mostly in Russian sources, of which 
Zaliznjak 2004 gives the most comprehensive overview; and the Freising Folia, which 
attests to a variety of Slavic spoken around 1000 A.D. and is most closely associated 
with Slovene (see Bernik 1993, which also contains English glosses and translation). 
A recently published work (Bocek 2014) gives an overview of competing theories of the 
development of Proto-Slavic, in particular the notion of Late Common Slavic as a lingua 
franca, as well as a discussion of frameworks for language contact with Slavic. 


NOTES 

1 A similar stylization of the phonological system of the reconstructed systems is 
employed by Holzer, whose system corresponds to both the PS1. and CS1. systems in 
this chapter. The details of Holzer’s system are found in Holzer 2007, esp. pp. 15-20. 

2 Possibly also from * g h woy-st-, cf. OPruss. svaigstan Tight, shining’ (see Gluhak 1993, 
s.v.). 
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ALBANIAN 

Alexander Rusakov 


INTRODUCTION 

Current geographic distribution. Albanian is currently spoken in Albania, Kosovo, the 
southern part of Montenegro, the southern part of Serbia, the western part of Macedonia 
and, to some extent, the northwestern part of Greece. Sizeable old Albanian diasporas 
exist in various regions of Greece (since the late 1 3th 14th centuries), in the southern 
Italy (since the 15th century), near Zadar in Croatia (since the 18th century), in Bulgaria 
and Ukraine (this population probably left Albania in the 1 6th century). Albanian is spo- 
ken by approximately 7.5 million speakers. 

Dialects. In terms of dialectal variation, the Albanian language area is divided into two 
large zones - the Northern (or Gheg) and the Southern (or Tosk ); the Shkumbin river, 
which crosses Albania from east to west, forms a border between the two zones (a narrow 
strip of transitional dialects is situated to the south of the Shkumbin river). The dialects of 
northern Albania, Kosovo, Montenegro, and Serbia; the diaspora dialect of Zadar; and the 
majority of the Albanian dialects of Macedonia belong to the Gheg zone, while all other 
dialects belong to the Tosk zone. 

The main differences between the Gheg and Tosk dialects in phonetics are (1) the 
so-called rhotacism (the change -n- > -r- in Tosk); (2) the presence of nasal vowel series 
in Gheg and the absence of nasal vowels in Tosk; (3) the correspondence between Gheg 
stressed nasal a and Tosk stressed a; (4) the correspondence between Gheg vo- and Tosk 
va- in some words. Concerning all these phenomena, the Gheg dialect features a more 
archaic language state than the Tosk dialect. Other phonetic isoglosses are not absolute. 
On the basis of the distribution of these phonetic phenomena in lexical borrowings one 
may conclude that the division of (pre)Albanian language continuum into the Gheg and 
Tosk zones had to occur in the last centuries of the first millennium A.D. 

The main morphological difference between the Gheg and Tosk dialects, which is also 
almost absolute, is the presence of the analytical infinitive form in Gheg. The Tosk dialect 
lacks the infinitive fonn strictu sensu (except some lexicalized remnants). There are some 
other less important morphological differences between the two dialects (e.g. lack of a 
possessive -reflexive pronoun in Tosk, some differences in word-formation affixes, and 
so on). Some other phonetic and morphological differences will be listed in the relevant 
sections of this chapter. 

Both the Gheg and Tosk zones are divided into numerous subdialects. The main Gheg 
subdialects are northern Gheg (further subdividing into western and eastern subdialects), 
central Gheg and southern Gheg (or Gheg of Central Albania). The Tosk dialect divides 
into northern Tosk (further subdividing into western and eastern subdialects) and south- 
ern Tosk (Lab and Cham dialects). The main diaspora dialects, Arbresh (the Albanian 
dialects of Italy) and Arvanitika (the Albanian dialects of Greece), are closest to southern 
Tosk, although they (especially Arbresh) also demonstrate some specific dialectal traits 
due to the complex scenarios of their respective speakers’ migrations. Both dialects are 
very important for Albanian language history. 
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MAP 12.1 THE ALBANIAN DIALECTS 

Source: Adapted from: Gjinari et al. 2007, map C and Gjinari J., Shkurtaj Gj. 2009, Dialektologji, Shtepia 
botuese e librit universitar, Tirane: p. 161 


Modem Standard Albanian was consolidated after the Second World War on the basis 
of the Tosk dialect but including some elements (mainly, lexical and, to a lesser degree, 
(morpho)phonological and morphological) of the Gheg dialect. Now the standard lan- 
guage is used as such in all parts of the Albanian language area. 
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Documentation of Albanian. The first well-documented mention of Albanians as a sep- 
arate ethnic entity is encountered in the “Historia” by Michael Attaleiates (1079-1080), 
where Albanians are mentioned with respect to the Durres region in the year 1078 (two 
earlier mentions under the years 1038 and 1043 are disputable). The first known men- 
tion of Albanian as a separate language is found in a legal document from a Raguza 
(Dubrovnik) archive dated 1285: Audivi unam vocem clamantem in monte in lingua Alba- 
nesca ‘I heard a voice on the mountain that shouted in the Albanian language’. The first 
Albanian language record (in the Gheg dialect) is the baptismal formula ( Unte paghesont 
premenit Atit et birit et spertit senit ‘I baptize you in the name of the Father and the Son 
and the Holy Ghost’) from 1462, included in the Latin pastoral letter of Archbishop of 
Durres Pal Engjelli (Paulus Angelus), a close associate of Scanderbeg. There are some 
other short language records (in both Gheg and Tosk) belonging to the end of 15th and 
beginning of the 16th centuries (see Roques 1932, Shuteriqi 1976, Matzinger 2010). 

The first important early Albanian written text is “ Meshari ” (Missal) by the Catholic 
priest Gjon Buzuku, printed in 1555 (most likely in Venice); only one copy is preserved, 
and the text comprises 188 pages. “ Meshari *’ was written in the northern variety of the 
Gheg dialect; it remains the main source for the study of the earliest stage of Albanian. 
“ Meshari ” opens the North Albanian Catholic tradition in the development of Alba- 
nian writing. Overall, some ten printed texts belonging to this tradition and published 
before the middle of 1 8th century are known. All these publications use Latin script with 
rather similar graphic and orthographic conventions. These books are mostly devoted to 
spiritual and church matters (however, there was also a Dictionarium latino-epiroticum 
(Roma 1635) by Frang Bardhi (Franciscus Blanchus)). Besides Catholic literature, some 
other traditions in the early Albanian writing can be identified. One of these traditions is 
the Orthodox tradition of Central and South Albania, which is represented by the texts 
created between the second third of the 1 8th and the first third of the 1 9th centuries. These 
texts were written in southern Gheg and Tosk and used both Greek script and various orig- 
inal alphabets invented by educated Albanians especially for their own language needs. 
Along with a small number of religious texts, two multilingual dictionaries of great value 
belong to this tradition: Theodor Kavalioti ’s “Protopeiria” (1770) and Daniel ofVoskopo- 
ja 's “Eisagogike didaskalia ” (1802). The Muslim tradition of Old Albanian writings cov- 
ered the entire territory of Albania but was especially prosperous in the cities of Central 
Albania, where some poets of a high poetical level wrote their verses using Arabic script 
from the beginning of the 18th till the middle of the 19th century. Great significance for 
the study of Albanian belongs to the Italo-Albanian (Arberesh) writing tradition, which 
started with “E mbsuame e kreshtere” (Christian Doctrine) by Luca Matranga (published 
in 1592 in Rome, written in Latin script; it represents a good example of the old stage 
of the southern Tosk dialect; the book is very valuable for the study of both the Italo- 
Albanian dialect and Arvanitica; see Sasse 1991: 8-9) and flourished in the 19th century. 

The period starting from the middle of the 19th century marks the beginning of the 
development of the Standard Albanian language. 

The ethnic and language name. The old Albanian ethnic and language name has been 
used since the oldest Byzantine and Latin sources in the two fonns of the root arb- and alb- 
(which fonn is etymologically primary remains unclear): Apfavizcov by Attaleiates, lingua 
Albanesca in the Dubrovnik document from the 13th century and so on (cf. also Akfavol 
and A/ifjavojroAig by Ptolemy; see below). Albanians themselves used this name only with 
the root arb- (see, however, the name of Albanian tribe lab, and of the corresponding area 
Laberi, probably from alb- with metathesis). The name with the root arb- is mentioned 
in old Albanian documents, but it went out of use in the main part of Albanian-speaking 
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area and remains in use only in diaspora dialects (It.-Alb. arberesh, Gr.-Alb. amanitas). 
In other areas, it has been replaced by the tenn with the root shqip shqip ‘Albanian (used 
as an adverb)’, gjuha shqipe ‘Albanian language’, shqiptar ‘Albanian (ethnic name)’ and 
so on. This name is derived from the verb shqiptoj (‘to speak clearly’, earlier ‘to under- 
stand’; see B. Demiraj 2010); it was first registered in the 16th century (at the beginning 
in its adverbial form) and consolidated in the 1 8th century. The ethnic name change was 
triggered most likely by the Ottoman conquest (end of the 14th century-second half of the 
15th century), which led to deep changes in the structure of the Albanian nation (for the 
Albanian ethnic and language names see, first of all, B. Demiraj 2010). 

The origin and early histoiy of the Albanian language. The fact that written evidence 
concerning the Albanian language and Albanians is historically rather shallow makes it 
problematic to reconstruct the earlier stages of Albanian language history. From the end 
of the 19th century, a central place in this problem belongs to the question of the possible 
development of Albanian from one of the languages that were spoken in Antiquity in the 
northern part of the Balkans. However, this question itself resulted from speculations 
that were largely deductive. Two main theories consider Albanian as a descendant of 
either Illyrian or Thracian languages, respectively (for an overview see Matzinger 2009, 
2012b). According to the former theory, Albanians were autochthonous to the area that 
roughly coincided with their modem territory; the latter theory assumes that the ancestors 
of Albanians came to the modern territory from a more eastern part of the peninsula. This 
aspect of the “origin problem” accounts for the political shade of the debate: the Illyrian 
theory became a sort of a sacred cow in Communist Albania and remains the mainstream 
theory among Albanian scholars nowadays. The situation is complicated by the fact that 
the exact extent of the idioms referred to as the Illyrian and Thracian languages, respec- 
tively, is not known. In the case of Illyrian, the issue is relevant, first of all, for Messapian; 
the status of this idiom is indeterminate and varies in linguistic writings from merely 
an Illyrian dialect to a different language. In the case of Thracian, one should mention 
“Daco-Misian,” which was assumed by V. Georgiev to be a separate language and a direct 
ancestor of Albanian. In the last decades, Dardanian is considered a separate language, 
also potentially important for Albanian language history. 

There are some arguments in favour of either theory. From a purely historical point of 
view, the absence of the mentions of Albanians till the 11th century cannot be considered as 
an argument against (or in favour of) either the “autochthony” or “migration” hypotheses. 
The mentions of the tribe Ahpavoi and the city (?) AXfSavonohi; (to the East of the Dyrra- 
chion) in Ptolemy’s Geography (2nd century A.D.; see also Apfcov, a city in Illyria with 
uncertain localisation by Polybius, 2nd century BC) don’t testify definitively in favour of 
the historical continuity of the Albanians in this territory: there are many known examples 
in world history where an ethnic or a language name shifted from one ethnos to another. 

There are studies that show some kind of archeological continuity between Antiquity 
and the second half of the first millennium A.D., first of all the so-called Koman culture 
near Kruja. However, regardless of the particular interpretation of excavation materials 
from an archeological point of view, these studies cannot be decisive for the solution of 
the problem of ethnic and linguistic continuity. 

There are also some linguistic data that can be taken as evidence in favour of the 
Albanian language’s descent from either the Illyrian or Thracian language, respectively. 
Unfortunately, most of these arguments are weakened by the insufficiency of our knowl- 
edge about the ancient Indo-European languages of the Balkan Peninsula. 

There are some reliable etymological correspondences between Albanian words and 
both Illyrian (including Messapian) and Thracian language material (a few glosses and 
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numerous onomastic tokens). Among the most commonly accepted Illyrian-Albanian 
correspondences are Illyr. pivoq ‘cloud, fog’ - Alb. re, re ‘cloud’; Messap. (3psv5ov 
‘deer’, Ppevnov ‘deer’s head’-Alb. bri, brV horn’; Messap. Juppiter Menzana ‘the God 
to whom horses were sacrificed’ - Alb. mez, maz ‘foal’; Tarentian pdpuKoraiSoiov - Alb. 
bark ‘belly’; Messap. aran ‘field’ - Alb. are ‘field’; Messap. bilia ‘daughter’- Alb. bije 
(dial. bile). The Thracian-Albanian (and Dacian-Albanian) correspondences include Dae. 
pavieia ‘blackberry’ -Alb. man ‘mulberry’; *5pavi<; (Hesychius) ‘deer’-Alb. dre ‘deer’ 
and some other. 

Lexical data of this kind are obviously insufficient for any kind of decisive conclusions 
about the relations between Albanian and the two ancient Balkan languages. The same 
is also true of phonetic Illyrian-Albanian and Thracian-Albanian isoglosses. Given the 
limits of our knowledge, it is preferable to consider Illyrian, Thracian and Albanian as 
separate branches within the Indo-European language family. 

There are, however, other pieces of linguistic evidence that may be relevant for the 
determination of the place of the “intermediate Balkan Urheimat” of the Albanians, 
regardless of whether or not the Albanian language originates from the Illyrian language. 

• One piece of evidence comes from the phonetic form of some ancient geographic 
names localised in the modem Albanian territory, first of all on the Adriatic coast. If 
it is possible to trace a linguistically continuous path in the phonetic development of 
these names from their ancient form to their modern form in accordance with Albanian 
“sound laws,” it would support the hypothesis that the Albanians have been living on 
their modem territory from Antiquity. There are, however, some phonetic difficulties 
in reconstructing such a continuous development, for example, with respect to such 
important Albanian toponyms as Shkoder < Scodra, ZicoSpa, Durres < Dyrrhachium, 
Auppaxiov, ShkumbUri) < Scampis, Scampinus. There is a long discussion on the his- 
torical development of these names; the most widespread view has it that the historical 
development of these names does not coincide with the laws of the Albanian histor- 
ical phonology (e.g. Scodra should have developed into something like *Hader; see 
Matzinger 2009: 23). By contrast, in the inner Balkans there are some geographic 
names that show a phonetic development according to Albanian “sound laws” (see 
e.g. Nish <Naissus, Naujooq). 

• Albanian has a rather modest number of old Greek borrowings (33 according to Olberg 
1972). If the ancestors of Albanians lived near the Adriatic coast and communicated 
with the inhabitants of Greek colonies (Dyrrhachium, Apollonia et al.), the number of 
Greek loans would have been higher. 

• Albanian lacks almost any inherited lexemes related to seamanship. We know, though, 
that the Illyrians, who were inhabitants of the seacoast, were artful seamen and pirates. 

All these pieces of evidence indicate, although somewhat indirectly, that in the epoque 
of late Antiquity the ancestors of Albanians did not live on the Adriatic coast or in close 
proximity to it. We can say nothing, however, about where they actually did live at this 
time. 

Another group of linguistic facts is more relevant for the solution of the Albanian 
homeland problem. These are the numerous Albanian-Romanian correspondences. These 
correspondences pertain to various linguistic levels. There are numerous correspondences 
in grammar, first of all a striking similarity in the architecture of the noun phrase (see 
the sections “Morphology” and “Syntax”). The source of the common development is, 
most likely, Albanian. There are also some phonological (e.g. diphthongization of e, see 



ALBANIAN 557 


the section “Phonology”) and morphonological (similar patterns of number expression 
in masculine nouns; see the section “Morphology”) resemblances. Numerous Albanian- 
Romanian lexical correspondences include early Albanian borrowings into Romanian, 
those words that entered both languages from some common (and unknown) sources as 
well as Latin words that are found exclusively in Romanian and Albanian (see Vata§escu 
1997, Kaluzhskaya 2001). These phenomena point to a long period of very intensive con- 
tacts between the ancestors of Albanian and Romanian. The time of these contacts should 
coincide with the period of the formation of the (Daco)Romanian language, that is, the 
middle of the first millennium A.D. As far as localization is concerned, we may hypoth- 
esize that it was the inner part of the Balkan Peninsula (maybe the territory of ancient 
Dardania; see Jokl 1924; the triangle Nis-Sofia-Skopje, see Weigand 1927). 

The layers of the Albanian lexicon. It is believed that Albanian belongs to languages 
with a high amount of lexical borrowings (in this respect Albanian is similar to Armenian; 
see Matzinger in print). The etymological dictionary of inherited (i.e. non-borrowed) 
Albanian words by Bardhyl Demiraj lists 572 tokens (without the derivates that origi- 
nated in Albanian; B. Demiraj 1997: 37). Since Albanian is a language with a relatively 
shallow written attestation, the importance of borrowings as data for the study of histori- 
cal phonetics (and, to a lesser degree, historical morphology) is extremely high. 

In terms of chronology, the earliest massive layer of borrowings in Albanian is the Latin 
layer (for a small number of old Greek loans, see above). The period of intensive Alba- 
nian-Latin language contacts might have begun in the early 1st century AD, after the final 
incorporation of the West Balkans into the Roman state (Romans’ first contacts with the 
West Balkan seacoast happened in the 3rd century BC) and continued until the 5th-6th 
century AD, eventually taking the form of Albanian-(Proto)Romanian contacts. The Latin 
borrowings in Albanian (see Mihaescu 1966, Haarmann 1972, Landi 1989, Vata§escu 
1997, Bonnet 1998) are very numerous: according to the various lists the number of Latin 
etymons in Albanian is no less than 600; these borrowings cover practically all parts of the 
Albanian lexicon in both semantic and morphological dimensions. It should be noted that 
although the main part of Latin loans belongs to the cultural lexicon (see e.g. gjvq ‘trial, 
court’ < iudicum; mjek ‘physician’ < medicus), there are also some basic vocabulary items 
that have been borrowed from Latin (see e.g. vij ‘come’ < venio; kembe Teg, foot’< *camba 
et al.). There are some derivational affixes borrowed from Latin (see the section “Word 
Formation”). Any direct influence of Latin on Albanian phonetics and morphology is rather 
uncertain. The Latin borrowings came into (Proto )Albanian at the time when (Proto)Alba- 
nian was experiencing great changes, and that is why these borrowings constitute the main 
source of our knowledge about the absolute chronology of Albanian language history. 

The Albanian-S/av/c language contacts (see Selishchev 1931, Jokl 1934, Desnickaja 
1968, Svane 1992, Ylli 1997) began after the Slavic invasion into the Balkans and con- 
tinue, in a sense, until now. Unlike Latin borrowings, which are for the most part com- 
mon Albanian, a great part of Slavic loans have a clear dialect distribution (according 
to Ylli 1997 only a quarter of some 1000 Slavic borrowings have a more or less pan- 
Albanian distribution). According to historical-phonetic criteria, there is a small group 
(circa twenty) of Slavic borrowings that had to enter Albanian relatively early (before the 
10th 1 1th century), whereas the chronological fixation of the most part of Slavic loans 
remains unclear. The main source of the Slavic loans for the southern dialects of Albanian 
was Macedonian-Bulgarian dialects, while the main source for the northern dialects was 
Serbo-Croatian dialects. Slavic borrowings belong to various semantic domains, but are 
mostly related to the cultural lexicon (oborr ‘yard’, cf. Bulg.-Mac. obor, SCr. obor; bujar 
‘noble’ < Slav, boljar). Some Slavic derivational suffixes also entered Albanian. 
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The Middle and Modem Greek borrowings (prevailing in southern Greek dialects of 
Albanian) date back to early medieval times. Italian loans date back to the beginning of 
active contacts by the Italian states with this part of the Dalmatian coast (11th century); 
many early Italian borrowings came from the Venetian dialect. 

After the Ottoman invasion (end of the 14th — 15th century) a great influx of Turkish 
words (including Arabic and Persian borrowings in Turkish) overflowed Albanian. In this 
layer, there are a great number of terms related to economic, administrative and spiritual 
life ( bajrak ‘banner, an administrative district in the Albanian mountains’ < bayrak, borxh 
‘debt’ < borQ , sevda ‘love’ < sevda), but also various pragmatically loaded, “gesture-like” 
words ( kismet ‘ fate (used as an interjection)’ < kismet), as well as some words that belong 
to basic vocabulary (jeshil ‘green’ <ve$il). Since the beginning of the Albanian national- 
cultural revival (the so-called National Renaissance or National Awakening) in the second 
half of 19th century the main trend in Albanian language policy was to eliminate Turkish 
loans as a symbol of national oppression and undesirable “eastemization”. Many Turk- 
isms really went out of use, and some shifted to the domain of popular, “sub-standard” 
language, but many Turkish words remain part of the present Albanian lexicon. Albanian 
also has a small amount of grammatical morphemes borrowed from Turkish (on Turkish 
borrowings see, first of all, Boretzky 1975-1976). 

Since the end of the 19th century and, especially, in the 20th and 21st centuries, an 
innumerable amount of internationalisms entered Albanian. The sources of these words 
depended on political circumstances (Italian and French before the Second World War, 
Russian in the first two decades of Communist rule, and English in recent decades). 

Albanian diaspora dialects have a great number of words from current and old neigh- 
boring languages: Arvanitica from Greek, Arbresh from Greek and Italian (a big part of 
the ancestors of present Italo-Albanians came from Greece) and so on. 

Areal position of Albanian. There have been numerous attempts to determine the areal 
place of Albanian within the IE languages (see, for example, Porzig 1974, Pisani 1950, 
1959, Jokl 1963, Hamp 1966, Orel 2000). The detailed investigation of lexical as well as, 
to a lesser extent, phonetic and grammatical isoglosses indicates that there are especially 
close connections of Albanian with North-Eastern IE languages (Baltic and Slavic), on 
the one hand, and with Greek, on the other hand. There are discrepant opinions with 
respect to the chronological assessment of these two groups of convergences. 

Recently, there has been a discussion of the so-called Balkan Indo-European, which 
includes Greek, Albanian, Armenian and Phrygian, and which is understood as an inter- 
mediate protolanguage (see the overview in Matzinger 2012a). 

Modem Albanian is considered one of the base members of the Balkan Sprachbund, 
which also includes Bulgaro-Macedonian, some Serbian dialects, Balkan Romance, mod- 
em Greek and Romani. Standard Albanian indeed shares the majority of the so-called 
Balkanisms, such as the presence of stressed hi, the postpositive article, the merger of 
genitive and dative, the lack of the infinitive, the volitive future, the object clitic doubling 
and some others (see “Phonetics”, “Morphology” and “Syntax”; see also Schaller 1975, 
Asenova 2002). The Balkanisms must have been formed in the broad temporal interval 
between the middle of the first millennium A.D. and the first centuries of the Ottoman 
invasion, which covers the second half of the Proto-Albanian and the Old Albanian period. 
The same time span is the period of the europeanization of Albanian: Albanian shares 8 or 
9 of 12 diagnostic features of Standard Average European as listed in Haspelmath 2001 
(among them the presence of both definite and indefinite articles, the possessive per- 
fect, relative clauses with relative pronouns, the participial passive etc.). Apparently, bal- 
kanization and europeanization of Albanian could have developed parallel to each other. 
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The Albanian alphabet. The modern Albanian alphabet, which was elaborated and 
accepted in 1908, is consistently phonological. It consists of 36 graphemes (including 
digraphs). The following graphemes should be explained: 


TABLE 12.1 ALBANIAN ALPHABET 


grapheme 

phonemic value 

c 

ts 

9 

c 

dh 

6 

e 

9 (stressed and unstressed, sometimes it isn’t pronounced) 

gj 

J 

1 

r 

11 

1 

n j 

n’ 

q 

c 

IT 

r: 

sh 

s 

th 

0 

X 

dz 

xh 

3 


Periodization of Albanian language history, reconstructed stages of Albanian 

In the following sections I shall use the following nomenclature for periods in Albanian 

language history. 

• Pre-Proto-Albanian period, the language of the ancestors of Albanians before the 
beginning of intensive contacts with Rome, i.e. before the first century AD. In the end 
of this period many important sound changes took place that moved (Proto )Albanian 
away from the (late) PIE state, e.g. the merger of PIE *o and *a as well as the merger 
of PIE *a and *e, considerable changes of the consonant system (the merger of the 
PIE Media and Media Aspirata, the changes in the gutturals, which, however, have 
preserved an opposition of the reflexes of all three series); and, probably, the stress 
changes (see the section “Phonology”). 

• Proto-Albanian, the language of the first millennium AD, the time of intensive con- 
tacts with Latin and early contacts with Slavic. A new wave of important changes took 
place during this period, among them the reduction of the pre- and post-tonic syllables 
(which drastically changed the phonetic shape of Albanian word); the old quantitative 
vowel opposition was lost; the intervocalic voiced consonants were weakened and, 
partly, lost (the exact conditions of this loss are not completely clear; see the section 
“Phonology”); as a result of the syllable reduction a large part of PIE endings were 
lost, and eventually many new morphological formants were agglutinated instead; 
important grammatical balkanisms arose in this period (most likely due to intensive 
contacts between the ancestors of the Albanians and Romanians; see the sections 
“Morphology” and “Syntax”). At the end of this period, just before the split into the 
Tosk and Gheg dialects, that is, at the end of the first millennium AD, the (Proto) 
Albanian language took a form that is quite similar to that of the Albanian language as 
known to us from written sources. 
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• Old Albanian, the language from the time of the first dialect split till the end of the 
period of the old written texts, that is, till the 18th century. This period can be further 
subdivided into Early Old Albanian (from the beginning of the first millennium AD 
to the earliest written texts) and Late Old Albanian ( 1 6th 1 8th centuries). Early Old 
Albanian was characterized by secondary contacts between the Gheg and Tosk dia- 
lects in the territories that roughly coincide with the modem Albanian-speaking area 
and by the processes of forced balkanization of Albanian, especially in the south of 
the Albanian-speaking territory. In the Late Old Albanian period (the time after the 
Ottoman conquest), the modern Albanian dialectal landscape was formed. 

• Modern Albanian, the Albanian language of the 1 9th-2 1 st centuries. 

This periodization presented here is almost identical to those used by B. Demiraj (1997), 
Matzinger (2006), Hock (2005, with an analytical overview of previous attempts at peri- 
odization), Schumacher/Matzinger (20 1 3) and de Vaan (in print ) and differs somewhat, 
mainly in terminological details, from those by Desnickaja (1984), Olberg (2013) and 
Orel (2000). 

In the sections considering phonetics and grammar I shall operate with reconstructed 
Albanian units (phonemes, fragments of the phonemes systems, forms, paradigms and so 
on). The following designations will be used: 

• PA for the stage that is reconstructed based on the comparison of the Gheg and Tosk 
dialects as well as the language of the Old Albanian written texts. It can be assumed 
that this reconstructed stage corresponds more or less to the Albanian language just 
before the dialect split, i.e. to Albanian at the very end of the Proto-Albanian period. 1 
Sometimes the designation EPA, early Proto-Albanian, is used for units that belong 
to relatively earlier stages in the development of Albanian within the Proto-Albanian 
period. 

• PPA (only when needed) for the stage of the Albanian development just before the 
beginning of the contacts with Rome, i.e. on the border between the Pre -Proto- 
Albanian and Proto-Albanian periods. 

In addition, the term Standard Albanian (SA) will be used. 


PHONOLOGY 

In this section I chiefly rely on the following surveys of Albanian historical phonology: 
B. Demiraj 1997, Matzinger 2006, Olberg 2013 (for vocalism); Schumacher/Matzinger 
2013; and de Vaan in print, I also take into account (jubej 1976; Sh. Demiraj 1996, Orel 
2000, Topalli 2006; see also the overviews of the development of Albanian sound system 
in Hock 2005 (vocalism and consonantism) and Vermeer 2008 (vocalism). 

In what follows the vowel, resonant and consonant subsystems of Albanian as well as 
the problem of Albanian stress patterns will be described in turn. For each subsystem, 
a discussion of the SA system will be followed by the reconstruction of the alleged PA 
stage, based on the data from dialects and the earliest written texts. The history of PIE 
sounds in Albanian will be the topic of the next part in each paragraph. Only the main 
lines of the sound development will be specified; for the others the necessary references 
will be given. The illustrative etymological material (based, first of all, on B. Demiraj 
1997, Schumacher & Matzinger 2013, Orel 1998 and Cabej 1976-2014) will be given in 
a very selective way. 
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Vowels 

SA vowel system 
i y u 

e a o 
a 

All Albanian vowels may be used in both stressed and unstressed positions. Therefore, the 
standard Albanian vowel system is very close to the idealized Balkan vowel system (five 
basic vowels + stressed a): the only extra vowel is the labialized high front phoneme y. 


Vowels: basic dialectal differences. Reconstruction of the PA system 

The Standard Albanian system above is very close to the system attested in the majority 

of Tosk dialects. Other dialects differ, however, in this respect from the standard language 

to a great extent. The major differences are the following. 

• In the Gheg dialects and in a small part of the Tosk dialects (namely in their southern 
parts, that is, the Lab and Cham dialects, as well as in some Arbresh dialects) there is 
an opposition between long and short vowels. This opposition is also attested in old 
Albanian written texts; thus, it used to be a pan-Albanian feature that was later lost in 
the greater part of the Tosk area. 

• The Gheg dialects have a special series of nasal vowels (a, f, u, y, e, but nasal o is 
lacking), which developed in the PA period and disappeared in the Tosk dialects. It 
should be noted that the authors of the first written texts either do not mark vowel 
nasalization at all or do so in a very inconsistent way. PA a developed into Tosk e /s/: 
Gheg asht ‘(s)he is’ - SA, Tosk. eshte id.; PA e developed into e or e depending on 
the dialect (and depending on the phonetic position in some dialects) in Tosk: Gheg 
vend ‘place’ - SA, Tosk vend , vend id. Whether there is/was a phonological opposi- 
tion between long nasal vowels and short nasal vowels in Albanian dialects and in PA 
remains an unsolved problem. 

• SA lacks diphthongs as independent phonological units; all vowel sequences within 
words are treated as biphonemic combinations. During the PA period the processes 
of diphthongization were at work in Albanian, which led to the arising of three diph- 
thongs, namely uo, ie and ye (on the origin of these diphthongs see below, section 1.3). 
These diphthongs later had different patterns of development in Albanian dialects: 

uo > ua in Tosk: gr'ua, gru'a ‘wife’ (with different stress patterns depending on the 
phonetic context and dialect); ue, u more rarely ua and even uo in various Gheg 
dialects: grue, gru ‘id.’; 

ie > ie in Tosk (with different stress patterns depending on the phonetic context and 
dialect): Tosk d'iell , dj'ell ‘the Sun’; i, more rarely ie, in Gheg dialects: Gheg 
dill ‘id.’; 

ye >ye (with different stress patterns depending on the phonetic context and dialect), 
sometimes ie in Tosk: kr've, by'e, krie ‘head’; ve, y in Gheg dialects: bye, by 
‘id.’ 

• The number of vowel phonemes differs in various dialects. The major differences 
(along with the presence/absence of long/short and oral/nasal oppositions) are (see 
Gjinari et al. 2007): 
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A) absence of y (> i) (as well as its correlates y and y) in some Albanian dialects, 
both Tosk (Lab and Cham dialects) and Gheg; 

B) In Gheg dialects only unstressed e hi is present. During the last centuries a 
process of the reduction of unstressed e has been under way in Albanian. This 
process is especially strong in the Gheg dialects. It seems that unstressed e is 
not phonemic in the majority of Gheg dialects. By contrast, analysis of the old 
Gheg texts points toward the phonemic status of unstressed e. 

In Lab and Cham dialects there is an opposition between hi and h:l. 

C) In northern Gheg dialects, there are two additional long vowel phonemes with 
very restricted spheres of use (only some words for each). 

a) /se/, /§/ < ae ( ae is used as such in the earliest written texts); this vowel is 
typical in a large part of the northern Gheg and central Gheg dialects: thlcel 
‘you said (aor.)’, other dialects: the ‘id.’ 

b) Idl < oe ( oe is used as such in first written texts); this vowel is found 
in some western northern Gheg dialects: v/e/se ‘dew’, other dialects: 
vese ‘id.’ 

• In the central Gheg dialects, a set of diphthongization processes (as well as some 
other vocalic changes) took place, probably in the Late Old Albanian period: /i / > /aj/, 
/ej/, /aj/, /e/; /u/ > /ou/, /au/; /a/ > /as/; lol > /oe/, /ou/; lyl > HI, /aj/, /oe/; the long and 
nasal correlates of these vowels followed parallel patterns of development. For the 
phonetic conditions of such changes, see Jusufi 2011; the phonemic status of the vari- 
ants that emerged in this process is not entirely clear. 

To sum up, the following vowel system can be reconstructed for the veiy end of the 
PA period : 

iilyyy uuuiTI ieyeuo 

e e e (a) o o 
a a a 

Notes: hi appeared only as an unstressed vowel. 


The opposition of the short and long vowels might have arisen either at the end of the 
PA or the very beginning of the OA period. In the latter case the opposition must have 
developed in the Gheg and Tosk zones independently under similar conditions, which is 
less plausible. It remains unclear whether there was ever an opposition between short and 
long vowels in unstressed syllables. 

Perhaps there also existed a special series of long nasalized vowels. 

The system as reconstructed above is very close to that of some conservative western 
northern Gheg dialects. 


1.2 Vowels: the origin 

The origin of PA long and nasalized vowels will not be traced in this section (see section 

1.3 on their origin). 
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TABLE 12.2 PIE SHORT VOWELS (IN STRESSED POSITION) 


PIE phonemes 

PA and SA reflexes 

examples 

1. *a (also *h,e), *o (also 
*h 3 e), *H (in the roots in 
zero-grade) 

PPA, PA *a > SA a 

SA bathe ‘broad bean’ < PIE 
*b h ak-; SA note ‘night’ 

< PIE *nok w t-; SA mas , mat 
‘to measure’ < PIE *mHt-(io)- 

2. *e (also *h,e) 

a) PA, SA e after CR- and/or 
before-NC; 

b) PA, SA ja; 

c) PA, SA iel *r, / 

d) PA, SA je (elsewhere) 

a) SA bredh ‘to wander’ < PIE 
*b h red (h) -; dhemb ‘to ache’ 

< PIE *gemb h - 

b) SA jam ‘I am’ < PIE *h,esmi 

c) SA bie ‘to bring’ < PIE *b h er- 

d) SA djeg ‘to bum’ < PIE 
*d h eg wh - 

3. *i (also *Hi) 

PA, SA i 

SA (i) lig ‘bad, ill’ < PIE *h 3 lig- 

4. *u (also *Hu) 

PA, SA u 

SA gjume ‘sleep’ < PIE * sup-no- 


Notes 


1. Latin o in borrowings manifests as o in SA; Latin a, a > Alb. a. Based on this evidence, the change 
from PIE *o to *a can be dated back to the PPA period. 

2. Latin “wide” e (< Lat. e, ce) experienced similar changes in roughly identical phonetic conditions 
also, so we may attribute them rather to the end of the PA period. 

b) The conditions are not entirely clear. 

c) Maybe the change has a later character and is parallel with the diphthongization of other PA 
vowels (see 1.3). 

?>-A. Lat. i > SA Alb. i (on the reflex Lat. i > Alb. e see below); Lat. u > SA u. 

PIE long vowels 

The vowels in the PIE lengthened grade, the sequences *Vh 1/2/3 and vowels resulting from 
the sequences *Vsr/l (with the following loss of the *s: S Adore < PIE *g h es-r-) must have 
merged in the PPA period. 


TABLE 12.3 PIE LONG VOWELS (IN STRESSED POSITION) 


PIE phonemes 

PA and SA reflexes 

examples 

1. *a (and *eh,),*e 

PPA *a > PA *o (earlier *o) > SA o 

SA moter ‘sister’ < PIE 

(and *ehj) 


*meh 2 -tr-; SA mot ‘time, 
weather’ < PIE *meh r t 

2. *6 (and *eh 3 *oH) 

PPA *5 > PA e (earlier *oe), SA e 

SA dere ‘door’< PIE *d h wor- 

3. *T (and *iH) 

PA, SA *i 

SA pi ‘drink’< PIE *pih 3 - 

4. *u (and *uH| 

PA *y > SA *y; later *y > H ( s )# 

SA gjysh ‘grandfather’ < PIE 
*suH-so-; SA mi, Gheg. mi 
‘mouse’ < PIE *mus 


Notes 


1. Old Greek borrowings in Albanian show the replacement of a by SA o. Lat. a gives Alb. a (for 
another opinion, see, however, Schumacher & Matzinger 2013: 221), and Lat. e > Alb. e, i. Based 
on this evidence, the merger of PIE vowels and their change to a vowel other than a must have taken 
place at the end of the PPA period. 

2. Latin 5 also changed to e, at least in the older group of Latin borrowings, which makes it possible 
to establish a terminus ante quem for this process. The intermediate stage of the development of this 
vowel should have been oe. Another reflex of Latin stressed 5 in Albanian is o; most likely, this reflex 
is found in the later layer of borrowings - it coincides with the reflex of the Latin stressed short o. 
Latin short o in Albanian also has the reflex u (maybe in the older borrowings). 
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It can be hypothesized (see Matzinger 2012b) that at the beginning of the contacts with Latin, that is, 
at the beginning of the PA period, the early PA vowel system still had quantitative distinctions. Thus, 
the Latin long o was replaced by early EPA *o (later *oe > SA e). Lat. short o by PA *u (short *o was 
lacking in the early PA system after the change from PIE *o to PPA *a), and both Lat. a and a by PA 
*a. Then, after the change *6 > *oe, both Lat. o and o (which have lost their quantitative distinctions 
in Late Latin and merged in East Romania) came to be replaced by PA *o (< PIE *a, *e). 

3. Lat. / reflects as SA *i (SA mik ‘friend’ < Lat. amicus ‘friend’). Latin short i had two reflexes: SA 
i and SA e. The same two reflexes are also characteristic of Lat. e. It seems that at the time of the 
contacts with Albanian, Lat. / and e merged in the close *e (it was characteristic of Vulgar Latin in 
general), which differed from open *e (< e, ae). In the first period of the contacts (when PIE *ay, *oy 
hadn’t yet changed into *e > SA e: see below, and also de Vaan 2004) Lat. *e was replaced by PA i: 
later, when this change was already in progress, Lat. e began to be reflected by *e. 

4. Lat. u has two reflexes in Albanian: SA y (older layer) and SA u. 


PIE diphthongs 

There is almost no reliable etymological data concerning the development of PIE long 
diphthongs in Albanian. 


TABLE 12.4 PIE DIPHTHONGS (IN STRESSED POSITION) 


PIE phonemes 

PA and SA reflexes 

examples 

1. *ay, *oy 

EPA *e > PA *e > SA e 

SA edh ‘kid’ < PIE *(H)ayg-; SA shteg 
'path, road’ < PIE *stoyg h - 

2. *ey 

PA, SA i 

S A dimer, Gheg dimen ‘winter’ 

< PIE *g h eymon 

3. *aw, *ow 

PA, SAa 

SA ag ‘dawn’ < *h 2 ewg-; SA laj ‘to 
wash’ < PIE *lh 3 -ew- 

4. *ew 

EPA *e >PA *e > SA e 

SA hedh ‘to throw’ < PIE *skewd- 


Notes 


3. Lat. au > SA a in earlier borrowings, and > af in later borrowings. 


“Inner Albanian ” development of the vowels in the PA period 
a) “Umlaut” (/-mutation) 

EPA *a (<PIE *a, *o, *H) > PA, SA e , if the following syllable has i (or i). This change 
affected inherited words and the words borrowed from Latin, but not the words of Slavic 
origin (a few exceptions are usually explained away as a result of analogical processes). 
This process had a great significance for the Albanian morphological system (see the 
section “Morphology”): SA net ‘nights’ < EPA *na(k)ti(h). The “umlaut” e does not have 
diphthong-like reflexes. The umlaut pattern EPA *o > PA, SA e and EPA *u > PA, SAy 
occurs in the same conditions but is less widespread. 

The conditions of the change PA *e > / in some verbal and nominal forms are not clear 
(/-mutation, Orel 2000: 145, de Vaan 2004: 71; change before the following consonant 
cluster, Hamp 1971: 224; the influence of the following palatal consonant, Schumacher & 
Matzinger 2013: 219-220). The significant role that the alternation e : / has assumed in 
the OA and SA verb system (see below, the section “Morphology”) could have been pro- 
pelled by analogical processes. 
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b) Diphthongization 

EPA *o (< *o, also < Lat. o, d) > *uo/_*r#, *1#, *n#; *n in this position disappeared: PA 
*kron (< Gr. Kpf|vr|, Dor. icpdva) > SA krua (Gheg krue, kru). In the anlaut *o (< *o) 
yielded Gheg vo-, Tosk, SA va- (SA vater , Gheg voter ‘hearth, fireplace’ < *h,eh,-tr-). 

EPA *oe > ve/_* r#, *n#; *n# disappears obligatorily, *r disappears optionally (see 
Schumacher & Matzinger 2013: 224, SA arsye ‘cause’ < Lat. ratidnem). 

On the development EPA *e > PA, SA ie /_*r#, *1# see above. 


c) Development of the unstressed vowels 

One of the most important phonetic changes in the PA period was the reduction of 
unstressed vowels. Two remarks that concern the chronology of this process should be 
made: (a) this process affected most Latin loans (see gusht ‘August’ < Lat. augustus); 
and (b) it continued to operate after umlaut processes happened (SA dash ‘ram’ < *des-i 
< *-oi). These facts indicate that the reduction of unstressed vowels was at least partially 
under way at the end of the PA period. These processes yielded a very complicated and 
hardly reconstructable picture. In the absolute anlaut unstressed vowels were lost (see 
gusht above), in word-internal positions (both pre -tonic and post-tonic) they were lost 
in some words (SA mbese ‘niece’ < Lat. nepotia ) and got reduced to /a/ in other words 
(SA kendoj ‘to sing, to read’ < Lat. cantare), and it is impossible to establish the strict 
rules that governed this distribution. As for the auslaut vowels, it seems that the process 
of reduction was influenced by morphonological conditions (see the section “Morphol- 
ogy”). Generally speaking, it could be supposed that the reduction of unstressed vowels 
was a rather long-term process: it presumably began as early as in the Pre-Proto-Albanian 
period and continued at least until the end of the Old Albanian period. A more detailed 
analysis with examples can be found in de Vaan in print, B. Demiraj 1997: 46^17. 


d) Emergence of nasalization 

In the last phase of the PA period a series of nasal vowels developed out of VN sequences 
with possible loss of *-n in the auslaut syllables (see also 1.1). The vowel o did not 
undergo nasalization and changed instead into the diphthong ua before nasal consonants. 


e) Emergence of the new quantitative opposition 

The new quantitative opposition arose, it seems, at the end of the PA or the very beginning 
of the OA period. The main sources of the lengthening were various processes of contrac- 
tion, which involved both inherited IE words and Latin borrowings in Albanian (see e.g. 
be ‘oath’ < *b h eyd h a; mjek ‘physician’ < Lat. medicus). Lengthening was also caused by 
the influence of the following resonants -r and -/. Although dialectally restricted, the pro- 
cesses of lengthening intensively continued, however, later in the Old Albanian period; 
e.g. vowel lengthening could occur as a result of apocope: eastern-northern Tosk plakbl 
‘old woman’ - Gheg lpla:kl ‘id.’ 


The development of the vowel system. Conclusion 

To sum up, the main trend of the development from the PIE vowel system into the Albanian 
vowel system was the loss of the IE quantitative distinctions and their partial replacement 
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by qualitative distinctions. This process took a long time. Some of the important changes 
that disrupted the symmetry of the long and short vowel subsystems had occurred already 
in the PPA period (the merger of PIE *a and *o, the merger of the PIE *a and *e). At 
the beginning of the PA period, the “old” quantitative opposition was still present in 
Albanian. Very important vocalic changes took place in the PA period. Reflexes of Latin 
borrowings make it possible to establish a relative chronology of these changes (see e.g. 
de Vaan 2004). Among the most important PA changes are the development of *u into 
*y, the change *a > *o > *o, and the development of various reflexes of PIE *e. At a cer- 
tain moment the PA system possessed two e-like phonemes: the open *e (< PIE *e) and 
close *e (< PIE *ay, *oy, *ew). The second one did not have diphthong-like reflexes. The 
presence of these two phonemes was paralleled by the Balkan Latin vocal system, which 
also had two e-phonemes (and only one /o/ phoneme, which is also true in Albanian). 
Whether there was a direct Latin (Roman) influence on Albanian phonetics in this respect 
remains, however, unclear. Later in the PA period the processes of the diphthongization 
before sonorant phonemes took place (*e > ie; *o (or *o) > uo; *oe (< PIE *5) > ye) as 
well as umlaut processes. The vowel e, resulting from the /-fronting of a (as well as from 
*ce < PIE *5), merged, at the end of the PA period, with the identical reflexes of *e and 
*e. Thus, a new quantitative opposition emerged; finally, nasalized vocal allophones gave 
rise to a special series of phonemes. 


Sonorants 

SA sonorant system 
m n n’ 

1 1 

r r: 

j 

Sonorants: main dialectal differences. Reconstruction of the PA system 

• The absence of the opposition r : r. (orthographic rr) in some dialects (both Gheg and 
Tosk). 

• The change of i into d (some Gheg and Lab varieties) and into y (some Arbresh 
dialects). 

• Some eastern northern Gheg dialects lack the phoneme n ’ (orthographic nj; they have 
ni or n instead). In many dialects (both Gheg and Tosk) as well as in SA n ’ changes to 
j in some positions, defined in either phonetic or morphonological terms. 

• In most Gheg dialects and in a few Tosk dialects there is a special phoneme ij, which 
originated from the cluster ng. 

• The Tosk rhotacism (the change -n- > -r-\ SA, Tosk bera - Gheg bana ‘I made (aor.)’). 
Rhotacism applied to both inherited Albanian words and Latin borrowings (SA, 
Tosk. rere ‘sand’ < Lat. arena). By contrast, only two early Slavic borrowings in 
Tosk demonstrate the traces of the rhotacism. These facts make it possible to date 
this change (and the very moment of Albanian dialectal split) to the last centuries of 
the first millennium AD. The rhotacism is not a positionally conditioned restriction 
anymore in SA (nor in the Tosk dialects), which has many words with an intervocalic 
-n-. Along with later borrowings, there are inherited words with -/?-, as well as Latin 
borrowings among these words; in these words -n- originated from various consonant 
clusters (see below) and had the form of a geminated or long n: at the moment of the 
beginning of rhotacism. 
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• There is an interdialectal correspondence between / in Cham and some Arbresh and 
Arvanitika and other Tosk diaspora dialects and j in most of the Tosk and all of the 
Gheg dialects as well as in SA that emerged as a result of the development of some 
clusters with *1: *ly > llj (Cham mile ‘thousand’: SA mije ); *-li > llj (in the plural 
forms: Cham popul ‘people pi.’ — SA popuj). This correspondence makes it possible 
to reconstruct the sound A for the PA stage (see Schumacher & Matzinger 2013: 
245). It seems, however, that this sound was in a complementary distribution with 
other laterals (/ and /) and can be treated as an allophone rather than an indepen- 
dent phoneme (possibly it continued to function as a cluster ly at the end of the PA 
period). 

Thus, the following subsystem of sonorants can be reconstructed for the PA stage: 
m n n: n’ 

i i a) 

f r: 

j 

Sonorants: the origin 


TABLE 12.5 

PIE SONORANTS 


PIE phonemes 

PA and SA reflexes 

examples 

i. *i 

a) PA *1, SA l everywhere except 
intervocalic position 

SA lagje 'quarter’ < PIE *log h - 


b) PA *1, SA / /V V (except the 
position before the *-i, see 
above, 2.1.) 

SA hell ‘spear, spit’ < PIE *skol- 

2. *r 

a) PA, SA r/V V 

a) SA (/) mire ‘good’ < PIE *miHr- 


b) PA, SA r:l#_ 

b) SA rrjedh 'to flow’ < PIE 
*h 3 reg- 

3. *m 

PA, SA m 

SA moter ‘sister’ < PIE *meh 2 -tr- 

4. *n 

PA, SA n 

SA nate 'night’ <PIE *nok w t- 


Notes 

The SA (or PA) j arose from the various sources (diphthongization of e, development of /A/, the 
anti-hiatus sound and so on). On the development of the PIE *y in Albanian see below (the section on 
obstruents). 

1 . Lat. / has reflexes with a similar distribution in Albanian. 

2. The distribution of the two reflexes of PIE *r is unclear. The picture is further complicated by the 
presence of numerous doublets with r/rr alternation among the Latin borrowings in the MA (also in 
the OA texts). PIE *VmV > SAr:; PIE *#sr, *wr > SAr:- (see examples in Schumacher & Matzinger 
2013: 248). *-ri > j (in the plural forms: bij ‘sons’ < *bir-i); whether an intermediate stage was fXJ 
remains unclear. 

Lat. r > PA, SA r/r: both in anlaut and in intervocalic position (with a tendency to follow the pattern 
of PIE reflexes). 

3. On the reflexes of PIE *-m in some nominal endings see the section “Morphology”. Lat. m > Alb. m. 

4. Lat. n > SA n\ in Tosk, this sound develops into -r- in the intervocalic position (see above on rhota- 
cism). Lat. gn > SA nj. 

PIE intervocalic *-sn-, *-Cn-, *-nC- yielded PA *-nn-, which further develops into -n- in both Tosk 
and Gheg (for examples, see Shumacher & Matzinger 2013: 249-250). 
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Further development of PA sonorants 

On rhotacism and the development of the sound k see 2.1. 

PA *nj, *ni ■ * 5 . \ nj,j . SA bt inj horns pi. ' [ t\ /j i in~ t , 5 . \ bej , t i 1 1 e p banj ~ I ^ [ t \ bo nvo 
(on the change nj > j see 2.1). PA *-rn- > SA -r:. 

On the development of n in closed auslaut syllables see 1.3. 

Syllabic sonorants 

PIE syllabic *m and *n > PA a , SA a: SA (/) gjate ‘long’ < PIE *dlng h -t-; SA shtate 
‘seven’ < PIE *septm-. 

The main reflexes of PIE *1 and *r are PA, SA li and ri respectively (a detailed analysis 
of the distribution of PIE syllabic sonorants in Albanian can be found in Schumacher & 
Matzinger 2013: 229-232). 

Obstruents 

SA obstruents 
pb td 

fv 0 6 

s z 

J 3 
ts dz 
tjd3 

Obstruents: main dialectal differences. Reconstruction of the PA system 

• In some dialects (both Tosk and Gheg) the phoneme h is absent; in other dialects 
(mainly Tosk) it is absent in certain phonetic positions. In many Gheg dialects h alter- 
nates in some positions with f It should be noted that SA h often has a non-etymolog- 
ical character. On SA h < PIE *sk, *sk, see below. 

• The palatal phonemes c, j (orthographic q, gj ) manifest in most of the Gheg dialects as 
affricates (and in some places as fricatives); in some eastern northern Gheg dialects (in 
Kosovo) they merged with the affricates tf dj (orthographic q, xh). In the majority of 
Albanian dialects the palatals c,j are the result of the merger of PA *c, *j and the PA 
clusters kl, gl (they were registered in this form in some Old Albanian written texts). 
In some dialects, however, the latter clusters either remain intact (Cham klumesht 
‘milk’: SA qumeshte; Cham gluhe ‘language’: SA gjuhe) or yielded other reflexes 
(western-northern Gheg kv, gv; eastern-northern Gheg k, g). 

• In some southern Gheg dialects the phonemes 9 and d are absent (they changed into x 
and y respectively). 

• In the majority of Gheg dialects and some Tosk dialects the consonant clusters mb, nd 
were simplified, respectively, into m, n (sometimes into b, d). 

• Devoicing of voiced obstruents in the word-final position is characteristic of an abso- 
lute majority of Gheg dialects and many Tosk dialects (on the potentially common- 
Albanian character of this phenomenon see Schumacher & Matzinger 2013: 267-270). 

Some Albanian obstruents are the relatively late products of inner Albanian developments 
(various kinds of sandhi, voicing in the position near sonorants and voiced obstruents, 
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spontaneous consonant strengthening) supported by borrowings. Such are the phonemes 
ts, dz, tf dj, j. It seems that these phonemes were formed in the PA period (except for the 
probably more recent j and dj). 

Thus, despite interdialectal differences, we should reconstruct the PA obstruent system 
as nearly identical with the SA system. The only difference is the possible absence of the 
phonemes j and rfj in the PA system. Besides, it can be supposed that the voiced labial 
spirant (> SA v) was bilabial (fi ) in the PA period. 


Obstruents: the origin 
Occlusives 

PIE Media Aspirata merged with Media before the beginning of the PPA period. 


TABLE 12.6 

PIE OBSTRUENTS 


PIE phonemes 

PA and SA reflexes 

examples 

i. *p 

PA, SA p 

SA (i) plote ‘full’ < PIE *pleh,-t- 

2. *b h , *b 

PA, SA h 

SA 0) bardhe 'white’ < PIE 
*b h org- 

3. *t 

PA, SA t 

SA tre 'three’< PIE *treyes 

4. *d, *d h 

a) PA, SA d in the initial position 
and in the inlaut after n and 
before r 

a) SA djeg ‘to burn’ < PIE *d l, eg wh - 


b) PA, SA -d- in the inlaut 

b) SA hedh 'to throw’ < PIE 


elsewhere 

*skewd- 

5. *k 

EPA *ts > Late PA *0 > SA 8 

SA (i) thelle ‘deep’ < PIE *kel- 

6. *g, *g h 

EPA *dz > Late PA *6 > SA d 

SA dhemb, Gheg dhamb 'tooth' 

< PIE *gomb 1 '- 

7. *k 

PA, SA/t 

SA kam ‘to have’ <PIE *keh,p- 

8. *g, *g h 

PA, SAg 

SA gardh ‘fence’ < PIE *g h ord h - 

9. *k w 

a) EPA *tf > Late PA *s > SA s 

a) SA sjell ‘to bring’ < PIE 


before front vowels 

*k w elh,- 


b) PA, SA k elsewhere 

b) SA pjek ‘to bake’ < PIE *pek w - 

10. *g w , *g wh 

a) EPA *d3W > Late PA *z > SAz 
before front vowels 

a) SA zorre ‘gut’ < PIE *g wh em- 


b) PA, SA g elsewhere 

b) SA djeg ‘to bum’ < PIE 


•I' hg p- wh_ 


Notes 

2. A special problem is the reflexes of PPA *b (as well as other voiced occlusives, see below) in 
the intervocalic position. On the one hand, there is a change from PPA *-b- to PA, SA -b- (SA 
grabe ‘erosion, hollow’ < PIE *g h rob h -a); on the other hand, there is also a change from PPA *-b- 
to PA, SA -#- (SA det ‘sea’ < PIE *d h ewb-et-; see another opinion in Schumacher & Matzinger 
2013: 233). The conditions of this distribution are not clear (see £abej 1976: 241-242, Orel 2000: 
78-80). The loss of the intervocalic -b- is also attested in Latin borrowings (the reflex of the Lat. 
b in other positions is SA b), at least in those borrowings that belong to the older layer: SA kale, 
Gheg kal ‘horse’ < Lat. caballum. 

4b) In some of the PIE lexemes *-6- has fallen out: SA be ‘oath’< *b h oyd h -a. Lat. d yielded PA, SA 
-6- in intervocalic position and after r; sometimes *-6- (< Lat. -d-) disappears. The weakening of 
*d (> d) in the sequence *-rd continued during the PA period and even later. 

5-10) Albanian belongs to the few IE languages that retain different reflexes for the three series of PIE 
gutturals. 
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5-6) PIE *k > PA, SA k, PIE *g, *g h > PA, SA g before resonants: SA quaj, Buzuk kluoj ‘to call, to give 
a name’ < PIE *kluw-. 

6) Sometimes in the initial position both PIE *g and *g h yield SA d : SA dore; PIE *g h esr-. The con- 
ditions of the distribution of the reflexes did are unknown. The idea that in anlaut the reflexes of 
PIE *g- (> SAd-) and PIE *g h - (> SA d-) differ (see Orel 1998: XX; B. Demiraj 1997: 63) does not 
have convincing etymological evidence. 

PIE *g h w > EPA *d 3 W > Late PAz > SAz: SAze, Gheg. za (< zari) < PIE *g h weno- (there are no 
reliable examples for a parallel development of PIE *kw and *gw). 

7-8) With further palatalization into c,j before front vowels; see below. 

9-10) With further palatalization into c,j before front vowels; see below. 


TABLE 12.7 PIE *S, *Y, *W 


PIE phonemes 

PA and SA reflexes 

examples 

1 *s 

a) EPA *z > PA, SA j/#_V 

a) SA gjak ‘blood’ < PIE *sok w - 


b) EPA *s > PA, SA//#_V 

b) SA shi ‘rain’ < PIE *suH- 


c) EPA *s > PA, SA Ji V(+front) 

c) SA gjysh ‘grandfather’ < PIE 
*suh-s-io- 


d) EPA *h > PA, SA 

d) SA neve ‘we (gen.)’ < PIE 


0/V V(+back) 

*nosom 


e) EPA *h > PA, SA 0/_# 

e) SA -# ‘nom. m. sg.’ < PIE *-os 

2. *y 

a) EPA *z (?) > PA, SA j/# V 

a) SA gjem ‘bridle’ < PIE *yom- 


b) PA, SA 0/V V 

b) SA tre ‘three’ < PIE *treyes 

3. *w 

a) PA *w > SA v/#_V 

a) SA vesh ‘to put on (clothes)’ < 
PIE *wes- 


b) PA, SA 0/V V 

b) SA ve ‘widow’ < PIE *wid h ew- 


Notes 


1 . There is no consensus with respect to the distribution and even the reflexes of PIE *s in Albanian. 
It is not known exactly what determines the distribution of } and / in the initial position. Two other 
reflexes of PIE *s (SA h for anlaut and inlaut and SA 6 for anlaut) are now rejected by some research- 
ers (see e.g. Schumacher & Matzinger 2013: 258-265). On some clusters with *s see below. Lat. 
*s > PA, SA / the same outcome is also found in the older layer of Slavic borrowings (in most Slavic 
borrowings s reflects as SAsj. For some chronological considerations related to the development of 
PIE *s, see below. 

2. See Schumacher & Matzinger 2013: 252-254; for other opinions see £abej 1976; de Vaan in print. 
The distribution of the reflexes of Lat. lyl is, generally speaking, similar to that of PIE *y (sometimes 
Lat. intervocalic lyl > SA j). 

3. The distribution of the reflexes of Lat. /w/ is, generally speaking, similar to that of PIE *w. 

Laryngeals 

On the reflexes of the PIE laryngeals that were adjacent to vowels, see 1 .2. When adjacent 
to consonants, laryngeals disappeared. For the possible influence of the laryngeals on 
Albanian consonants see B. Demiraj 1997: 60-61. For the influence of the laryngeals on 
the neighboring syllabic sonorants see Schumacher & Matzinger 2013: 230-231. For the 
development PIE *Hr- > PA, SA r- see B. Demiraj 1997: 70-71. 

Reflexes of some PIE consonant clusters in Albanian 


PIE *sk/sk/sk w > PA, SA h: SA hedh ‘to throw’ < PIE *skewd-. 

PIE *sp > PA, SA fl #_ : SA fare < PIE *spor-. Another source of PA, SA /is borrowings. 



ALBANIAN 571 


PIE *st > PA, SA ft. SA shteg ‘path, road’ < PIE *stoyg h -. 

PIE *zd > PA, SA d : SA pidh ‘female pudenda’ < PIE *peysd (h) -. Lat. sc, sp, st give, 
respectively, SA sk, sp, st. 

Clusters of the type “dental + y,” “guttural (< PIE velars and labiovelars) + y” devel- 
oped into EPA *tf, *d 3 W, PA, SA s, z, depending on whether the occlusive was 
voiced or voiceless. 

For the development of some other consonant clusters see B. Demiraj 1997: 54—68, 
Matzinger 2006: 68-81, Schumacher & Matzinger 2013: 243-244. 


Chronology of Albanian palatalizations 

As mentioned above, there were in Albanian different processes of palatalization of PIE 
velars (sometimes developed further into assibilation): (a) “satem”-assibilation of PIE 
palatals gave rise to interdental spirants 0 and d; (b) assibilation of PIE labiovelars (and 
of the groups “palatal + w” but not of the “pure” velars) before the front vowels resulted 
in s, z; and (c) palatalization of *k, *g (from both PIE velars and labiovelars), also before 
front vowels, resulted in c and j. 

a) As for the satem-palatalization, a common opinion is that there was an intermediate 
stage of the development, namely the affricates of the type ts, dz. It is not known 
exactly when the process of assibilation took place, but it is possible to assume that 
it appeared in the first half of the PA period (the reason for this hypothesis is that 
various Latin palatalized consonantal sequences never reflect as SA 0 or ff). It is 
not completely clear when the result of the development of the PIE voiced palatal 
merged with *d < PIE *d (h) (and whether this process could have influenced the dis- 
tribution of the reflexes d/d that go back to PIE *g ,h) ). 

b) The assibilation of PIE labiovelars before the front vowels had to take place also rel- 
atively early (before the change PIE *e > PPA a; see SA zorre ‘gut’ < PIE *g wh em-). 
The reflexes of PIE labiovelars merged in SA with the results of the development of 
the PIE groups *ty, *dy, *k (w) y, *g (w) y as well as Lat. tj, dj. This fact as well as the 
reflection of the 5 of Latin and early Slavic borrowings as SA sh indicates convinc- 
ingly that in the EPA period the reflexes of labiovelars must have been represented 
a kind of affricates, and the change into the s had to take place at the very end of the 
PA period. 

c) The process of palatalization that resulted in c and } sounds belongs, on the con- 
trary, entirely to the PA period. This change took place in the positions before i, 
y and e (including the je and ja reflexes of PIE *e). This wave of palatalization 
should have begun before the change in the unstressed vowels (see SA /pled 
‘old men (pi.)’ < *plaki; SA virgjer ‘virgin’ < Lat. virginem ) and continued to be 
at work after the umlaut processes (that is, e < *a did trigger the palatalization, 
which explains e.g. this pattern of palatalization of PIE labiovelars: OA qell ‘to 
bring’ < EPA *kal-it < PIE *k w olh 1 -). Lat. c, g before front vowels as well as Lat. 
clusters cy, gv also give SA c, j. In the later borrowings, k, g are reflected as c, 
} rather seldom and can be explained as a result of analogical processes. In the 
PA period the palatalized reflex of *g merged with reflexes of the PIE *s- and 
*y- (which may have led to the phonologization of f). During the OA period c,j 
merged in the majority of Albanian dialects with the reflexes of the PA clusters 
*kl, *gl (see 3.1 above). 
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Stress 

The stress in modem Albanian is dynamic. It shows a clear tendency to be columnal and 
to fall in the vast majority of cases on the stem. In effect, the morphological forms of 
words typically are not differentiated by the position of the stressed syllable. The only 
exception is a small group of nouns and adjectives that have different positions of the 
stress in the singular and plural forms. These include some nouns with the plural suffixes 
-inj, -ij, -enj (SA sg. dr'aper ‘banner’ - pi. draper'inj, SA sg. I'ume ‘river’ - pi. lum'enj ), 
SA sg. dh'ender ‘groom, son in law’ -pi. dhend'ure. Two adjectives (SA sg. m. (i) madh 
‘big’ - pi. m. (te) medh'enj, SA sg. m. (/') keq ‘bad’ - pi. m. (te) keq'ij ) have an additional 
vocalic alternation. The same rightward shift of the stress in plural forms is also observed 
in Turkish borrowings with the plural suffix -llarel-lere (SA sg. ag'a ‘aga’ - pi. agall'are ); 
however, in these words the pattern is explained by the parallel borrowing of both sin- 
gular and plural forms (the very suffix -llarel-lere is used only with Turkish loanwords). 
Two nouns have a leftward stress shift in the plural: SA sg. njer'i ‘man’ - pi. nj'erez, SA 
sg. kall'i ‘ear’ - pi. k'alleza. It can be noticed that in all these cases there is a non-produc- 
tive pattern with clearly different singular and plural stems. The majority of derivational 
suffixes in Albanian are stressed. 

As for the position of the stress within the word, it is relatively free. There is a tendency 
to place the stress on the penultimate syllable; however, there are multiple exceptions to 
this pattern. For example, there are many uninflected words with the last syllable stressed 
( edh'e ‘still’ and so on). As for inflected words, the majority have forms with the penul- 
timate syllable stressed (on the rules of the SA accent see Buchholz & Fiedler 1987: 53). 

Albanian dialects have relatively few differences with respect to their stress patterns. The 
tendency to stress the penultimate syllable is strongest in Gheg dialects: there are words 
(nouns and pronouns) with stress on the penultimate syllable in Gheg, whereas in Tosk coun- 
terparts the stress is on the last syllable (SA, Tosk njer'i ‘man’ - Gheg nj'eri ). There is also a 
tendency in Gheg to shift the stress towards the penultimate position in Turkish loans (SA, 
Tosk par'a - Gheg p'are ‘money’ < Turk, par' a). Dialectal variation is also observed in the 
realization of stress in diphthongs ue, ye, ie in various phonetic contexts (see “Vowels”). 

Generally, the stress pattern in old written texts is close to the modem system. The 
main difference is a weaker degree of reduction in unaccented vowels, in particular, the 
vowels in word-final syllables. As a result there are a number of words in these texts with 
their prepenultimate syllable stressed that correspond to SA words with the penultimate 
syllable stressed (e.g. old Gheg gj'arpene ‘snake’ - SA gj'arper). For possible differences 
in the stress position in verbal non-active forms see “Morphology”. 

Thus, at the end of the PA period the stress patterns must have been very near to the 
modem ones, the difference mainly amounting to a greater variability in the position of 
stress in the word, which is accounted for by a lesser degree of vowel reduction in PA. 

It is well known that Latin borrowings (as well as later loans; see, however, above on 
Turkish borrowings) maintained their accent patterns in Albanian. It can be hypothesized, 
then, that immediately before the beginning of the Latin-Albanian contacts (that is, at the 
end of the PPA period) Albanian stress patterns didn’t differ essentially from the late PA 
ones. For vowel reduction, which was accounted for by the dynamic character of Alba- 
nian stress and changed the shape of Albanian words, see above (“Vowels”). 

It is not possible to define the time limits and the exact rules of the transition from 
the late IE stress system to the Albanian one. The only thing that is certain is that Alba- 
nian somehow abandoned before the beginning of the PA period both the IE final accent 
and accentual mobility within nominal and verbal paradigms (Schumacher & Matzinger 
2013:208-209). 
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Morphonological alternations 

Albanian is a language in which morphonological alternations play a relatively important 
role in the expression of morphological oppositions. Most of these alternations reflect the 
phonetic changes in the PA period (on umlaut, diphthongization, change e > i, see “Vow- 
els”; on palatalization, see “Obstruents”; on loss of the final -n, development of *-rn-, see 
“Resonants”). The old PIE alternations, such as first of all ablaut, left but some minor 
traces in the Albanian verb system. For more information on the realization of the main 
morphonological alternations, see “Morphology”. 


MORPHOLOGY 

0. Albanian is a fusional language that combines synthetic and analytic features. Synthetic 
features are only partly inherited from the PIE state, while a considerable part of the flex- 
ion, especially in the nominal system, is the result of the processes of the new synthesis 
that took place in the PA and even in the OA periods. The level of analyticity is, however, 
rather high. It is fairly typical of the numerous analytic constructions to make broad use 
of clitics, in both the nominal and verbal domains. Albanian has some grammatical traits 
that are rather unusual from an areal-typological perspective: a morphological optative 
and the analytic double case agreement; see below. At the same time, many grammatical 
features, both morphological and syntactical, identify Albanian as a language that belongs 
both to the Balkan Sprachbund and to the languages of the Standard Average European 
type. The morphological part of this chapter is based, first of all, on Buchholz and Fiedler 
1987 (for SA), Gjinari et al. 2007 (for dialectological material), and Sh. Demiraj 1986, 
Matzinger 2006, Schumacher and Matzinger 2013, Fiedler 2004, Klingenschmitt 1981, 
1994, and Orel 2000 (for historical and comparative interpretation). 

Noun 

The SA noun system 

Grammatical categories of Albanian nouns. The noun in SA is characterized by the cat- 
egories of gender, number, case and definiteness. The gender of the noun is defined by 
the agreement features of the dependent elements (determiners, agreeing adjectives and 
so on) and has three main values: masculine, feminine and neuter. For animate nouns the 
gender (masculine or feminine) is mostly semantically determined. The neuter encom- 
passes derived deverbal and deadjectival nouns with abstract meaning as well as a hand- 
ful of old concrete nouns (all of these concrete nouns also have masculine or feminine 
variants that are more common in SA). Feminine and neuter nouns don’t differ in the plu- 
ral by their agreement properties. Besides, there is a fourth agreement class that includes 
inanimate nouns that agree according to the masculine model in the singular and accord- 
ing to the feminine model in the plural (this is called “ambigender” in the Albanian gram- 
matical tradition). On the inflectional morphology of Albanian gender see below. 

The category of number has two values: singular and plural. The rules of the derivation 
of plural forms are extremely complex and opaque (see below). 

Albanian has the following cases', nominative, accusative, genitive -dative and abla- 
tive; the ablative is distinguished from the genitive-dative only by one optional fonn in 
the noun paradigm as well as in certain pronominal paradigms. Genitives and datives, 
which are traditionally differentiated in Albanian grammars, actually differ syntactically. 
The form that is used in the possessive constructions (= genitive) is always preceded 
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by a special prepositive clitical formant (“article” in the Albanian grammatical tradi- 
tion), which agrees with the nominal head of the construction: djal-i i fshatar-it boy- 
NOM.M.SG.DEF. ART.NOM.M.SG. peasant-GEN./DAT.M.SG.DEF. (on the forms of 
this “article” see “Adjective” below). The form that is used in the syntactic position of the 
indirect object and sometimes adverbially (= dative) requires the use of the pronominal 
clitic before the verb form (see “Syntax” below). 

Definiteness is expressed in Albanian by the means of the postpositive article, which is 
agglutinated to the singular and plural noun stems. Thus, there are two sets of the nominal 
paradigms in Albanian: the definite and the indefinite ones. Besides, there is a prepositive 
indefinite article (homonymous to the numeral ‘one’). Thus, the category of definiteness/ 
indefiniteness is best analyzed as having three values in Albanian. 

Declension types of Albanian nouns. The Albanian noun has four declension classes in 
the singular; see Table 12.8. 


TABLE 12.8 NOUN DECLENSION IN SINGULAR 



I (masculine) 

II (masculine) 2 

III (feminine) 

IV (neuter) 


indef. 

def. 

indef. 

def. 

indef. 

def. 

indef. 

def. 

nom. 

mur 1 

muri 

mik 

miku 

vajze 

vajza 

te 5 folur 

te folurit 6 

acc. 

mur 

mur in 

mik 

mikun 

vajze 

vajzen 4 

te folur 

te folurit 6 

gen.-dat./abl. 

muri 

murit 

miku 

mikut 

vajze 3 

vajzes 4 

te foluri 

te folurit 


Notes: mur ‘wall’, mik ‘friend’, vajze ‘girl’, te folur ‘speech’. 


All words and word forms in the sections on SA morphology are from Standard Albanian, unless indicated 
otherwise. 

Each class can be further divided into minor subclasses, which would differ by the alternations of the stem 
in various case forms and, in the cases of classes III and IV, by the phonemic and/or orthographic shapes 
of the endings. 

The correlation between the declension class and gender isn’t straightforward (except for the IV class, 
which is exclusively neuter). The exceptions, however, aren’t numerous (these are mostly proper nouns). 
Besides, the numerous ambigender names belong to the I and II subclasses. 

1 Although there is a homonymous nominative and accusative form in the indefinite declension, these 
paradigmatic cells are differentiated by the form of agreeing elements. 

2 The Class II includes all the nouns whose stem ends in -k, -g, -h, a portion of the masculine nouns 
that end with a stressed vowel, as well as the word krye ‘head’. 

3 In some subclasses the ending is -je. 

4 In some subclasses the endings are -ne, -se. 

5 Deverbal and deadjectival nouns retain the prepositive “article” of the adjectives (see “Adjective” 
below). 

6 In some subclasses the ending is -(e) t, -te. 


The plural forms a single declension class that is divided into subclasses that differ 
according to the phonetic/orthographic shapes of endings depending on the plural stem 
auslaut. It is noteworthy that this auslaut typically is not determined by the gender of the 
noun. The general schema of the plural declension is as follows: 
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TABLE 12.9 NOUN DECLENSION IN PLURAL 

indefinite 

definite 

nom./acc. 1 mure 

gen.-dat. mureve 2 

abl. mureve/ mures h 3 

muret 

mureve 

mureve 


Notes 


1 The nominative and accusative forms differ neither morphologically nor by the shape of the agreeing 
elements. 

2 In the plural, there is no opposition between indefinite and definite forms for the genitive-dative 
case. 

3 The indefinite ablative ending has two allomorphs. Their sphere of use is determined (with many 
exceptions) by the syntactic position of the ablative word forms. 


Derivation of nominal plural forms. Only for some nouns can the plural form be pre- 
dicted based on the singular one (for other nouns the plural is idiosyncratic). 

For masculine nouns the plural form is derived by means of several consonantal ( k/q , 
g/gj, lllj, n/nj, r/j) and vocalic (a/e) alternations, as well as by means of more or less 
productive suffixes (-e, -e, -a, -ra, -inj, -llare/lere). Some masculine nouns have identi- 
cal stems in the singular and in the plural. Plural suffixes for feminine nouns are -a and 
-ra, but a considerable portion of feminine nouns do not differentiate singular and plural 
stems. There are a few nouns, both masculine and feminine, that have irregular plural 
forms. Most of the old neuter nouns have the plural suffix -ra. Some SA nouns have 
doublet plural forms. 


Basic dialectal differences in the nominal system 

1) An additional case - the locative - is preserved in both old texts and in the dialects 
(northern Tosk and southern and central Gheg). This case is used after some prep- 
ositions with the locative meaning (ne mal-t ‘on the mountain’, ne male-t ‘on the 
mountains’) and doesn’t differ formally from the accusative in the indefinite form. 

2) In SA, the ablative is the case that tends to merge with the genitive -dative. A fur- 
ther step of this development is observed in some diaspora dialects (Arvanitica, 
Ukraine, Bulgaria), where the ending -sh disappeared completely from the case sys- 
tem. By contrast, a consistent formal opposition of the ablative and genitive-dative is 
observed in the old written texts. Some remnants of this situation are preserved in the 
western northern Gheg dialects, in which there is an opposition of the ablative and 
genitive-dative forms for feminine definite nouns (abl f. def. delet ‘sheep’ - gen. -dat 
f. def. delese). A dedicated ending -shit for the ablative plural definite form existed 
in the old written texts. 

3) There are fluctuations in gender assignment of individual words. Thus, the number 
of underived neuter nouns in the earliest written texts as well as in some dialects 
is bigger than it is in SA. There are also gender fluctuations between masculine 
and feminine genders. In the first written texts and in some Gheg dialects there is a 
special subclass of feminine nouns of Latin origin that correspond to SA masculine 
nouns: SA (and majority of dialects) shendet m. ‘health’ - OA shendet f. 
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4) The fluctuations in the sphere of definiteness are manifested in the varying degrees 
that the opposition “definite : indefinite” is reflected in the nominal paradigm. Thus, 
in the old texts and some dialects there was a special definite ending (-vet) for the 
genitive-dative case in the plural, which is absent in SA. By contrast, in some dialects 
the weakening of the “definite : indefinite” opposition has gone further than it did in SA. 

5) There are many dialectal variations in the means of plural formation. For example, 
the plural suffix -a is systematically used in Gheg dialects, whereas the suffix -ra is 
frequently found in northern Tosk dialects. 

6) There are differences in the form of individual endings. The following cases are 
especially relevant from the historical point of view: 

a) Many dialects as well as the old texts show the masculine definite accusative 
ending -ne without the “connecting” vowel Hu: SA mikun ‘friend (dat.)’ - dial. 
mikne. 

b) In Buzuk’s book, the masculine definite genitive -dative form is formed not only 
with the endings -itl-ut but also with the ending -(e)t, which is attached to those 
nouns whose stem ends in -e: Buzuk (i) djalet ‘boy (gen.)’. In some dialects 
and in the old texts, the masculine definite genitive -dative forms of some kin- 
ship terms are formed with the help of endings without “connecting” vowels: 
Buzuku d birt ‘son (gen.)’. 

c) In the old texts as well as in some dialects (both Tosk and Gheg) the genitive- 
dative plural ending has the form -e after the stems that end in consonants (pleq-e 
‘old man (gen.-dat. pi.)’) and -ve after the stems that end in vowels (plaka-ve 
‘old woman (gen.-dat. pi.)’). 

7) There are, finally, morphological distinctions that are due to phonetic dialectal dif- 
ferences: SA and Tosk plural endings -er, -ra - Gheg. -(e)n, -na. In Gheg dialects 
there is a compensatory apocope as a consequence of the loss of the final -e: Gheg 
pla:k ‘old woman’ - SA plake. 


Nominal system: the origin 

Albanian has radically rebuilt the PIE noun system: its declension system has been reg- 
ularized, and the “old” PIE endings have been for the most part lost. The details of this 
process cannot be reconstructed, but it can be supposed that it began in the PPA period 
(the time of an essential rebuilding of the PIE stress system in Albanian) and continued at 
least till the end of the PA period (the time when the process of unstressed vowel reduc- 
tion was still under way). 

In many cases, this radical rebuilding makes it impossible to establish a reliable histor- 
ical source of Albanian flexions. Many nominal flexions are reconstructed on the basis of 
our general ideas about the development of the PIE grammatical system (this is also true 
for the history of the verbal system). As a consequence, in many cases there are competing 
etymological hypotheses; below I will provide only the most reliable of these hypotheses. 
In those instances when there are competing proposals, the relevant bibliographical refer- 
ences are given, although in most cases I provide references only for modem overviews 
rather than for those original studies where these respective solutions were proposed. 


The indefinite declension 

In this section and below, when discussing various historical explanations I will employ 
PIE notations in the form that was used by the respective authors. 
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Masculine 

The “masculine” declension type (I and II declension types in Table 12.8) continues the 
PIE o-declension (with the -yo- subtype). In Albanian, these types encompass mascu- 
line nouns that used to belong to other declensions in PIE (that is, the consonant stems, 
M-stems, /-stems and so on). Some nouns that belonged to the PIE /-declension type man- 
ifest traces of the PA umlaut processes, which indicates that the final stage of the rebuild- 
ing of the declension system happened in the PA period (SA elb ‘barley’ < PIE *h,olb h i-). 

The flexion of the indefinite declension is deeply decayed due to reduction that 
continued to happen in the PA period (these processes affected Latin borrowings; see 
“Phonology”). The following endings for the o-declension can be reconstructed: 

nom. sg. SA -e, -0 < PIE *-os. The double reflex e/0 is conditioned by the phonetic 
shape of Albanian words, although the distribution isn’t completely clear. Orel (2000: 
233) explains the -e ending as a reflex of PIE oxytonic stress, but this explanation isn’t 
supported by the majority of specialists. 

acc. sg. SA -e, -0 < PIE *-om (see also below in the section devoted to the definite 
declension). 

gen.-datJabl. sg. SA -/, -u. (a) < PIE *-osyo (the genitive ending of o-declension) 
(Klingenschmitt 1994, Matzinger 2006: 98, Schumacher & Matzinger 2013). The Hu alter- 
nation is explained as the result of the development of unstressed o under different pho- 
netic conditions (through an intennediate stage *u). (b) < PIE *-oy (the dative ending of the 
o-declension) or *-oy/-ey (the locative ending of the o-declension; see Orel 2000: 234); see the 
overview in Sh. Demiraj 1986: 274—282; -u is a secondary, phonetically conditioned variant. 

Indefinite 

The nom. pi. endings demonstrate significant variability: 

a) The fact that these forms are characterized by various kinds of consonant alterna- 
tions (palatalizations) and vocalic alternations (umlaut) (see “Phonology” above) 
enables us to trace them back to originally pronominal ending: SA -0 < PA *-i 
(which caused palatalization and umlaut) < PIE *-oy. The morphonological alter- 
nations that emerged in the nom. pi. form later got extended analogically into other 
members of the plural paradigm. 

b) SA -e has different etymological explanations: (a) -e < PIE *-os (Sh. Demiraj 1986: 
23 1 with some reservations; Orel 2000: 235); (b) -e < PIE *-es (initially a consonant 
stem ending; Matzinger 2006: 102); and (c) -e < PIE *-a (the unstressed ending of 
the PIE collective; Desnickaja 1976: 37-38). 

c) SA -e. (a) < PIE *-os, *-oy in stressed position (Orel 2000: 235); (b) < PIE *-ya (the 
ending of the PIE collective; Desnickaja 1976: 37-38); and (c) < PIE *-ew-es, which 
is a development of the PIE u-stems (Klingenschmitt 1994: 25, Matzinger 2006: 
102). Extension by analogy from the feminine declension is also plausible. 

d) SA -a. (a) This ending arose due to the analogical influence of feminine nouns (Orel 
2000: 236; Topalli 2011: 242); (b) < PIE *-a (the stressed ending of the PIE collective; 
Desnickaja 1976: 38-39); (c) contamination of the endings -e and -e (Pedersen 1895: 
10, Matzinger 2006: 102; cf. similar contractions in pronominal clitics, see below). 

The plural suffixes -ra/-na, -er/-(e)n, -inj and some others are in all probability the 
result of the reanalysis of the old singular -n stems and their further concatenation with 
various plural endings. 

The acc. pi. ending is always identical with the nom. pi. ending. Somewhat conven- 
tionally, it is possible to reconstruct the PIE *-ons > SA -0 (probably, sometimes -e) 
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development and later analogical extensions of various types of nom. pi. endings into the 
ace. pi. form (Orel 2000: 236). 

The gen.-dat. pi. -e < PIE *-om. The v component in the ending -ve has emerged as a 
means to avoid hiatus and was extended analogically from those plural stems that termi- 
nate in vowels onto stems of other types. 

The abl. pi. -sh < PIE *-oy-su. Unstressed -u developed in PA in -i; see OA definite abl. 
pi. -shit (see below, Schumacher & Matzinger 2013: 263). 

Feminine 

Feminine declension types also underwent a very serious unifonnation, mostly using the 
PIE *-a (*-eh 2 ) declension type as a model. Other types of PIE feminine declensions were 
included in the -a declension. 

nom. sg. endings The most widespread feminine stem-final -e is the reflex of PIE *-eh 2 . 
SA feminine nouns with unstressed final -e are interpreted as reflecting the PIE 
-ih 1 /-yeh 1 - stems (“c?evf-type”: SA hije ‘shadow’ < PIE *ski-yeh,); see Matzinger 
2006: 96. 

acc. sg. SA -e, (-e) < PIE *-eh,m (see also below, the section devoted to the definite 
declension). 

gen.-dat. /abl. sg. SA -e can be traced back to alternative sources: (a) the PIE dative 
ending *-eh 2 ey (Orel 2000: 238) or the PIE locative ending *-eh 2 i (Sh. Demiraj 
1986: 285); and (b) the analogical development from the masculine gen.-dat./abl. 
sg. ending, so we may reconstruct something like *-ayya < *-asya (Klingenschmitt 
1994: 223, Matzinger 2006: 99-100). 

Feminine nom. pi. endings are characterized by great variability, although to a lesser 
degree than masculine nom. pi. endings: 

a) SA -e < PIE *-eh,cs. 

b) SA -e < PIE *-ih,es. Desnickaja (1976: 40-41) considers both sg. and pi. feminine -e 
forms as the continuation of the PIE collective in *-ya. 

c) SA -a. (a) < PIE stressed *-as (*-eh 2 es) (Orel 2000: 239); and (b) contamination of 
the endings -e and -e (Matzinger 2006: 103). 

The acc. pi. ending is always identical with the nom. pi. ending. 

The gen.-dat. pi. SA -e as well as abl. pi. -sh are identical to the masculine ending. 
They result from the development of PIE *-eh,om (or *-asom; see Klingenschmitt 1994: 
224) and *-eh 2 su endings, respectively. 

Both in the masculine and feminine gender, the plural endings of oblique cases are 
agglutinated to the plural stem, which simultaneously functions as the nominative plural 
form (after some types of stems that end in a consonant, a connecting vowel is used). It 
should be stressed that the morphonological alternations that first developed in the indef- 
inite nominative plural form of masculine nouns later extended into all members of the 
plural paradigm. 


Neuter 

The only ending that is different from the masculine ones in the singular is nom. -acc. of 
the definite conjugation (see below). According to Pedersen (1897: 288), the nom. -acc. 
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sg. indef. SA -e/0 of “concrete” neuter nouns goes back to PIE *-a (*-eh 2 ) (nom.-acc. pi. 
of PIE neutra). 

On the origin of some irregular plural forms see Sh. Demiraj 1986: 244-248; for the 
stress shift in some plural types see “Stress” above). See also an interesting example of 
umlaut in the gen.-dat./abl. sg. of at ‘father’: et(i) (< *ati). 


The definite declension 

The definite nominal forms originated from concatenation of indefinite forms with the 
corresponding case forms of the postpositive definite article, which in its turn evolved 
from the demonstrative pronoun. The postpositive article is one of the main balkanisms 
of Albanian that is shared with Balkan Romance and Balkan Slavic. The definite declen- 
sion must have been formed in the PA period, it is very likely that the time of this process 
coincided with the period of intensive (proto )Albanian-(proto)Romanian language con- 
tacts (in this case Albanian must have been the source language for this shared feature). 

There are two main etymological hypotheses about the origin of demonstrative pro- 
nouns. According to the first of them, they evolved from an amalgam of three different 
PIE pronouns, namely, *is, *ih, (*ya) for nom. m and f. sg, *ki-/*ky- for gen/dat/abl. f. sg. 
and *so, *seh 2 , *tod for the remaining forms (Meyer 1892: 79, Orel 2000: 243). Accord- 
ing to the second hypothesis, all Albanian demonstrative pronominal originated from PIE 
*so, *seh 2 , *tod (Pedersen 1900: 311-315, Matzinger 2006: 109-110). 

Masculine 

nom. sg. SA -i, -u < PIE *-os + *is (Meyer 1892: 40, Orel 2000: 246); (b) PIE *-os + 
*so (Pedersen 1894: 251, Klingenschmitt 1994: 224). Both etymologies assume that the 
phonetically conditioned distribution of the forms in -i and -u arose later than these forms 
as such; in terms of conditions this distribution is the same as the distribution of formally 
similar endings of the gen.-dat./abl. sg. indefinite (see above). 

acc. sg. -{i/u)n < PIE *-om + *tom. The Hu evolved under analogical influence from 
the gen.-dat. sg. 

gen.-dat./abl. sg. SA -it, ut are the result of the concatenation of the forms of the indef- 
inite declension with the cased forms of the demonstrative pronoun (that is, *tosyo, *toy 
or *tey according to alternative etymological hypotheses; see above). The old and dialec- 
tal forms in -et, -t can be explained by accentological considerations (Matzinger 2006: 
98). 

nom. pi. SA plural stem + -t(e) < PIE *toy. 
acc. pi. SA plural stem + -t(e) < PIE *tons. 

gen.-dat. pi. SA -ve is identical with the corresponding indefinite ending (see above). 
The older ending -(e/ve)-t could have evolved from PIE *tom. 

In the old SA ending of the abl. pi. -shit, -t could have evolved from PIE *toysu; on the 
change unstressed -u- > -i- see above (“Indefinite Declension”). 

Feminine 

nom. sg. SA -a: (a) < PIE *-a- + *ya (Orel 2000: 247); (b) < PIE *-a- + *sa (Klingen- 
schmitt 1994: 223) 

acc. sg. SA -(V)» < PIE *-(V)m + *tam 

In the gen.-dat. sg. SA -(’V)s(e)-s(e) could have evolved from PIE *tesyasyo (according 
to Matzinger 2006: 99) or *kyay (according to Orel 2000: 247). 
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The old and dialectal abl. sg. ending -et evolved due to analogical influence from the 
masculine declension type. 

From a synchronic point of view singular oblique feminine forms of the definite 
declension are, unlike masculine forms, merely combinations of the feminine indefinite 
stem with the case forms of the definite article. 

nom. and acc. pi. SA: plural stem + -t< PIE * tas. 

gen.-dat. pi. SA -ve: see above (“Masculine”); older -t could go back to *teh,om (or 
*tasom; see Matzinger 2006: 100). 

abl. pi. SA -shit: see above (“Masculine”); -t could arise according to the masculine 
model (Klingenschmitt 1994: 224). 

It is possible that feminine plural definite declension in general has been restructured 
according to the masculine pattern. 

Neuter 

nom.-acc. sg. -O', e)t(e) < PIE *-a- + *ta (Pedersen 1897: 288) or PIE *-od + *tod (see 
Matzinger 2006: 99). 

Adjective 

There are two main classes of adjectives in SA, which differ in their morphological 
features as well as in the number of agreeing categories they possess. The first class 
(henceforth articulated adjectives) comprises those adjectives that are always used with 
a special prepositive clitic formant ( article in the Albanian grammatical tradition) that 
agrees with the head noun in gender, case and number and is formally identical with the 
“article” of the genitive noun form (see above). No other grammatical formants can be 
placed between the article and the adjective. 

The second class (henceforth unarticulated adjectives ) includes those adjectives that 
have no such “article” and agree with the head in gender and number only by means of 
their endings. 

All Albanian adjectives are strictly divided into these two classes. 

The first class, articulated adjectives, includes underived adjectives of Indo-European 
origin as well as Latin and Slavic borrowings, adjectives derived by some suffixes (-/(e), 
-me, -shem), adjectives with the negative prefix pa-, composite lexemes that have an 
adverb as their first part, and, finally, deverbal adjectives (participles used with the prep- 
ositive “article”). 

The second class, unarticulated adjectives, includes a small group of old words that can 
be used as either nouns or adjectives (plak ‘old man’ and ‘old’, trim ‘brave man, warrior’ 
and ‘brave’, etc.), the majority of derived adjectives (many of them are also homonymous 
with nouns), the majority of composite adjectives, and later borrowings. 

The normal word order in Albanian is “noun + adjective”. The reverse order is possi- 
ble, too, but often has an additional expressive value. The majority of unarticulated adjec- 
tives cannot be used in the construction with the inverted order (composite adjectives 
make an exception). If an adjective (or an adjective-like pronoun or an ordinal numeral) 
occupies the first position in the noun phrase, it declines as a noun, whereas the noun 
loses its ability to mark the case in this construction (see the table). This rule according 
to which it is always the first component of the noun phrase that get case inflection is a 
characteristic that Albanian shares with Romanian and Bulgaro-Macedonian. 

There are no traces of synthetic formants of degrees of comparison in Albanian. Both 
comparative and superlative meanings of adjectives and adverbs are expressed by the 
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construction me (Gheg ma) ‘more’ (< PIE *meh 2 -is) + adjective or adverb. In some con- 
structions nouns can also have degrees of comparison. 


The declination of adjectives 

TABLE 12.10 ARTICULATED ADJECTIVES (UNMARKED ORDER) 



singular 

plural 


masculine 

feminine 

masculine 

feminine 

nom. 

djali i mire 

vajza e mire 

djemte e* mire 

vajzat e* mira 

acc. 

djalin e* mire 

vajzen e*mire 

djemte e* mire 

vajzat e* mira 

gen./dat./abl. 

djalit te mire 

vajzes se*mire 

djemve te mire 

vajzave te mira 


a) The table reflects the most widespread declination type of articulated adjectives. Besides this type, 
there are (a) adjectives with suffixes -shem, ~(e)m, -me, which have a morphological opposition of 
the forms of the masculine (both sg. and pi.: i mesem, temesem ‘middle’) and feminine (both sg. and 
pi.: e mesme, te mesme ‘middle’) gender; (b) a small number of adjectives that do not change their 
endings at all, and (c) some irregular adjectives. 

b) The case forms of the “article” in noun phrases with adjectives do not differ from those forms of 
articles that are used in noun phrases with the genitive (see above). 

c) The forms given in the table are used when the head noun is in the definite form, and the adjective 
occupies an adjacent position and functions as an attribute (not as a predicate). Whenever any of 
these conditions is violated, those forms of articles that are marked with the asterisk in the table are 
replaced by the article form te ( vajzen me te mire ‘the best girl (acc.)’, and so on). 

d) Neuter nouns in the singular require the article form e for the nom./acc. and te for oblique cases 
when the noun phrase has an attributive dependent and the article occupies the position immediately 
after the definite head noun. Otherwise, the form te is used for all cases. In the plural, the adjectives 
change according to the feminine pattern. 


TABLE 12.11 ARTICULATED ADJECTIVES (INVERTED ORDER) 



singular 

plural 


masculine 

feminine 

masculine 

feminine 

nom. 

i miri djale 

e mira vajze 

te miret djem 

te mirat vajza 

acc. 

te mirin djale 

te miren vajze 

te miret djem 

te mirat vajza 

gen./dat./abl. 

te mirit djale 

se mires vajze 

te mireve djem 

te mirave vajza 


Substantivized articulated adjectives decline identically to adjectives in the noun 
phrases with inverted word order. 

Unarticulated adjectives agree with the head nouns only in gender and number. They 
are divided into several inflectional classes. In the singular, the most widespread endings 
are zero for masculine and -e for feminine gender. In the plural, the most typical endings 
are -e for masculine and -e for feminine gender. 

Adjectives: dialectal differences and historical background 

a) In the old written monuments as well as in many dialects the “articles” that func- 
tion in noun phrases with adjectival or genitival dependents are not used after the 
dative-genitive definite forms of nouns, cf. njeri-ut m/reperson-GEN./DAT.SG.DEF. 
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good ‘good person (gen./dat.)’. In some dialects, these articles are not used after 
definite nominative forms, too. It remains unclear whether this feature reflects an 
older state or is an innovation. 

b) Tosk dialects do not have deverbal adjectives with the suffix -shem, which have the 
meaning of the possibility or ability (i paharrueshem ‘unforgettable’). These adjec- 
tives exist both in Gheg dialects and in SA. 

Some Albanian adjectives are etymologically related to PIE adjectives, other adjectives 
are Albanian derivatives (deverbal or, less often, denominal), and there is a significant 
number of borrowed adjectives. 

As for the origin of the grammatical layout of Albanian adjectives, a common opin- 
ion is that the prepositional article (in noun phrases with both adjectives and genitives) 
evolved from the PIE demonstrative pronoun and is thus etymologically identical with 
the postpositive definite article. Particular details of the grammaticalization of these two 
sets of morphemes remain a subject of a long-standing debate. Generally speaking, the 
origin of Albanian articulated adjectives is often compared with the development of 
Slavic and Baltic “long adjectives”. Unarticulated adjectives could evolve from apposi- 
tive constructions of the type “noun + noun”. 

Articulated adjectives have an adverbial function when used without article. 

There is a large set of prepositions in Albanian; some of them are primary, and others 
are homonymous with adverbs (for more information on prepositions, see “Syntax”). 

Pronouns 

Personal and demonstrative pronouns 

TABLE 12.12 DECLENSION OF THE PERSONAL AND DEMONSTRATIVE PRONOUNS 


singular plural 



i 

2 

3 

1 

2 

3 




m 

f. 



m 

f 

nom. 

une 

ti 

ai 

ajo 

ne 

ju 

ata 

ato 

acc. 

mua (me) 

ty(te) 

ate (e) 

ne (na) 

ju (ju) 

ata (i) 

ato (i) 

(gen.)/dat. 

mua (me) 1 

tv(te) 

atij (!) 

asaj (i) 

neve (na) 

ju (ju) 

atyre (u) 

abl. 

meje 

teje 

atij 

asaj 

nesh 

jush 

atyre 


1 The (gen.)/dat. form of the personal pronoun is used very rarely in possessive constructions (only if the 
pronoun has its own attribute), as there are dedicated possessive pronouns that are normally used in the 
possessive construction (see below). 


The so-called weak (clitic) forms of personal pronouns are given in parentheses (on the 
use of weak forms see “Syntax”). 

The nominative neuter form ata is used very rarely; other neuter case forms don’t differ 
from masculine forms. 

The 3rd person personal pronouns are used in the function of (or are homonymous to) 
distal demonstrative pronouns. Proximal demonstrative pronouns are formed by means of 
the formant k(e) (instead of a-). The form of nom. m. sg. kv has a different vocalic pattern, 
while other proximal forms are distinguished from distal pronouns exclusively by the 
initial formant, cf. k-jo, ke-te and so on. 

Dialectal differences in the domain of personal and demonstrative pronouns are rather 
insignificant. An older (see below) form u (instead une) for the nom. 1 sg. pronoun is used 
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in many dialects as well as in the old texts. For the nom. 1 pi. and 2 pi. forms, an expan- 
sion of oblique forms {neve, na,juve) into the nominative slot is observed in some dia- 
lects, whereas na is, most likely, the original nominative form. An older form ay (instead 
ai) is used in the eastern part of the northern Tosk dialects. There are also some dialectal 
variants of the oblique case forms of the 3 sg. and 3 pi. pronouns. 


The origin 

une < PIE *swom (La Piana 1949: 69, Orel 2000: 241) or < PIE *h,egoH (Bopp 1854: 
504—505, Matzinger 2006: 107); ne is a kind of pronominal extension, cf. Gr. -vai, 
-vf). oblique forms of the 1 sg. pronoun developed from the PIE stem *me- ( mua < 
PIE *mem, meje < PIE *mey). 

ti < PIE *tuH, oblique forms reflect the same stem ( ty < PIE *twem, teje < PIE *tey, 
or under the analogical influence of meje). 

na < PIE *nos; ne < PIE *noHs; neve, nesh (as well as juve,jush) could have devel- 
oped under the influence from the nominal paradigm. 

ju < PIE *wos 

The deictic elements k(e)- and a- of the demonstrative (and 3rd person personal) pro- 
nouns must have been agglutinated to the demonstrative pronominal elements no earlier 
than the end of the PA period. The element a- could be etymologically related to the PIE 
deictic element *aw-, and k(e)- to the PIE element *ko-. On the origin of demonstrative 
pronouns see above (“The Definite Declension”). 

Clitic forms of personal pronouns evolved through the process of reduction of full 
forms. Contracted forms of dative and accusative pronouns should be mentioned: ma 
< me (1 sg. dat.) + e (3 sg. ace.), and so on. These forms must have emerged in the 
PA period, because this type of sandhi phenomena is also encountered in some nominal 
forms but is not a productive phonological alternation in SA. 

Albanian has rather complex system of possessive pronouns. The first and second per- 
son possessive pronouns agree with the possessee in gender, number and case and reflect 
the person and number of the possessor. Most of the 1st and 2nd person possessive forms 
consist of a demonstrative element that is etymologically identical to Albanian article 
formants and a pronominal element in the first or second person, respectively. These latter 
elements originate from the PIE personal or possessive pronouns, although there are dif- 
ferent opinions with respect to what kind of grammatical forms were their PIE etymons. 
Possessive pronouns of the 1 and 2 sg. of the possessor have analytical forms when they 
are used with the plural possessee. These analytical forms contain the prepositional arti- 
cle (see “Adjective” above) and reflect a later grammaticalization process. 

The following table illustrates the declension pattern of the 1 sg. possessive pronoun: 


TABLE 12.13 DECLENSION OF THE POSSESSIVE PRONOUNS 



singular of possessee 

plural of possessee 


m. 

f. 

m. 

f. 

nom. 

libri im 

dora ime 

librat e mi 

duart e mia 

acc. 

librin tim 

doren time 

librat e mi 

duart e mia 

gen.-dat./abl. 

librit tim 

dores sime 

librave te mi 

duarve te mia 


libri im ‘my book’, dora ime ‘my hand’ 
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The 3rd person possessives are analytical constructions of the type “prepositional arti- 
cle + demonstrative pronoun without deictic elements”; these constructions are differen- 
tiated according to the gender of the possessor: (i tij ‘his (masculine possessee)’, i saj ‘her 
(masculine possessee)’. These pronouns decline as adjectives. 

Basic word order in the possessive phrase is “noun + possessive pronoun”. 

There is a specific (probably archaic) construction that is used for the expression of 
the 3rd person possessive relations with some old kinship terms, inherited from PIE or 
borrowed from Latin, such as ate ‘father’, erne ‘mother’, vella ‘brother’, moter ‘sister’ 
and some others, as well as with the word zot ‘lord’: i ati ‘his, her, their father’, e motra 
‘his, her, their sister’, te vellezerit ‘his, her, their brothers’. In this construction, the ele- 
ment that is homonymous to the prepositive article reflects the gender and number of the 
possessee only. This construction alternates with the regular possessive construction. The 
same kinship terms also allow the prepositive use of personal pronouns of 1st and 2nd 
person: im ate ‘my father’ along with ati im ; in this case the noun should be used in the 
indefinite form. 

Albanian has a special reflexive pronoun vetja ‘himself’, which inflects as a noun. 
It marks coreference with an antecedent of any person, number and gender. The 
possessive-reflexive pronoun (i, e) vet exists only in Gheg dialects and in SA; it remains 
unclear whether it was ever a common-Albanian form. Etymologically, vetja goes back 
to PIE *swe with extension *-t- (for the morphological means of the expression of reflex- 
ive meaning, see below, “Verb”). Albanian has two reciprocal pronouns. The first, njeri 
tjetrin, lit. ‘one another. ACC.’ (and its other case forms), follows the widespread model 
typical of the Balkan languages. The second construction, shoku shokun (lit. ‘friend 
friend. ACC.’, shok < Lat. socius) (and its other case forms as well as phonetic vari- 
ants), could have arisen under Slavic influence (cf. Old Slavic drugb druga, lit. ‘another 
another.ACC.’, but homonymous to ‘friend friend.ACC.)’. For the morphological means 
of the expression of the reciprocal meaning, see below, “Verb”. 

Albanian has several pronouns that are derived from PIE *k w o-/k w i- and have both 
interrogative and relative functions. Among these pronouns are kush ‘who’ < PIE 
*k w os-so-, ku ‘where’, kur ‘when’, se ‘what’ and qe/ge ‘what’ (for the use of the last 
two pronouns, see also “Syntax”). The non-inflected qe/ge is used as a general relative 
pronoun; in modern SA it is being replaced by the declinable pronoun (i, e) cili (<*t-sili) 
‘which, who’, which developed from the same PIE *k w i-. 


Numerals 

Cardinal numerals 


TABLE 12.14 CARDINAL NUMERALS 


1 

nje' 

< PIE *sem-/sm- or < PIE *oyno- 

2 

dy 2 

< PIE *dwo-/duwo- 

3 

tre (m.), tri (f.) 

tre < PIE *treyes, tri < *trih. 

4 

hater 

< PIE *k w a tur-/k w 3 twor- 

5 

pese 3 

<PIE *penk w e 

6 

gjashte 

<PIE *seks-t- 

7 

shtate 

<PIE *septm-t- 

8 

tete 

<PIE *oktoh,-t- 

9 

nente 4 

<PIE *newn-t- 

10 

dhjete 

<PIE *dekmt- 
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Notes 

1 The variant nji is characteristic of Gheg dialects. Cardinal numerals from ' 1 ’ to ‘4' are inflected for 
case; nje also has the opposition of definite and indefinite forms. 

2 There is a special feminine form dy: in Gheg and southern Tosk dialects. 

3 pes in Gheg dialects. 

4 nan(d) in Gheg dialects. 


The composition of cardinal numerals from ‘11’ to ‘19’ follows the model typical of 
Slavic languages (including Slavic languages of the Balkans), ‘x at ten’: njembedhjete 
‘11’, dymbedhjete ‘12’ and so on. 

The cardinal numerals for ‘30’ and from ‘50’ to ‘90’ are formed according to the model 
‘x-ten’: tridhjete ‘ 30’, pesedhjete ‘50’and so on. The cardinal numerals for ‘20’ ( njezet ) 
and ‘40’ ( dvzet ) are formed according to the model ‘x- twenty’, where -zet < PIE *wlkm- 
tihj. In most Gheg dialects kat(e)rdhjet(e) is used for ‘40’; in some Tosk dialects trizet is 
used for ‘60’, and katerzet for ‘80’. 

The terms for ‘100’ and ‘1000’ are Latin borrowings: ( nje)qind < Lat. centum, mije < 
Lat. mTlia. 


Ordinal numerals 

Ordinal numerals are, morphologically speaking, articulated adjectives. The ordinal 
numerals from ‘2nd’ to ‘5th’ (as well as composite ordinal numbers that end on these 
digits and ‘100th’ and ‘1000th’) are formed by the means of the suffix -te: i, e dvte ‘sec- 
ond’, and so on. Ordinal numbers from ‘6th’ to ‘10th’ (and composite ordinal numbers 
that end on these digits) are differentiated from the corresponding cardinal numbers only 
by the presence of the prepositive article: i, e gjashte ‘sixth’ (-te is here a result of the 
simplification of the sequence “-te of the stem + suffix —te”). Ordinal number i, e pare 
‘first’ < PIE *prH-wo-. 


Verb system 

General remarks 

The Albanian verb system is very complex with respect to both the number of catego- 
ries and the number of conjugational types and subtypes. Grammatical meanings are 
expressed both synthetically and analytically. In the synthetic forms, vocalic and conso- 
nantal alternations play a significant role along with affixation. Analytical constructions 
are formed by means of conjugated auxiliary verbs and clitic particles. 

The verb has the following grammatical categories: person, number, tense, aspect 
(partly overlapping with tense), mood, evidentiality (admirative) and voice. 

The person-number (1st, 2nd and 3rd person, singular and plural) distinctions are 
expressed by the verb endings and, in some conjugational types, by vocalic and conso- 
nantal alternations. Different tense -aspect-mood forms and sometimes different conjuga- 
tional types often have different sets of person-number endings. 

The organization of TAM categories is very complex, because all the moods (except 
imperative) have temporal distinctions. Some forms that are traditionally described as 
“tenses” are in fact opposed to each other in terms of their aspectual meaning (first of all, 
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the aorist and imperfect). Albanian has indicative, subjunctive, conditional, optative and 
imperative moods. 

Indicative has the richest temporal system. It includes three synthetic tenses, which 
form the bulk of the Albanian temporal system: present, aorist and imperfect. Present 
and aorist are differentiated by their stems (not for all verbs) and paradigms of endings. 
The forms of the imperfect are derived from the present stem (or from one of its variants 
in cases of morphophonemic alternations) and have a special set of endings; historically, 
imperfect endings in the active voice are similar to aorist endings. Semantically, the aorist 
is close to a prototypical perfective past, and the imperfect to a prototypical imperfective 
past as they feature in many Indo-European languages. The perfect subsystem has forms 
of the perfect and two pluperfects. In the active voice these are expressed by combina- 
tions of the auxiliary verb kam ‘to have’ in the corresponding tense form (present for the 
perfect, imperfect and aorist for the two pluperfects) and the participle of the main verb. 
The perfect has a bundle of meanings characteristic of this aspectual form from the typo- 
logical point of view, such as perfect of the current relevance, experiential perfect and so 
on. The two pluperfects occupy the semantic niche that is typical of the traditional pluper- 
fects: relative tense precedence or absolute temporal distance. The difference between the 
two pluperfects has not been studied in sufficient detail, but the aorist pluperfect is clearly 
a rather rare form. Albanian has also a set of so-called supercompound forms, which are 
characterized by the use of the compound {perfect or pluperfect) forms of the auxiliary 
verbs. These supercompound forms have restricted dialectal distributions but are repre- 
sented, although very poorly, in standard texts. Semantically, they express various shades 
of temporal remoteness. The future subsystem is formed in SA by the means of a special 
future particle do and the subjunctive of the main verb. This subsystem includes future 
rfdo + present subjunctive”), future in the past rfdo + imperfect subjunctive”), future 
perfect (“ do + perfect subjunctive”) and future perfect in the past (“do + pluperfect sub- 
junctive”). Albanian also has a “parallel” future subsystem with an additional necessitive 
semantic shade; these forms are formed by means of the verb ‘to have’ (in the present, 
imperfect and aorist) and the so-called new infinitive (see below). 

SA also has two additional analytical constructions with aspectual meanings. The first 
construction consists of jam ‘to be’ (present, imperfect or aorist) + analytical gerund 
(see below) and has a progressive meaning: jam duke punuar be. 1SG. PRES. GERUND 
work.PTCR ‘I am working’. The second construction consists of the particle po and the 
present or imperfect fonn of the verb; it expresses the progressive meaning and relates the 
time of action to a certain reference time, either the moment of speech or a certain interval 
of time in the past: po punoj PROG. work. 1SG. PRES. ‘I am working (right now)’. 

The subjunctive mood is formed with the special particle te (which is derived from 
an old conjunction?) and synthetic or analytical verbal forms that partly coincide with 
those of the indicative mood; for the absolute majority of verbs only the 2 sg. and 3 sg. 
subjunctive present tense forms of the active voice are different from the corresponding 
forms of the indicative mood. The subjunctive is used in complement clauses as an analog 
of the lacking infinitive (see below), in the main clauses in imperative and optative func- 
tions; it is also used in various hypothetical constructions (see Breu 2010: 465 — 466). The 
subjunctive has present, imperfect, perfect and (imperfect) pluperfect forms. In the old 
texts the subjunctive is used without te after the negation particle mos (often) and after 
the possibility particle munde, mune (nearly always). 

The conditional is formed by combining the subjunctive with the prepositive particle 
do. The main context of the use of the conditional is the apodosis of hypothetical periods 



ALBANIAN 587 


(see Breu 2010: 467). The conditional has present (“ do + imperfect subjunctive”, homon- 
ymous to the future in the past) and past (“r/o + pluperfect subjunctive”, homonymous to 
the future perfect in the past) forms; thus, the paradigm of forms with do can be viewed 
as a single future-conditional system. 

Additionally, there are some analytical constructions formed with various particles and 
subjunctive forms. They have various modal meanings (jussive, possibilitive and so on), 
but their status as real grammatical moods is questionable. 

The optative mood has present and perfect forms. The present optative is a synthetic 
form that has a special stem, which in most classes of verbs is derived from the aorist 
stem, and a special set of endings. The perfect optative is an analytical form, consisting 
of the optative of the auxiliary and a participle of the main verb. In SA, the optative is 
mostly used in various fossilized or semi-fossilized expressions (blesses, curses and so 
on); in the optative function proper, which is restricted and stylistically marked; and in 
the protasis of hypothetical periods. 

The imperative mood has two synthetic forms, the 2 sg. and the 2 pi. form. 

The basic meaning of the SA admirative is mirative, but it also has a secondary com- 
mentative function. Admirative distinguishes at least two moods, indicative and subjunc- 
tive (the forms of the conditional admirative are very rare), and some tense forms. The 
synthetic forms of the present and imperfect indicative admirative emerged through amal- 
gamation of inverted perfect and pluperfect forms, respectively, cf. bekam [do.ADM. 
PRES.1SG.] < be(re) kam [do. PART. have.PRES.lSG.]. From a synchronic point of view 
they have a special stem and a special set of endings. The perfect and pluperfect admira- 
tive forms are formed by the admirative forms of the auxiliary verb and the participle of 
the main verb. The subjunctive-admirative forms (present, imperfect and pluperfect) are 
formed with the particle te and corresponding admirative verbal forms. 

There are two voices in Albanian, active and non-active. Albanian non-active forms 
can have different semantic interpretations: passive, reflexive, reciprocal, anti-causative 
etc. Passives are regular (inflectional) voice forms of transitive verbs, whereas other func- 
tions are valency-reducing derivations of active verbs; some of these derived verbs sig- 
nificantly deviate from their active counterparts in terms of their meaning. There is also a 
rather small number of semantically active deponent verbs in Albanian. 

The formal structure of the Albanian non-active voice system is extraordinarily com- 
plex. There are three morphological devices that can signal non-activeness. At least in 
SA, these three devices are rigidly distributed among the tense -modal forms of verbs. 

a) A set of special endings; this device is employed for the present, imperfect and future 
indicative as well as for the present and imperfect subjunctive and present of the 
conditional (= future in the past indicative). 

b) A construction that consists of the particle u and an active verb form; this device is 
employed for the aorist indicative, present and imperfect admirative, present opta- 
tive and imperative; besides, for most Albanian verbs the 3 sg. aorist form in this 
construction is formally distinct from the corresponding active form. 

c) A corresponding finite form of the verb jam ‘be’ + participle; this device is employed 
for the forms of the perfect system. 

In modem standard Albanian, these three types of non-active forms can express all of the 
meanings that belong to the non-active semantic spectrum as outlined above: reflexive, 
passive, anti-causative and so on. 
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Non-finite forms 

Albanian has a single participle formed from both transitive and intransitive verbs. 
The participle is used in the analytical verbal constructions of the “perfect system’’ and 
in other analytical constructions (see below), as a predicative attribute and in relative 
constructions. On the adjectivization of participles, see above (“Adjective”). The (non- 
adjectivized) participle is a non-inflected form. Besides, there are some analytical non- 
finite constructions that are formed by means of a participle. The most widespread among 
these constructions are the following: 

• duke + participle (“gerund”). This construction chiefly expresses the meaning of 
simultaneity. It has a non-active form ( duke + u + participle) and a rare perfect active 
(duke + pase + participle) and passive ( duke + qene + participle) forms. 

• pa + participle (“privative”). This construction behaves syntactically just like the ger- 
und and has the same voice and temporal forms as the gerund. 

• per + te + participle (“new infinitive”). This construction is mainly used with the final 
(= purpose) meaning; it has a non-active counterpart. 

• me + te + participle (“absolutive”). This construction’s main meaning is temporal 
precedence. The new infinitive and the absolutive are in fact formed with the help of 
the deverbal noun, which is in turn derived from the participle. 

• The ablative of the deverbal nouns is used in a function that is close to that of the 
infinitive after the phasal verbs. This fonn is mostly used after verbs that denote ter- 
mination or cancellation of the action: mbaroi se fol-ur-i finished ART.ABL. speak- 
PART.-ABL. ‘s/he stopped speaking’. 


Conjugational classes 

There is no uniform and commonly accepted system for the classification of the numer- 
ous SA conjugational classes. The majority of the classification attempts (see, first of all, 
Boretzky 1977, Buchholtz & Fiedler 1987) take as a basis for the first-level classification 
the form of the present stem and, accordingly, the characteristics of the conjugation in the 
present (for a different approach, see Schumacher & Matzinger 2013). 

Under this approach, three major classes may be distinguished: (a) the verbs with the 
stem ending in a vowel that take endings in the singular present indicative, (b) the verbs 
with the stem ending in a consonant, and (c) the verbs with the stem ending in a vowel 
that do not take endings in the singular present indicative (this class is smallish). Besides 
these classes, there are some verbs that can be synchronically viewed as exceptions. 

Further classification of the conjugational subclasses is based both on the peculiarities 
of the conjugation in present (first of all on the consonantal and vocalic alternations) and 
on the relation between the present and aorist stems; sometimes other verbal forms - 
imperative, participle and so on - are also taken into account. The present and the aorist 
stem types are two subsystems that are connected to each other by many-to-many rela- 
tions. As a consequence, an independent classification based on the fonn of the aoristic 
rather than the present stem can also be elaborated (see Boretzky 1977). The vocalic and 
consonantal alternations are also independent of the verb class as established by the stem 
formation processes, although some kinds of alternations as well as some types of the 
aorist stems are not encountered in certain classes as defined in tenns of present stem 
formation. 
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The only productive conjugational subtype of modem SAis the 1st class with the stem 
ending in -o (type punoj). 

Table 12.15 summarizes examples of major conjugational classes. 

TABLE 12.15 VERB CONJUGATION (THE MAJOR CONJUGATIONAL CLASSES) 

l. punoj 'to work’ II. kap' ‘to seize’ ill. /;/ 'to drink’ 


Active 


Indicative present 

1 sg. 

punoj 

kap 

pi 


2 sg. 

punon 

kap 

pi 


3 sg. 

punon 

kap 

pi 


1 Pi 

punojme 

kapim 

pime 


2 pi. 

punoni 

kapni 

pini 


3 pi. 

punojne 

kapin 

pine 

Indicative imperfect 

1 sg. 

punoja 

kapja 

pija 


2 sg. 

punoje 

kapje 

pije 


3 sg. 

punonte 

kapte 

pinte 


lpl. 

punonim 

kapnim 

pinim 


2 pi. 

punonit 

kapnit 

pinit 


3 pi. 

punonin 

kapnin 

pinin 

Indicative aorist 

1 sg. 

punova 

kapa 

piva 


2 sg. 

punove 

kape 

pive 


3 sg. 2 

punoi 

kapi 

piu 


lpl. 

punuam 

kapem 

pime 


2 pi. 

punuat 

kapet 

pite 


3 pi. 

punuan 

kapen 

pine 

Subjunctive Present 3 

2 sg. 

te punosh 

te kapesh 

te pish 


3 sg. 

te punoje 

te kape 

te pije 

Optative present 

1 sg. 

punofsha 

kapsha 

pifsha 


2 sg. 

punofsh 

kapsh 

pifsh 


3 sg. 

punofte 

kapte 

pifte 


lpl. 

pun of shim 

kap shim 

pifshim 


2 pi. 

punofshit 

kapshit 

pifshit 


3 pi. 

punofshin 

kap shin 

pifshin 

Admirative present 

1 sg. 

punuakam 4 

kapkam 

pikam 


2 sg. 

punuake 

kapke 

pike 


3 sg. 

punuaka 

kapka 

pika 


lpl. 

punuakemi 

kapkemi 

pikemi 


2 pi. 

punuakeni 

kapkeni 

pikeni 


3 pi. 

punuakan 

kapkan 

pikan 

Admirative imperfect 

1 sg. 

punuakesha 4 

kapkesha 

pikesha 


2 sg. 

punuakeshe 

kapkeshe 

pikes he 


3 sg. 

punuakesh 

kapkesh 

pikesh 


lpl. 

punuakeshim 

kapkeshim 

pikes him 


2 pi. 

punuakeshit 

kapkeshit 

pikeshit 


3 pi. 

punuakeshin 

kapkeshin 

pikeshin 

Imperative 

2 sg. 

puno 

kap 

pi 

Non-active 

2 pi. 

punoni 

kapni 

pini 

Indicative present 

1 sg. 

punohem 

kapem 

pihem 


2 sg. 

punohesh 

kapesh 

pihesh 


3 sg. 

punohet 

kapet 

pihet 


lpl. 

punohemi 

kapemi 

pihemi 


2 pi. 

punoheni 

kapeni 

piheni 


3 pi. 

punohen 

kapen 

pi hen 
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Indicative imperfect 

1 sg. 

punohesha 

kapesha 

pihesha 


2 sg. 

punoheshe 

kapeshe 

piheshe 


3 sg. 

punohej 

kapej 

pihej 


1 Pi 

punoheshim 

kapeshim 

piheshim 


2 pi. 

punoheshit 

kapeshit 

piheshit 


3 pi. 

punoheshin 

kapeshin 

piheshin 

Indicative aorist 5 

3 sg. 

u punua 

u kap 

u pi 

Participle 


punuar 

Kapur 

pire 


In accordance with the Albanian lexicographic tradition, I use the 1 sg. indicative present form as the 
citation form. 

1 In SA, there are three verbs with the stem ending in a consonant, namely eci ‘to go’, iki ‘to leave, to 
depart’ and hipi ‘to raise’, which take a special set of endings in the singular present indicative : 1 sg. 
-i, 2-3 sg. -en. 

2 The ending -i is used after -o- and consonants (except -k-, -g-, -h-)\ the ending -u is used elsewhere. 

3 Only those forms that deviate from the indicative pattern are given. 

4 The variant forms: punokam, punokesha etc. 

5 Only those forms that deviate from the active forms are given. 


Major types of the aorist stems 

• The aorist stem is identical to the present stem. The verbs of this type belong to all 
three conjugational classes. If the stem ends in a vowel the consonant -v- is inserted in 
the 1 sg. and 2 sg. forms. 

• The aorist stem is characterized by the extension -(/)/- in both singular and plural, cf. 
1 sg. pres, di- 1 sg. aor. dita ‘to know’. This type of correspondence is characteristic 
of some verbs from the 1st and 3rd classes. 

• The aorist stem is characterized by the extension -r- in the singular, cf. 1 sg. pres, bej - 
1 sg. aor. bera ‘to do’. The correspondence of this type is encountered in some verbs 
from the 1st and 3rd classes. Historically -r- (Gheg -«-) was also present in the present 
stem. 

• The aorist stem ends in a vowel, whereas the present stem has an extension -s-/-t~, cf. 
1 sg. pres, vras - 1 sg. aor. vrava ‘to kill’. This type of correspondence is encountered 
in some verbs of the 2nd class. 


Major vocalic and consonantal alternations 

Below I discuss only those alternations that are realized in a considerable number of 
verbs. 

- a-l-e e is used in the 2 sg. and 3 sg. present indicative active forms of some verbs 
of the 2nd class. 

-e- (-ye-, -ie-)/-i-: i is used in the 2 pi. present indicative (and subjunctive, future and so 
on) active, in the imperfect active, in non-active synthetic forms, and in the imper- 
ative. This alternation is characteristic of verbs of the 2nd and 3rd classes (and for 
one verb of the 1st class). 

-a-, -e-!-o-\ -a-, -e- (e further alternates with see above) is encountered in all forms 
except the aorist; it is typical of some verbs of the 2nd class. 

-o-/-ua-; -e-/-ve-: the diphthongs are used in the plural forms of the aorist (and in the 3 
sg. of the aorist non-active), in the admirative and in the participle. This alternation 
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is characteristic of the absolute majority of verbs of the 1st class that end in -o and of 
the majority of verbs of the 1st class that end in -e. 

-ua-/-o-; -ye-/-e-: o resp. e is encountered the singular forms of the aorist (except 3 
sg. of the aorist non-active) and in the optative of those verbs of the 1st class that 
have a present stem ending in -ua, ye. In the synthetic forms of the non-active these 
verbs demonstrate u resp. e. This alternation pattern is found in a small group of 
the verbs of the 1st class. 

- t-l-s -: the alternant s is used in the 1 sg. present indicative active and in the 2 sg. and 
3 sg. present subjunctive active and, often parallel to the forms with -t-, in the 2 pi. 
present indicative active, in the imperfect indicative active and in the 2 pi. impera- 
tive. The alternation is quite typical of the verbs of the 2nd class. 

- k-/-q -g- / -gj: palatal consonants are used in the 2 pi. present indicative (and sub- 
junctive, future and so on) active, in the imperfect active, in non-active synthetic 
forms, in the imperative and in the aorist. This alternation pattern is typical of some 
verbs of the 2nd class. 

The conjugation of the two most important irregular verbs is illustrated in Table 12.16. 


TABLE 12.16 CONJUGATION OF THE VERBS JAM ‘TO BE’ AND KAM ‘TO HAVE’ 




jam ‘to be’ 

kam ‘to have 

Indicative present 

1 sg. 

jam 

kam 


2 sg. 

je 

ke 


3 sg. 

eshte 

ka 


1 Pi 

jemi 

kemi 


2 pi. 

jeni 

keni 


3 pi. 

jane 

kane 

Indicative imperfect 

1 sg. 

isha 

kisha 


2 sg. 

ishe 

kishe 


3 sg. 

ish(te) 

kish(te) 


lpl. 

is him 

kishim 


2 pi. 

is hit 

kishit 


3 pi. 

ishin 

kishin 

Indicative aorist 

1 sg. 

qeshe 

pata 


2 sg. 

qe 

pate 


3 sg. 2 

qe 

pati 


lpl. 

qeme 

patem 


2 pi. 

qete 

patet 


3 pi. 

qene 

paten 

Subjunctive present 

1 sg. 

te jem 

te kem 


2 sg. 

te jesh 

te kesh 


3 sg. 

te jete 

te kete 


3 pi. 

te jene 

te kene 

Optative present 

1 sg. 

qofsha 

paga 


2 sg. 

qofsh 

pag 


3 sg. 

qofte 

paste 

Admirative present 

1 sg. 

qenkam 

paskam 

Participle 


qene 

pasur 


Among other irregular verbs are the following (I provide only the 1 sg. present indicative 
form, the 1 sg. aorist indicative form and the participle): bie - rashe - rene ‘to fall’ ; bie - 
prura - prure ‘to bring’; dua - desha - dashur ‘to want, to love’; ha - hengra - ngrene 



592 ALEXANDER RUSAKOV 


‘to eat’ ; jap - dhashe - dhene ‘to give’; le - lashe - lene ‘to leave’; rri — ndenja - ndenjur 
‘to sit, to stay’; shoh - pashe—pare ‘to see’; them - thashe—thene ‘to say’; vdes - vdiqa - 
vdekur ‘to die’; vete - vajta - vajtur ‘to go’; vij - erdha - ardhur ‘to come’. 


Basic patterns of dialectal variation 

1) Admirative forms are almost absent in the Arbresh and Arvanitika dialects, as well 
as in the southern Tosk Cham dialects. 

2) The supercompound forms are alien to the northern Tosk dialects. 

3) In old Albanian written texts, as well as in a portion of modern Gheg dialects (mainly 
in the dialects of the eastern northern group), several intransitive verbs are used in 
the perfect tenses with the auxiliary ‘to be’, cf. jam shkue (I.am go. PART.) ‘I have 
walked,’ instead of SA kam shkuar (I.have go. PART.). 

4) In Gheg dialects there is a special fonn of the analytical infinitive that is formed 
according to the model me + participle {me shkue ‘to go’). Most likely, the form is a 
Gheg innovation. 

5) In Gheg dialects there is a special form of the future, which is formed according to 
the model ‘have + analytical infinitive’, cf. kam me shkue have. 1SG. PRES. INF. 
PART. ‘I shall go’; this fonn coexists with the SA form of the future, which appar- 
ently has radiated from the Tosk area. Other Gheg forms of the future and condi- 
tional subsystems can also be formed according to this model (see Breu 2013: 43). 
In Arbresh dialects there is a “hybrid” future form {kam + subjunctive). 

6) There are various dialectal fonns of the gerund particle (for example, tue in Gheg). 

7) There are numerous dialectal differences in the field of participle formation. Deriva- 
tional types are more variegated in Gheg dialects than in Tosk dialects (see below). 
Besides, in Gheg dialects there is a special short fonn of the participle that is used in 
analytical constructions. 

8) In some Gheg dialects (especially in the eastern part of the northern and central Gheg 
dialects) there is a tendency towards non-differentiation of the 2 sg. and 3 sg. present 
indicative active fonns with conesponding subjunctive fonns. 

9) In most Gheg dialects (except South Gheg dialects) the imperfect is formed by 
means of the suffix -sh- (Gheg punojsha ‘I worked’). The almost total absence of 
such fonns in the old texts is indicative of the recent origin of this innovation. There 
are also many other dialectal peculiarities in the imperfect endings (for example, 3 
sg. -ke in eastern northern Gheg dialects). 

10) The imperfect passive is formed in the majority of Gheg dialects by means of the 
particle u and not synthetically as it is in SA (Gheg u lajsha REFL. wash. 1 SG.IMP. 
ACT. vs. SA and Tosk lahesha wash.lSG.IMP.PASS. ‘I was washed’). In a few 
dialects (Gheg, southern Tosk and Arbresh) as well as in the old texts a parallel non- 
active perfect construction is also encountered, which is formed with the particle u: 
“u + perfect (pluperfect, aorist 2) active”. 

11) It is often the case that many verbs shift in various dialects from one conjugational 
class to another. For example, in some northern Tosk dialects there is a tendency 
towards the extension of the aorist stems in -t- at the expense of those verbs that 
have aorist stems with -v- in SA. In the old texts, the conjugation type that is typi- 
cal of verbs ik, ec and hip in SA is extended to a significant number of verbs of the 
2nd conjugational class (see de Vaan 2014); thus, these verbs constitute a special 
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subclass of the 2nd class in the old texts. In some Gheg dialects, the verbs of the 3rd 
class conjugate in the present according to the model of the 1st class. 

12) There are also dialectal differences in conjugation that are due to phonetic processes. 
For example, the SA and Tosk -r- in the forms of the aorist corresponds to -n- in 
Gheg; cf. SA and Tosk 1 sg. aor. act. bera ‘I did’ - Gheg bana. In a part of the Tosk 
dialectal area the present and imperfect endings with -y'(-) retain the archaic fonn 
-nj(-), cf. punonj ‘I work’ as opposed to the SA punoj. 

13) There are some archaic endings in the old texts. They are important for the recon- 
struction of the history of the Albanian conjugation; see below on some of them. 


Verb system: the origin 
Present stems 

The PIE verb system has been significantly reshaped in Albanian. Major conjugational 
types of SA often encompass verbs that evolved from verbs that have different types of 
PIE stems. Only three verbs that use -m as their 1 sg. present indicative and subjunc- 
tive ending can be treated as the continuation of the PIE athematic type: SA jam ‘to be’ 
(< PIE *h,es-/h[S-), SA them ‘to say’ (< PIE *keHs-) and SA kam ‘to have’ (< PIE *keh 2 p-); 
the latter verb could have been influenced by the verb jam. All other (inherited) verbs 
evolved from PIE thematic types, with or without suffixes. 

• “Pure” thematic type; verbs that originated from this type mostly belong to the SA 
second conjugational class: SA bredh ‘to roam’ < PIE *b h red h -e/o-; vjedh ‘to steal’ 
< PIE *weg h -e/o-. 

• Suffix *-ske-/-o-: SA njoh ‘to know, acquaint’ < PIE *gnh r ske-/-o- (PIE *sk >SA -h, 
see “Phonology”). 

• Suffix/infix *-ne/o-: SA dal ‘go out’ < *d h l-ne/n-h,-; SA bind ‘to convince’ < PIE 
*b h i-ne/n-d h -. 

• The present suffix *-ye/-o- and causative-iterative suffix *-eye/o- play a key role in the 
development of the Albanian conjugational system. Some verbs of the SA 2nd and 3rd 
conjugational classes are formed with these suffixes: SA bie ‘to bring’ < PIE *b h er-ye-; 
SA ndez ‘to burn’ < PIE *dog wh -eye/o-; SA vesh ‘to put on’ < PIE *wos-eye/o-. When 
the verb stem ended in attachment of this suffix resulted in the alternation -s/-t. For 
example SA 1 sg.flas ‘to speak’ is a denominal formation based on the Latin borrowing 
fabula > SA jjale < *-at-yo, 2 sg .flet < *-at-is < *at-yes (see Matzinger 2006: 122). 
This alternation became a productive pattern in a certain period of Albanian language 
history (it is typical, for example, of verbs borrowed from Slavic) and is now found in 
a large number of verbs of the 2nd class. The agglutination of the suffix to the nasal 
presents (for example, SA bej ‘to do’ < PIE *b h -n-h,-ye/o-) resulted in the development 
of the suffix -nyole-, which has become the main tool for verb derivation and thus 
eventually formed the 1st conjugational class. Many denominal verbs were formed 
with this suffix (SA punoj ‘to work’ from SA pune ‘work’ < PIE *pudneh 2 ). In addition, 
some verbs from other conjugational classes demonstrate or demonstrated a tendency 
to shift into this class (see, for example, SA qaj ‘to cry’, which belonged to the 3rd 
class in Buzuk’s book). Starting from the early Albano-Latin contacts, most borrowed 
verbs are adapted as verbs of the 1st class (ending in -oj or, more rarely, in -ej). 
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Present endings 
Active indicative 

1 sg.: athematic SA -m < PIE *-mi; thematic SA -0 < PIE *-o (*-oh 2 ); SA -j, OA -nj 
(in the 1st class) < PIE *-n-yd; SA -5 (in the 2nd class) < PIE *-t-yo. 

2 sg. and 3 sg.: The zero endings of the 2 sg. and 3 sg. of the thematic conjugation 
can reflect either the PIE injunctive (“secondary”) endings *-es, *-et, or the “pri- 
mary” endings *-esi, *-eti. Umlaut in some Albanian verbs shows that the imme- 
diate stage of development was *-is, *-it. The forms ending in -n (1st class) and -t 
(2nd class) indicate at the loss of the -y- in the suffix (see Schumacher & Matzinger 
2013: 253). -te in the 3 sg. of -mi verbs must have developed from PIE *-ti. 

1 pi.: thematic SA -me < PIE *-omes; athematic SA -mi < PIE *-mes. 

2 pi.: SA -ni (in the old texts and in dialects a more archaic ending -i is used). The 
ending -ni is a later analogical development influenced by the verbs of the 1st class; 
-i could have evolved due to the influence from the non-active paradigm (Klingen- 
schmitt 1994: 226). 

3 pi.: SA -ne < PIE *-(o)nti (an alternative interpretation is given in Schumacher & 
Matzinger 2013: 51-52, 215). 

Non-active indicative 

Non-active present endings continue PIE medial present endings. If the verb’s stem ends 
in a consonant (2nd class), these endings are attached directly to the stem; if the verb’s 
stem ends in a vowel (3rd class), a connecting consonant -h- is inserted (it is either a later 
anti-hiatus consonant or a reflex of PIE *-sk-). The same -h- is also found in the verbs of 
the 1st class, but in the old texts and in some dialects non-active endings are attached to 
a stem ending in -n: SA behet ‘s/he becomes (3 sg. pres, non-act.)’ vs. OA benet. Non- 
active forms, both present and imperfect, of the verbs with vocalic alternations have the 
stem variant with -i- (as well as with final palatal consonants -q, -gj). Some peculiarities 
of the spelling of non-active present forms in the old Gheg texts make it possible to 
conclude that in OA the stress fell on the thematic vowel (see Schumacher & Matzinger 
2013: 127-131). 

1 sg. SA -em < PIE *-o-may. 

2 sg. SA -esh: OA -e < PIE *-e-soy. 

3 sg. SA -et, OA -ete < PIE *-e-toy. 

1 pi. SA -emi < PIE *-o-med h h 2 (?). 

2 pi. SA -eni, OA -i < PIE *-e-d h we (?). 

3 pi. SA -en, OA -ene < PIE *-o-ntoy. 

The Albanian present active subjunctive, at least those forms that are different from indic- 
ative forms, resulted from the development of the PIE subjunctive. 

2 sg. SA -sh < PIE *-e-esi; 3 sg. SA -e < PIE *-e-et(i); PIE *-e-si, *-e-ti for athematic 
verbs. 

Imperfect active 

Imperfect forms are based on the present form. The verbs with vocalic alternations have 
in the imperfect the stem variant with (as well as with final palatal consonants -q, -gj). 
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The imperfect demonstrates a very high level of diatopic and diachronic variability; in 
old Albanian various forms of the imperfect are encountered in texts by different writ- 
ers. In what follows I will rely on the reconstruction of late PA imperfect endings offered 
by St. Schumacher (Schumacher & Matzinger 2013: 134—140). It is generally accepted 
that imperfect endings in Albanian developed from the PIE “secondary” endings. 

1 sg. SA -ja, PA *-/a/ < PIE *-(o)-m/-m. In SA (and, partly, in old texts) the modern 
endings with the element -j- (dialectal -nj-) in 1 sg. and 2 sg. and with -n- in the 
plural have extended under the influence of the verbs of the 1 st class verbs and also 
of the OA verbs of the ikinj type, see above; -a in the ending -ja could have arisen 
under the influence of the corresponding aorist form, see below. 

2 sg. SA -(/)<?, PA *-e < PIE *-(e)-s. 

3 sg. SA -te, PA *-0, *-j < PIE *-(e)-t; SA -te is a relatively late extension. 

1 pi. SA -nim, PA *-im < PIE *-(o)-mo/e. 

2 pi. SA -nit, PA *-ita < PIE *-(e)-te. 

3 pi. SA -nin, PA *-(i)nna < PIE *(o)-nti (an alternative explanation is suggested in 
Schumacher & Matzinger 2013: 215) 


Imperfect non-active 

The endings of the imperfect non-active did not evolve continuously from the PIE for- 
mants; rather, the paradigm has been deeply reshaped according to the model of the 
conjugation of the verb jam ‘to be’ in the imperfect active. This pattern thus shows an 
interesting parallel to the pattern in Modem Greek (Schumacher & Matzinger 2013: 143, 
Fiedler 1981). The accent patterns of the imperfect non-active in oldGheg texts resemble 
those of the present non-active. 


Aorist stems 

The Albanian aorist represents a highly heterogeneous phenomenon. From a historical 

point of view the following types can be distinguished: 

• The verbs that use -she in the 1 sg. aor. act.: qeshe ‘I was’, thashe ‘I spoke’, lashe ‘I 
left’, dhashe ‘I gave’, see above the list of the irregular verbs (pp. 589-590). These 
forms have been traditionally viewed as directly descending from the PIE sigmatic 
aorist. According to Klingenschmitt 1994 (see also Matzinger 2006: 128-129), they 
are innovations that developed under the analogical influence of the imperfect of ath- 
ematic verbs (Buzuk’s jeshe ‘I was (impf.) < PIE *h[es-m). 

• By contrast, the aorists in -v- (1st and 3rd classes) must have developed from the 
PIE sigmatic type, in which the intervocalic *-s- has dropped and an anti-hiatus -v- 
has emerged (Klingenschmitt 1994, Matzinger 2006: 129, Schumacher & Matzinger 
2013: 148-154). Another hypothesis with respect to the origin of the -v- aorist assumes 
that it is a form that is parallel to Latin -vf- perfects (Orel 2000: 199). 

• There is no commonly accepted view on the etymology of aorists in A possible 

suggestion is that it evolved from the periphrastic construction consisting of the ver- 
bal adjective (with *-to- suffix) and esse (Schumacher & Matzinger 2013: 154-158, 
Klingenschmitt 1994: 231). 
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• Among the Albanian root aorists that are especially interesting are some 30 verbs that 
have an alternation between -a- or -e- in the present and -o- (< PIE *e) in the aorist: SA 
hedh ‘I throw’ : hodha ‘I threw’. It is usually assumed that these aoristic forms have 
evolved from the perfect with a lengthened root vowel; this pattern is then similar to 
analogical patterns in Latin, German, Celtic and Tocharian (for a detailed analysis, 
see Schumacher & Matzinger 2013: 161-173; for an interpretation of these aorists as 
“Narten-imperfects,” see Jasanoff 2012). 


The aoristic endings in SA 

1 sg. -a, 2 sg. -e, 3 sg. -i/-u, 1 pi. -m(e), 2 pi. -t(e), 3 pi. -n(e), have evolved from the 
PIE secondary endings (similarly to imperfect active endings). The 1 sg. and 3 sg. 
call for some comments. 

1 sg. SA -a is probably the result of contraction of -e- and -e (on parallel contrac- 
tions in nominal and pronominal system, see above), -e- could have developed 
from PIE *-m or from the perfect ending *-h,a (see Matzinger 2006: 129). -e 
may be a reflex of a deictic element. 

3 sg. -H-u (on the distribution of these variants see above) goes back to a deictic 
element that is identical to the definite article (and to the pronominal element of 
the demonstrative pronoun). This element has entered the paradigm relatively 
recently (in the O A period). 


Aorist non-active 

Non-active particle u < PIE *swe. 

1 sg. -she, attested in old texts and in some dialects, has evolved under the influence 
of the imperfect paradigm. 

3 sg. SA -0 - the active ending without the deictic particle. 

All other non-active fonns don’t differ from the active forms. 


Optative 

The optative is traditionally considered an innovation formed on the basis of aorist stems 
(see e.g. Matzinger 2006: 132; there is also a proposal that the Albanian optative evolved 
from the PIE “optative of s-aorist,” Schumacher & Matzinger 2013: 177). Thus, the opta- 
tive element -sh- is either a direct reflex of the PIE aoristic *-s- or an element that devel- 
oped under the analogical influence of some aorist and imperfect forms. If the former 
hypothesis is correct, the element -sh- (< PIE *-s- + *-!-) must have initially evolved 
in the verbs with the -v- aorist (this form itself probably developed from the PIE sig- 
matic aorist) and later extended to other conjugation classes by analogy (Schumacher & 
Matzinger 2013: 180-182). The element -f- in the verbs ending in a vowel is a result of 
the devoicing of the Albanian aorist element -v- (in the verbs ending in a consonant the 
-sh- marker is attached directly to the stem). As for the endings, they could have been 
influenced by those of the aorist and imperfect indicative active. 

The form 2 sg. SA -sh- < PIE *-s-Is (Matzinger 2006: 116); 3 sg. SA -te could have 
evolved under the influence of the 3 sg. present subjunctive active (Matzinger 2006: 116). 



ALBANIAN 597 


Imperative 

2 sg. SA -0 in verbs belonging to all conjugational classes, -e in some verbs with con- 
sonantal stems, -j in some verbs with vocalic stems. The stem of verbs with vocalic and 
consonantal alternations demonstrate /-vocalism and the final palatal consonant, -e < PIE 
*-c (?). 

The 2 pi. imperative form is identical to the corresponding indicative form. An object 
clitic pronoun can be inserted between the stem and the ending: degjo-me-ni hear-I. 
ACC.-2PL.IMP. ‘hear (pi.) me!’; see “Syntax”. 


Participles 

The participles in SAare formed by means of several derivational suffixes: -r(e) / Gheg 
-/?(<?); -ur / Gheg -un; -ne. Their distribution is determined by the verb class and the stem 
auslaut; for the majority of verbs, participles are derived from aorist stems. Etymolog- 
ically all these participle suffixes result from the development of the PIE suffix *-no-. 
In some (especially Gheg) dialects and in the old texts there are also participles formed 
with the formants -m (< PIE *-mo-) and -te (< PIE *-to-). One and the same verb can 
often use different suffixes of the participle in different dialects or in old texts written 
by different writers, and sometimes even in the texts of the same writer. Semantic dif- 
ferences between participles formed with different suffixes are not registered. On the 
short forms of particles in modem Gheg dialects, see above (“Basic Patterns of Dialectal 
Variation”). 


WORD FORMATION 

This section is mainly based on Xhuvani and (jubej 1976a, 1976b, Camaj 1966, Orel 
2000: 151-177; Matzinger 2006: 137-140, in print ; Topalli 2011: 1149-1175; see also 
Schumacher and Matzinger 2013: 188-190, Sasse 1991: 233-267. 

The main word-formation means in Albanian are affixation, compounding and 
conversion. 


Affixation 

Albanian uses both suffixation and prefixation; as in other Indo-European languages suf- 
fixation prevails in Albanian. 


Suffixation 

A. Nouns. There are several chronological layers in the nominal word-formation suffixes 
of Albanian. First, there are suffixes that can be traced back to the PIE epoch (see Camaj 
1966: 113-124): -k-/-g- (SA petk ‘clothes’, cf. pete ‘thin layer of dough’), -p-/-b- (SA 
thelb ‘kernel of a nut’), -l-l-r- ( kermill ‘snail, slug’). As a mle, these suffixes are unpro- 
ductive in modem Albanian, and their derivational meaning is not transparent. 

Then there are many suffixes that are productive or semi-productive in modern Alba- 
nian. They derive Albanian nouns from nouns, adjectives and verbs; some of these suf- 
fixes have been borrowed from Latin, Slavic and Turkish. Among the most widespread 
nominal suffixes are the following. 
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• Suffixes of abstract nouns. SA -i (SA liri ‘freedom’ from SA (i, e) lire ‘free’); this 
suffix is used to derive nouns from nouns, verbs and adjectives. It goes back to PIE 
*-iyeh 2 . SA -im, cf. SA pushim ‘rest, pi. recreation’, from SA pushoj ‘to rest’, derives 
nouns from verbs and goes back to PIE *-imu (see Klingenschmitt 1994: 225); etc. 

• Suffixes of nomina agentis. SA -(e)s and its variants -es, -ues, onjes ( ndihmes ‘assis- 
tant’ from ‘ ndihmoj ‘to help’) is used to derive nouns (and adjectives) from nouns 
and verbs < PIE *-ik w yo- (cf. Ann. -ic 0 -). SA -tor (with variants -tuar, -tuer), -tar, -or 
(- uer , -uar), -ar (SA sherbetor ‘attendant’ from SA sherbej ‘to serve’); in all proba- 
bility all these suffixes are borrowed from Latin; they are used to derive nouns and 
adjectives from both nouns and verbs. There are also some suffixes of Slavic origin; 
they can be used to derive not only nomina agentis in a narrow sense but also other 
nouns that refer to people and sometimes have a pejorative semantic shade: -ad-ec, 
-ash, -avec, -nik; cf. SA mburravec ‘boaster’ from mburr ‘to brag’. 

• Feminine suffixes: -eshe, ushe (SA luaneshe ‘lioness’ from SA luan ‘lion’), < Lat. 
-issa; onje < Lat. -onia (?); and a borrowed Slavic suffix -ice. 

• Diminutive suffixes: -th (for nouns of masculine gender, < PIE *-ko-), -ze (for nouns of 
feminine gender, < PIE *-dya), cf. SA birth ‘sonny’ from SA bir ‘son’. In some Albanian 
dialects, especially in Arbresh, the use of diminutives is abundant: diminutive suffixes 
are used there not only with nouns but also with adjectives, adverbs and even verbs. 

• Suffixes with the collective and locative meaning: -shte (SA vreshte, Gheg vnesht(e) 
‘vineyard’ from SA vere, Gheg vene ‘vine’) < *-V-st- (this suffix is encountered in 
some old Balkan geographic names). 

B. Adjectives. The adjectives share some derivational suffixes with nouns (see above); 
this is characteristic of unarticulated adjectives. There are some suffixes typical of artic- 
ulated adjectives: 

• -m(e), cf. SA ( i , e) sotem/me ‘today’s’ from sot ‘today’, is used to derive adjec- 
tives from words belonging to various parts of speech, < PIE *-mo-. 

• -shem, SA (i, e) kujdesshem/me ‘careful’, from kujdes ‘care’, is used to derive 
adjectives from nouns, adverbs and verbs (the latter possibility is attested only in 
the Gheg area), < *-sh (ablative ending?) + *-mo- (see above). 

• -t(e), cf. SA (i, e) hekurt ‘iron (adj.)’, from SA hekur ‘iron’, is used to derive 
adjectives from adverbs, verbs and nouns, < PIE *-to-. 

C. Adverbs. A large part of Albanian adverbs are formed by means of conversion (see 
below). There are, however, some suffixes that are used to derive adverbs from other 
parts of speech: -isht (SA besnikerisht ‘faithfully’ from SA besnik ‘faithful’, < PIE *-ist- 
o-; -thi (SAfluturimthi ‘as in the flight’, from SA fluturim ‘flight’); -asl-azi (SA gjunjas 
‘on one’s knees’ from SA gjunje ‘knee’) and some others. 

D. Verbs. On the old IE verbal stem extension in Albanian see above (“Morphology, 
Verb system: the origin”). 


Prefixation 

In modern Albanian, prefixation is characteristic of both the nominal and verbal system. 
Many prefixes are homonyms to adverbs and prepositions: SA mbijetoj ‘to survive’ from 
mbi ‘over’ + jetoj ‘to live’; SA nengmim ‘underestimation’ from nen ‘under’ + gmoj ‘esti- 
mate ’ . Some prefixes in SA are borrowed: SA riarmatos ‘to reweapon’ from ri- (from Ital. 
77-) + armatos ‘to weapon’. There are, however, preverbs that form an older chronological 
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layer. Many such preverbs (see the list in Schumacher & Matzinger 2013: 188-190; also 
Camaj 1966: 103-113) have lost their meaning in the modern language: cf. e.g. SA v-des 
‘to die’ and des in texts by old writers. Combinations of two preverbs are encountered 
very rarely: SA sh-per-blej ‘reward’, cf. SA blej ‘buy’. 

Compounding is widely represented in modern Albanian. There are various types 
of compounds, for example: SA deledash ‘hermaphrodite’ from dele ‘sheep’ + dash 
‘ram’ ( dvandva ); SA ditelindje ‘birthday’ from dite ‘day’+ lindje ‘birth’ (. tatpurusa ); SA 
lulekuqe ‘poppy’ < lule ‘flower’ + (i, e) kuq/e ‘red’ ( karmadharaya ); SA zemergur ‘heart- 
less’ from zemer ‘heart’ + gur ‘stone’ ( bahuvrihi ). The order of the components within 
compounds is parallel to word order in the syntax. Compounding in the verbal domain is 
rarer; see, for example, SA udheheq ‘to direct’ from udhe ‘way’ + heq ‘to remove’. 

There are also compounds with opaque structure, e.g. SA kushtrim ‘appeal’ from kush 
‘who?’ and trim ‘brave’. 


Conversion 

Two clear patterns of conversion are represented by the processes of adjectivization of 
participles and substantivization of articulated adjectives: SA folur ‘to speak (participle)’ 
— > (;', e) folur ‘colloquial’ — » (te) folurit ‘speech’. Conversion is also characteristic for the 
“adjective-adverb” pairs: SA (i, e) mire ‘good (adjective)’ - mire ‘well (adverb)’, but it 
is difficult to determine the direction of the derivation in this case. 


SYNTAX 

This section focuses on some of those features of the Albanian syntax that are interesting 
from an areal and typological point of view. Some remarks on Albanian morphosyntactic 
characteristics can be found above (see “Morphology”). Albanian syntax is very poorly 
described from a historical point of view (see, however, Joseph in print , Matzinger 2006: 
140-144, Topalli 2011: 1179-1281). 

Albanian is a nominative-accusative language that codes this alignment by case mark- 
ing, verb agreement and word order (on the additional marking of objects by means of 
clitics, see below). Albanian is a language with a relatively free word order (as far as 
major constituents are concerned). Nevertheless, SVO is clearly the basic word order pat- 
tern. Numerous deviations from the SVO pattern are accounted for by the topicalization 
of the object and by other properties of information structure, by actionality character- 
istics of the verb, as well as by emphatic and stylistic aspects. Albanian is a pronounced 
pro-drop language. 

In general, Albanian syntax does not differ significantly from the syntax of other mod- 
em European languages: Albanian shares the majority of diagnostic features determined 
for the Standard Average European (SAE) languages (Haspelmath 2001: 1505) (these 
diagnostic features are mostly syntactic). Some important syntactic traits are shared by 
Albanian with other languages of the Balkan Sprachbund (see below). 

Syntax of the noun phrase. The main characteristics of the Albanian noun phrase 
were described above (“Morphology”). Probably the most striking phenomenon is the 
presence of a special linking element that agrees with the head noun and is placed both 
before the adnominal genitive (this pattern is thus a kind of Suffixaufnahme) and before 
adjective-like words. This peculiarity is also found in Balkan Romance (first of all, in 
Daco-Romanian), which is one of the main pieces of evidence pointing toward intensive 
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(Proto-)Albanian - (Proto-)Rumanian contacts. A feature that Albanian shares with both 
Romanian and Bulgaro-Macedonian is a rule according to which the first component of 
the noun phrase is marked for case and other categories if this position is occupied by an 
adjective, ordinal numeral, possessive pronoun or another element whose default position 
is after the head noun, cf. : djal-i i mire boy-NOM.M.SG.DEF. ART.NOM.M.SG. good 
‘good boy’ (unmarked word order) and/ mir-i djale ART.NOM.M.SG. good-NOM.M.SG. 
DEF. boy ‘good boy’. In old texts there are examples where both members of the noun 
phrase get inflections; cf. an example from Buzuk: vepra-vet mira-ve act.PL. -GEN./DAT. 
DEF. good.F.Pl. -GEN./DAT. ‘good acts (dat.)’. 

Along with postposed nominal dependents, described above, there are some nominal 
dependents that are found in preposed preposition: cardinal numerals, demonstratives, 
indefinite and interrogative pronouns and the indefinite article. Some of them are indeclin- 
able when used in the noun phrase (cardinal numerals, some indefinite pronouns, indefi- 
nite article); thus, in corresponding noun phrases it is only the noun that declines: ndonje 
njeri-u some person-GEN./DAT.M.SG.INDEF. ‘(of/to) some person’. The interrogative 
pronoun (/, e) cili/a ‘what’ behaves as an adjective: i cil-i njeri? ART.NOM.M.SG. what- 
NOM.M.SG.DEF. person ‘what person?’ In noun phrases with demonstrative pronouns 
both members of the construction decline, and the noun can be either in the indefinite 
or in the definite form: ketij njeri-u this.GEN./DAT.M.SG. person-GEN./DAT.M.SG. 
INDEF. vs. ketij njeri-ut this.GEN./DAT.M.SG. person-GEN./DAT.M.SG. DEF. ‘of/to 
this person’. 

Ablative construction. Albanian has a special construction in which the attribute is 
expressed by an ablative form in the singular or in the plural and has a non-referential, 
qualifying meaning: kenge dashuri-e song love-ABL.F.SING.INDEF. ‘love song’, leng 
fruta-sh juice fruit.PL.-ABL. PL. INDEF. ‘fruit juice’. In this construction, both the head 
and the dependent must be used in the indefinite form; this means that when the head 
noun is in the definite form, the attribute should be expressed by the genitive, while the 
ablative construction is ruled out. When the ablative form is in the plural, the ending -sh 
(and not -ve) must be used. 

Prepositional constructions. Albanian prepositions typically govern nouns in the accu- 
sative and ablative cases; there are also some prepositions that evolved through grammat- 
icalization of prepositional phrases, which govern the genitive: ne rast te ‘in (the) case 
of’. The majority of prepositions govern one case. The exceptions are the prepositions 
per ‘for, about’ and ndaj ‘toward, by’, which govern both the accusative and ablative 
cases. Location at some point and motion towards that point are grammatically expressed 
by identical prepositional constructions in Albanian, which is also typical of other Balkan 
languages: ne shtepi [in house] ‘at home, to the house’. Albanian has also two preposi- 
tions, nga ‘from’ and te(k) ‘to’, which govern the nominative. 

Albanian belongs to the languages with transitive have-possessive constructions. Exis- 
tential clauses are formed with the 3 sg. form of the verb kam ‘to have’: Ketu ka disa 
studente Here have. 3SG. PRES, some students. ‘There are some students here’. A con- 
struction with jam ‘to be’ can also have an existential meaning: Jane ca njerez, qe . . . 
be.3PL.PRES. some people, which . . . ‘There are some people, who. . .’. 

Weak (clitic) pronouns. These pronouns double the direct and indirect objects. They 
may be the only means of expression of direct or indirect objects in clauses with 1st 
or 2nd person objects or in clauses where the weak 3rd person pronoun is used in an 
anaphoric function. On the form of weak pronouns, see above (“Morphology”). Dative 
weak pronouns are obligatory in SA; they double not only “true” indirect objects but 
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all clause-level dependents in the dative case, including nominal dative forms that are 
used in adverbial functions. In the old texts and in the dialects dative weak pronouns are 
not consistently obligatory. Pronominal doubling of direct objects is optional, except for 
direct objects expressed by the pronouns i gjithe ‘all’ and i tere ‘all’, which are obligato- 
rily doubled. Direct object doubling depends on information-structural properties of the 
clause: rhematic direct objects are never doubled, cf. (1) and (2): 

1) Agim-in e gjeta ne dhome 

Agim-ACC.M.SG.DEF. PRO.ACC.3SG I. found in room 

‘I found Agim in the room’. 

2) Ne dhome gj eta Anil-en 

in room I. found Anila-ACC.F.SG.DEF. 

‘In the room I found Anila (It was Anila whom I found in the room)’. 

Accusative forms that are not direct objects are never doubled. 

Weak pronouns are placed immediately before the verb. The only exception are imper- 
ative constructions: weak pronouns can be placed after the 2 sg. form and can be incor- 
porated within the 2 pi. form (see “Morphology” above). When both a dative and an 
accusative weak pronoun are used in the same clause, the order is always “dat. + acc.” As 
a result, there is a series of coalesced forms: ma (me + e) ‘1 sg. dat. + 3 sg. acc.’, ta (te + 
e) ‘2 sg. dat. + 3 sg. acc.’, ia (i + e) ‘3 sg. dat. + 3 sg. acc.’, etc: 

3) Ia dhashe libr-in 

PRO.3SG.DAT.PRO.3SG.ACC. I.gave book-ACC.M.SG.DEF. 

mesues-it 

teacher- DAT.M.SG.DEF. 

‘I gave the book to the teacher’. 

Pronominal doubling of objects is a characteristic trait of the Balkan Sprachbund. The 
form of the coalesced pronouns suggests the PA period as the period when they evolved 
and thus again the period of (proto )Albanian-(proto)Romanian contacts. 

Negation. Albanian has different negation particles for the real (indicative, admirative) 
and irreal (subjunctive, optative, imperative) moods; it thus preserves the PIE distinction, 
which is similarly preserved in Indo-Iranian, Armenian and Greek. In real contexts SA 
s (for clause negation also a longer variant as < PIE *(ne) h 2 oyu-k w id) or nuk (< PIE 
*ne-k w e) is used; in irreal contexts SA mos is used (< PIE *me-k w e). 

Complementation. There are two main models of complementation in SA. One pat- 
tern employs conjunctions se ‘that’ (< PIE *k w e) or qe ‘that, what’ (< PIE *k w i/o-); se is 
preferred in contexts with most semantic types of matrix verbs. Another pattern involves 
the use of the subjunctive in the dependent clause (see “Morphology” above). The choice 
between these two models is determined by ( 1 ) whether the event in the dependent clause 
is factual (real) or non-factual (non-real) and (2) the degree of boundedness of the events 
in the main and subordinate clauses (see Matras 2002: 179-185). Thus, there is a kind 
of continuum stretching from epistemic verbs and perception verbs on one pole of the 
continuum (these verbs favour the conjunction pattern; cf. (4)) to modal and phasal verbs 
on the other pole of the continuum (these verbs favour the subjunctive pattern; cf. (5)). 
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The conjunction qe can introduce a dependent clause with a subjunctive verb form (for 
example, if the subjects of of the main and dependent clauses are not coreferential) (6). 

4) A i e di se keni 

He PRO.3SG.ACC. he.knows that have. 2PL. PRES. 

dobesi 

weakness 

‘He knows that you have weakness’. 

5) Dua te shkoj 

Want. 1SG. PRES. SUBJ go.lSG.PRES. 

‘I want to go’. 

6) Dua qe te shkosh 

Want.lSG.PRES. that SUBJ. go.2SG.PRES. 

‘I want that you go’. 

Such a two-fold complementation strategy is a characteristic trait of the Bal- 
kan languages. In Gheg dialects (including old Gheg texts) the subjunctive strategy 
competes with a strategy with the infinitive of the “me + participle” type (see Arapi 
2010). 

Conditional sentences. There is a great deal of variation in the ways Albanian can 
express conditional relations, especially in the protasis, where the choice of the verb form 
is determined by (1) the choice among several conjunctions (po , ne, nese, sikur) or the 
absence of the conjunction; and (2) the type of conditional meaning: real or potential vs. 
irreal. The indicative, optative, admirative and subjunctive forms are used in the case of a 
real or potential condition, whereas the imperfect or the pluperfect of subjunctive (or the 
subjunctive of admirative) is used in the case of an irreal condition. In the apodosis of the 
irreal conditional sentences the imperfect and pluperfect indicative as well as the present 
and past conditional can be used. 

Relative sentences. SA has the inflected relativizer (j, e) cili ‘which, who’ (7), which 
can be used for relativization of any syntactic position in the dependent clause. This 
pronoun is ousting the old general relativiser qe/qe, which can be used in modem SA 
exclusively for the relativization of subjects and direct objects (8). 

7) ky trim, per te cilin 

this brave. man on ART.ACC.SG. which.ACC.M.SG.DEF. 

tregoheshin legjenda, trokiti mu ne deren 

tell.3PL.IMPEPASS. tales he.knocked just in door.ACC.EDEF. 

e tyre 

ART.ACC.SG.DEF. their 

‘This brave man, about whom the tales were told, knocked directly at their door’. 

8) Njerezit qe benin sehir nuk nxitonin 

people which do.3PL.IMP. looking not hurry.3PL.IMPF. 

‘People, which looked around, did not hurry’. 
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FURTHER READING 

Bibliographic data on Albanian and Albanian linguistics are published regularly in “Bib- 
liographic linguistique” and “Indogennanische Chronik” (before 1949 Albanian linguis- 
tics was reviewed in Indogermanisches Jahrbuch ); see also Schmitt-Brandt 1968, Hamp 
1972, Daka, 1975, 1984, Kodderitzsch 1994. 

Overviews of the history of the albanological research can be found in Jokl 1917, 
Schmitt-Brandt 1968, Hamp 1972, Kodderitzsch 1994. There are two substantial intro- 
ductions to Albanian studies: Fiedler 2006 and Matzinger 2006; see also an overview of 
the main problems of Albanian language history in Sh. Demiraj 2013 (with a detailed 
English summary). 

The basic monolingual dictionaries are Kostallari 1980 and Thomai 2006 (see also 
Cipo et al. 1954). Albanian-English dictionaries include Mann 1948 (a “historical” dic- 
tionary with very rich dialect material) and Newmark 1999. The annotated Albanian 
corpus (19 million tokens) is now available at http://web-corpora.net/AlbanianCorpus/ 
search/. 

The main etymological dictionaries are Meyer 1891 (the first Albanian etymological 
dictionary), Huld 1984 (basic IE etymologies of Albanian), B. Demiraj 1997 (it encom- 
passes only the IE part of the Albanian lexicon), Orel 1998 (IE etymons and early bor- 
rowings), and Cabej 1976-2014 (this opus magnum by the greatest Albanian linguist 
encompasses, unfortunately, only a selection of the Albanian lexicon there is a French 
summary); see also the etymological dictionary of Albanian verbs of PIE descent in 
Schumacher and Matzinger 2013: 965. 

The best Albanian reference grammar is Buchholtz and Fiedler 1987; see also New- 
mark, Hubbard and Prifti 1982 (based in many parts on Domi 1976) and Camaj 1984. 

Among the historical grammars Mann 1977 and Orel 2000 employ a comparative 
approach, whereas Sh. Demiraj 1986 is the most detailed collection of Old Albanian 
and dialectal language material (see also Sh. Demiraj 1993); Schumacher and Matzinger 
2013 contains a thorough analysis of the history of the Albanian verb system. 


NOTE 

1 This reconstructed PA stage may actually reflect a somewhat later stage in the devel- 
opment of the Albanian language. Indeed, on the one hand, there are some grounds to 
believe that (Proto)Gheg and (Proto)Tosk dialects had intensive contact in the first half 
of the second millennium AD. However, on the other hand, some similar processes of 
language change may have developed in both Gheg and Tosk independently. These 
circumstances can influence the assumed reconstruction. 
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Albanian (Alb.) 3, 5, 13, 422, 427, 455-6, 489, 
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Albanian period 559-60; pronouns 582-4; 
Proto- 557-60; Slavic borrowings 557, 597; 
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word formation 597-9; see also Albanian 
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types of aorist stems 590; major vocalic and 
consonantal alterations 590-1; non-active 
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morphosyntax 191; nominal derivation 180; 
nominal inflection 178-80; phonology 15-18, 
21, 176-7; pronouns 180-4; subordinate 
clauses 193; word order 191-3; see also 
Anatolian verbs; Proto- Anatolian 
Anatolian subgroup: Carian 3, 172, 175; Hittite 
3, 13, 15-17, 19, 21, 171-3, 178, 357; Luwian 
13, 21, 171-4; Lycian 3, 21, 172, 174-5; 
Lydian 2, 172, 175; maps 173-4; overview 
171-2; Palaic 3, 172-3; Pisidian 3, 175; 

Sidetic 3, 175 

Anatolian verbs: about 184-5; active inflection 
185-8; medio-passive inflection 188; non-finite 
forms 190; verb stem formation 189 
Ancient Greek 13, 21, 33, 49, 55, 120, 134, 145, 
147, 155, 165,238,317, 321,323 \ see also 
Greek; Homeric 
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‘bride’ 436; n- stems 436; paradigm of hayr 
‘father’ 437; r-stems 437; vocalic stem classes 
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(*CHC) 430; laryngeals 429-30; metathesis 
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y-clusters 433 
Aromanian 545 
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Attic Greek (Att.) 291, 294, 296-300, 304, 307, 
314,317 

Attic-Ionic Greek (AI) 4, 287, 289, 302, 306-7, 
313; see also Proto- Attic-Ionic Greek 
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Austro- Asiatic 155 

Avestan (Av.) 4, 13, 18, 21; Late 13, 15-17, 19, 
283; see also Gatha- Avestan 
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Bactrian 266 

Balkan Indo-European 558; see also 
Indo-European 

Balkan Romance 558, 579, 599; see also 
Romance 

Balkan Slavic 544, 579; see also Slavic 
Balkan Turkish 1 ; see also Turkish 
Baltic (Balt.) 5, 389, 454, 499, 503, 521, 522, 

529, 558; about 486-7; adpositions 514; 
early texts 487-8; East Baltic dialects 488-9; 
extinct languages 486; further reading 515; 
map of Baltic tribes 487; ruki rule 497; syntax 
512-15; verbs 486-7; Wackemagel’s Law 
514-15; word order 513-14; see also Baltic 
morphology; Baltic nouns; Baltic phonology; 
Baltic prosody; Baltic verbs; Balto-Slavic; 
Common Baltic; East Baltic; Latvian; 
Lithuanian; Old Prussian; Proto-Baltic; West 
Baltic 

Baltic Finnic 389, 486, 494, 498, 516n6; see also 
Finnish/Finnic 

Baltic morphology: adjectives 503-5; nouns 
499-503; numerals 505-6; pronouns 506-8; 
verbs 508-12 

Baltic nouns 499-503; declension of a-stem 
nouns 502; nominal derivation 503; nominal 
inflection 499; nominal stem types 500-1; 
sample o-stem case endings 501-2 
Baltic phonology: accent paradigm 

correspondences 526; accentual paradigms 527; 
consonant + *i sequences 498-9; consonants 
496-9; palateovelars 498; vowels 489-91 


Baltic prosody: Endzelin’s Law 494-5; 

Leskien’s Law 493; Saussure’s Law 493; tonal 
correspondences 495-6; tonal oppositions in 
Latvian 494; tonal oppositions in Lithuanian 
492-3; tonal oppositions in Old Prussian 495 
Baltic verbs: aspect 512; athematic presents 
509; diathesis and ablaut 510; mood 510-11; 
participles 511-12; preterit 510; semi-thematic 
verb type 509-10; thematic verb type 509; 
verbal derivation 512; verbal inflection 508 
Balto-Slavic (BS1.) 5, 6, 389, 428, 455, 6, 

479-83, 496, 506; accentology 481-2, 486n5, 
486n7, 486n8; circumflex tone 479; palatalized 
velars 21; voiced aspirated stops 18, 19; 
see also Baltic; Proto-Balto-Slavic; Slavic 
Basque 1, 157, 160, 164-5, 356 
Batsbi (Tsova-Tush) 1 64 
Bavarian 397 

Belorussian/Belarusian (BRuss.) 5, 519, 524, 547; 

syntax 545; see also Slavic 
Bengali 1,4,214,215,217 
Boeotian Greek (Boe.) 287, 293-5; see also 
Greek 

Bosnian 519; see also Slavic 
Bosnian/Croatian/Montenegrin/Serbian 

(BCMS = Serbo-Croatian) 5, 527; dialects 557; 
phonology 16-17 
Brabantic 397 

Breton (Bret.) 4, 352, 361, 364, 378; see also Old 
Breton; Primitive Breton 
British 2, 352, 358, 364-5, 367, 389; see also 
Brythonic; English 

Brythonic (Brittonic/British) 4, 352-3, 357-8, 
362-3, 367-8, 370, 375-6; Gallo- 358; 
phonology 364-5; vocalism 364-5 
Bulgarian (Bulg.) 5, 519, 522, 543-4, 546; 

dialects 557; syntax 545; see also Slavic 
Bulgaro-Macedonian 558, 600; see also 
Bulgarian; Macedonian 
Burgund 5 

Burmese, Inscriptional 389 
Burushaski 156, 164, 208 
Byzantine/Medieval Greek 4 

Cakavian Croatian (Cak.) 527, 538, 544 
Cappadocian 287 
Carian 3, 172, 175 
Catalan 4 

Caucasian 32, 153, 155-161, 164, 166-7, 205 
Celtiberian (CIb.) 4, 352, 357, 359-60, 362, 364, 
367-8, 377 

Celtic (Celt.) 3, 4, 321, 349-50, 353-4, 389, 

454, 456, 529, 596; further reading 377-8; 
morphology (nominal) 367-70; morphology 
(pronouns) 370-1; origins 354-6; P Celtic 
357-8, 358, 363; phonology 15, 18-19, 

21, 361-4; position within IE 356-7; Q 
Celtic 357-8, 363; subdivisions of 357-61; 
syntax 377; word formation 376-7; see also 
Brythonic (Brittonic/British); Celtiberian; 
Celtic verbs; Common Celtic; Continental 
Celtic; Gaelic (Goidelic/Irish); Insular Celtic; 
Proto-Celtic 
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Celtic verbs: active preterit stem 376; future stem 
375-6; inflection 371-2; personal endings 
372-3; present stem 374-5; subjunctive 
stem 375 

Cham dialect (Albanian) 50, 567 
Chechen 159, 160 
Chinese 2, 7 
Chinese Qiuci 452 
Chinese Yanqi 452 
Choresmian 266 

Church Slavic (CS) 17; see also Late Church 
Slavic; Old Church Slavic; Slavic 
Chukchi 164 

Chukchi -Kamchatkan 153 
Circassian 156 

Classical Latin (Class. Lat.) 28, 33, 50-1, 322-6, 
329-38, 341, 343-4, 346-8; see also Latin 
Common Baltic (CBalt.) 483, 486; vowels 489; 
see also Baltic 

Common Iranian 264, 266, 269-71 
Common Slavic (CS1.) 16, 482, 489, 521-2, 530, 
533-4; see also Slavic 

Continental Celtic 4, 352, 353, 358, 362, 368, 

370, 376-7; see also Celtic 
Continental West Germanic 407; see also 
Germanic 

Cornish (Com.) 4, 352, 378; Primitive 361, 364 
Cretan (Cret.) 29 1 

Crimean Gothic 388; see also Gothic 
Croatian (Croat.) 3, 519, 522, 545; see also 
Kajkavian Croatian; Serbo-Croatian; Slavic 
Cuneiform Luwian (CLuw.; Luvian) 3, 173-4, 
178-9, 181-3; phonology 16, 21; see also 
Luwian 

Curonian 486, 489 
Cymraeg see Welsh 
Cymry 353 

Cypriot (Cypr.) 291, 294, 314 
Czech (Cz.) 2, 3, 5, 519, 522; Doudleby dialect 
522; Old 17; phonology 15; syntax 545; 
see also Slavic 

Dacian (Dac.) 5 
Daco-Misian 555 

Daco-Romanian 599; see also Romanian 
Daic 155 

Danish (Dan.) 3, 394; Old 406 
Dardanian 555 
Dardic 3 
Dargi 159 

Doric Greek (Dor.) 4, 287, 294, 317; see also Greek 
Doudleby dialect 522 

Dravidian 1, 7, 153, 155, 157, 159-60, 165, 167, 
208-9,211 

Dutch 3, 5, 393, 403, 405-7; Modem 408; 
phonology 19; see also Old Low Franconian 

Early Modem Nuremberg 397 
East Baltic (EBalt.) 5, 482, 486, 491, 495, 
499-502, 505-9, 511-12, 514; dialects 488-9; 
vowels 490; see also Baltic 
East Germanic 5, 29, 388; see also Burgund; 
Germanic; Gothic; Vandal 


East Greek (EGr.) 287, 306-7; see also Attic-Ionic 
Greek; Arcado-Cypriot Greek; Greek 
East Norse 406; see also Old Danish; Old 
Gutnish; Old Swedish 

East Lithuanian (ELith.) 500; see also Lithuanian 
East Pmssian Lithuanian 488, 493; see also 
Lithuanian 

East Slavic 519, 522, 524-5, 533, 539, 544-6, 
557; see also Slavic 
Eastern Armenian 42 1 
Eastern Middle Iranian 266, 283 
Eastern (Balkan) South Slavic 545 
Egyptian 7 
Empire Luwian 173 

English (Eng.) 2, 5, 7, 394, 403, 558; American 
400; aspirated stops 15; compared to Italian 
8; compared to Spanish 8; Liverpool 394; 
phonology 15—19, 21; see also British; Modem 
English; Old English 
Epic Sanskrit 242; see also Sanskrit 
Eskimo-Aleut 153 
Estonian (Est.) 1 
Etmscan 317, 321,324, 329 

Faliscan (Fal.) 4, 317 
Faroese 388, 397-8 
Farsi see Persian/Farsi 
Finnish/Finnic (Finn.) 1, 159, 389, 522 
Flemish 397 

French (Fr.) 4, 7, 321, 558; Old 389 
Frisian 5, 387 

Gaelic (Goidelic/Irish) 352, 357-8; Irish 4, 352; 
phonology 365-7; see also Middle Irish; 

Old Irish 
Galatian 352, 359 

Gallo-Brythonic 358; see also Brythonic 
Gatha-Avestan (GAv.) 16, 18, 65, 97, 101; 
see also Avestan 

Gaulish (Gaul.) 3, 4, 13, 352-3, 358-61, 364, 
367-9, 371-2, 377-8; phonology 16-19 
Georgian 7, 18, 159, 164 

German (Germ.) 3, 5, 8, 407, 596; High 393; Low 
5, 16; Middle High 398, 402; phonology 16; 
see also Modem German; Old High German 
Germanic (Gmc.) 4-5, 357, 359, 364, 389, 403, 
405-6, 454-5, 483, 488-9, 522, 529; about 
387-90; accent shift 396; consonantal shifts, 

18, 20, 392; Germanic foot 398; morphology 
389, 398-403; noun classes 401; palatalized 
velars 21; pre-Proto- 20; prosody 396-8; 
sub-groupings of 388; syntax 405-12; 
word formation 403-5; see also Germanic 
morphology; Germanic phonology; Germanic 
syntax; Proto-Germanic; West Germanic 
Germanic morphology: inflectional 398-400; 
nominal system 400-3 

Germanic phonology 16-21, 394-8; consonants 
391-4 

Germanic syntax: clausal 407-8; definites 

408-9; rise of V2 406; tense, aspect, and mood 
410-12; V2 over time 406-7; word order 
405-6 
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Gheg dialect 5, 552-4, 560, 567; dialectical 
variations, verbal 592-3; nominal differences 
575-6; stress patterns 572; see also Albanian 
Gilyak 153 
Goidelic 4 

Gothic (Goth.) 3, 5, 7, 13, 387-9, 396-7, 

401-2, 406-7, 409; consonantal shifts 20; 
Crimean 388; phonology 15, 17-19, 21; and 
Tocharian 456 

Great Attic Greek 287; see also Attic Greek; 

Attic-Ionic Greek; Greek 
Greek (Gr.) 2, 3, 4, 6, 287, 321, 389, 421-3, 
428-30, 447, 454, 456, 481, 526, 545, 558; 
accent 296-7; Albanian borrowings 557-8; 
Arcado-Cypriot (AC) 4, 287, 301, 308; 
aspirated stops 18-19; Attic (Att.) 291, 294, 

296, 297, 298-300, 304, 307, 314, 317; 
Attic-Ionic (AI) 4, 287, 289, 302, 306-7, 313; 
Byzantine/Medieval 4; consonantal shifts 20; 
dialects of 317; East 287, 306-7; Grassman’s 
Law 19; Great Attic 287; influence on Latin 
349; Ionic 290, 294, 317; Koine 4, 287, 291, 

297, 299, 311, 313; map 288; Modem 4, 

297, 558, 595; mood, complementation and 
subordination 311; morphology (adjectives 
and adverbs) 302-4; morphology (nominal) 
298-302; morphology (pronouns) 304-5; 
morphology (verbal) 305-10; prepositions 
313; Proto- 292, 301; references to Keltoi 
and Galatae 354; syntax 311-14; Thessalian 
(Thess.) 287, 294, 297; West 287, 307, 313; 
word formation 313-14; see also Ancient 
Greek; Greek phonology; Homeric 

Greek phonology 15-21, 291-2, 295-297; 
diphthongs 293; IE fricative 293; labiovelars 
292; laryngeals 292; semivowels 292-3; vowel 
systems 293-4 
Griko 287 

Hamitic 356 
Hattie 164 
Hebrew 2, 3, 166 

Hellenistic Greek see Koine Greek 
Hieroglyphic Luwian (HLuw.) 3, 21, 173 
High German 393; see also German; Old High 
German 

High Latvian (Selonian and Latgalian) 489, 494; 

see also Latvian 
Hindi/Urdu 1, 4, 217 

Hittite (Hitt.) 3, 13, 171-3, 192, 357; Old 178; 

phonology 15-17, 19, 21 
Homeric (Horn.; Homer) 63, 96-7, 112, 117, 135, 
145, 157, 167, 287, 290, 291-2, 298-9, 301-2, 
304-5, 308-10,313-14 
Hungarian 1, 522 
Hurrian 164, 205 
Hurro-Urartian 205 

Icelandic 397, 401; see also Old Icelandic 
Illyrian (Illyr.) 5, 555-6 
Indie see Indo-Aryan 

Indo- Aryan (Indie) 3, 160, 264; ablaut alternation 
230-2; chronology of 214-17; classification 


of root morphemes 232-3; dentals 227-8; 
diphthongs 222, 223-5; further reading 252-5, 
252-6; general information 214; genetic 
classification 217-21; Grassman’s Law 19; 
labials 228; Middle Indie 4; morphology 
(noun) 233-8; morphology (verb) 238-48; 
morphophonology 229-33; morphotactics 
229-30; New Indie 4; palatals 226-7; 
phonemes 223-6; phonology 15, 18-19, 21, 

221- 9; plosives 225-6; retroflex 227; sibilants 
and h 228-9; sonants with vocal variants 

222- 3; syntax 249-50; and Tocharian 456-7; 
typological features 250-2; typological features 
of later languages 250-2; velars 226; vowels 
221, 223-5; see also Indo-Iranian; Proto-Indo- 
Aryan 

Indo-European (IE) 7; further reading 8; and 
Germanic stops 392; Italic 317-18; Late 
Western 357; morphology 434; phonology 
563-4; and Tocharian languages 453-7; verbs 
539; see also Proto-Indo-European 
Indo-Iranian (Hr.) 3-4, 6, 269, 389, 428, 

456; diachronic typological perspective 
208-11; further reading 211; lexicon 207 - 8 ; 
morphology 207; origins 205; phonology 21, 
206; see also Proto-Indo-Iranian 
Ingush 157 
Ingvaeonic 389 
Inscriptional Burmese 389 
Insular Celtic 4, 352-3, 357-8, 363-4, 367-8, 
370, 372, 377; see also Celtic 
Ionic Greek (Ion.) 290, 294, 317; see also Greek 
Ionic- Attic Greek see Attic-Ionic Greek 
Iranian (Iran.) 421, 430, 441, 446-7; adjectives 
276; Armenian and 422; Avestan 263; 
consonantism 268; dental affricates 271; 

Eastern Middle 266; first palatal series 
271; further reading 283-4; Modem 
263, 277; morphology 266-72, 274-6; 
morphophonology 272-4; New 4; Old Iranian 
4, 272, 283; Old Persian 263; phonology 18, 

19, 457; pronouns 276-7; stems with ablaut 
variation 275; syntax 282-3; and Tocharian 
456-7; verbs 277-81; word-formation 281-2; 
see also Avestan; Indo-Iranian; Middle Iranian; 
Persian/Farsi; Proto-Iranian 
Irish Gaelic 4, 352; see also Gaelic; Middle Irish; 
Old Irish 

Italian (Ital.) 3, 4, 7, 321, 558; compared to 
English 8 

Italic (It.) 3, 4, 6, 454, 456; epigraphy 350; within 
Indo-European 317-18; languages of ancient 
Italy 317; maps 318-19; phonology 18-21; 
verb system 336; see also Proto-Italic 
Italo- Albanian 554, 558 
Italo-Celtic 319-20, 357 

Japanese 85, 159, 165, 172 
Jatvingian 486 

Kafiri 264 

Kajkavian Croatian 519, 527, 538, 542; see also 
Croatian 
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Kartvelian 7, 153, 155-6, 162, 166 
Kashubian 519 
Ket 155, 156 

Khotanese (Khot.) 263, 266, 271, 274, 282, 

453, 457 

Kizzuwatna Luwian 173; see also Luwian 
Koine Greek 4, 287, 291, 297, 299, 311,313; 

see also Greek 
Korean 32 
Kucha 452 
Kurdish 1, 4 
Kurzeme 494 
Kusi 452 

Lab dialect 552 
Laconian 29 1 
Lak 163 

LateAvestan 13, 15-17, 19, 283; see also Avestan 
Late Church Slavic (LCS1) 523; see also Church 
Slavic 

Latgalian 489 

Latin (Lat.) 2, 3, 4, 7, 8, 13, 21, 317, 596; accent 
328-9; alphabets 324-5; borrowings into 
Albanian 557, 572, 597; case endings 329-33; 
conjugation classes 337; consonantal shifts 20; 
declensions 331-3; diachronic developments 
339-40; diachronies 326-8, 334; documentary 
evidence 323-4; lexicon and word formation 
349-50; medieval and later 322; morphology 
(nominal and pronomial) 329-33; morphology 
(verbal) 336-43; morphophonology 339; 
non-finite verb forms 341-2; origins of 
320; phonology 13, 15-17, 19, 21, 325-6; 
references to Keltoi and Galatae 354; and the 
Romance languages 321-2; spread of 321; 
varieties of 322-3; verb system 336; see also 
Classical Latin 
Latino-Faliscan Italic 4 
Latvian (Latv.) 5, 13, 482, 488, 495, 498; 
adjectives 503-4; dialects 489; glottalic acute 
481; Low 489; Old 65; palatalized velars 21; 
tonal oppositions in 494; verbs 515n3; vowels 
491-2; see also Baltic 
Lepontic (Lep.) 4, 357, 359, 362-4, 368 
Lesbian (Lesb.) 287, 294, 297, 302 
Letts 489 
Lezgian 159 
Ligurian 5 

Lithuanian (Lith.) 3, 5, 13, 82, 86, 94, 102, 139, 
160-1, 480-2, 486-8, 495-6, 498, 526; accent 
55, 494; adjectives 503-4; diphthongs 496; 
nominal stem types 500-1; phonology 15-19, 
21, 28-9, 33, 35-6, 38, 41, 46, 48-50, 498; 
tonal opposition in 492-3; vowels 490-1; 
Zemaitic dialect 481, 488-9, 496, 508; see also 
Baltic 

Liverpool English 394; see also English 
Low German 5, 16; see also German 
Low Latvian 489; see also Latvian 
Lower Sorbian (LSorb.) 519 
Lusitanian 3, 5, 21 

Luwian (Luw.) 3, 13, 21, 22, 25-7, 171-88, 

190, 194 


Lycian (Lyc.) 3, 21, 22, 26, 27, 171-2, 174-82, 
185-8, 190-2 

Lydian (Lyd.) 3, 171-2, 175, 177-82, 186, 188, 
190-2 

Macedonian (Mac.) 5, 288, 519, 522, 544-7; 

dialects 557; see also Slavic 
Magadhi 4 
Maharashtri 4 
Malayalam 1 
Maltese 1 
Mandarin 155 

Manichean Middle Persian (ManMPers) see 
Middle Persian; Persian/Farsi 
Manx 4 
Marathi 4 
Median 264 

Messapian/Messapic (Messap.) 5, 321, 357, 555-6 
Middle Armenian 42 1 ; see also Armenian 
Middle Breton (MBret.) see Breton 
Middle Dutch (MDutch) see Dutch 
Middle High German (MHG) 398, 402; see also 
German; Old High German 
Middle Indic/Prakrits 4; see also Indo-Aryan 
Middle Indo-Aryan (MIA) 250-1 
Middle Iranian (MIran.) 4, 263, 266, 272, 277, 

28 1 ; see also Iranian 

Middle Irish (Mir.) 367; phonology 16, 17, 19; see 
also Irish Gaelic 

Middle Parthian (MParth.) 422; see also Parthian 
Middle Persian (MPers.) 266, 422; see also 
Persian/Farsi 

Middle Welsh (MW) 15, 17, 352-3, 371, 372, 

376; see also Welsh 
Milyan (Lycian B) 3 

Modem Albanian 553, 558, 560; see also 
Albanian 

Modem Dutch 408; see also Dutch 
Modem English 399, 403-5; see also English 
Modem German 399, 404, 407-8, 410; see also 
German 

Modem Greek 4, 297, 558, 595; see also Greek 
Modem Icelandic 401; see also Icelandic 
Modem Iranian 263, 277; see also Iranian 
Modem Standard Albanian 553; see also 
Albanian; Modem Albanian 
Moldovan 546 
Mongolic 7 

Montenegrin 519, 545; see also Slavic 
Munda 160, 208 
Myanmar 155 

Mycenaean (Myc.) 29, 291-2, 294, 298, 301-2, 
304, 306, 308,313-14 

Nakh-Daghestanian 205; see also Caucasian 
Navajo 18 
Nenets 159 
Nepali 1,217 

Netherlandic 396; see also Dutch 
New Indie 4; see also Indo-Aryan 
New Iranian 4; see also Iranian 
New Persian/Farsi 4, 266, 267; nominal inflection 
274; see also Persian/Farsi 
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New Phrygian (NPhryg.) see Phrygian 
Nivkh 156 

Non-Slavic Balkan 43 
North Germanic 388-9, 396, 399, 405 
North Sea Germanic 389 
North-West Block language 5 
North-West Greek (NWGr.) 287 
Norwegian (Norw.) 398; Old 397, 406 
Nostratic language family 153 
NuristanI languages 208, 264 

Old Albanian period 558, 560 

Old Avestan (OAv. = Gatha-Avestan) 13, 249, 

265, 277, 283-4; see also Old Iranian 
Old Bavarian 394 

Old Breton (OBret.) 354, 361 ;see also Breton; 

Primitive Breton 
Old Chinese 2 

Old Church Slavic (OCS) 5, 7, 13, 53, 66, 82, 

86, 95-6, 102, 104, 482, 519, 521, 547; dative 
absolute 166; phonology 15-19, 21; pronouns 
538; syntax 543, 544; see also Slavic 
Old Czech (OCz.) 17; see also Czech 
Old Danish 406; see also Danish 
Old English (OEng.) 5, 13, 387, 396, 401-2, 
405-7; consonantal shifts 20; high vowel 
deletion 398; phonology 16-17, 19 
Old Frankish 389 
Old French 389; see also French 
Old Frisian 5, 387 
Old Gutnish 406 

Old High German (OHG) 5, 13, 387-9, 394, 
396-8, 402, 404, 406-9, 411; phonology 19; 
see also German 

Old Hittite (OHitt.) 178; see also Hittite 
Old Icelandic (OIc) 16, 388, 406; see also Icelandic 
Old Indic/Indo- Aryan/Indian (OIA) 4, 13, 240, 
250-4, 263, 266-7, 268-9, 277, 282-3; 
phonology 21; see also Sanskrit; Vedic 
Old Iranian 4, 272, 283; see also Iranian; Old 
Avestan 

Old Irish (OIr) 7, 13, 352-3, 365, 367-71, 

376-8, 410; inflectional system 371-2; 
phonology 15-19, 21; see also Irish Gaelic 
Old Latin (OLat.) 49-50, 118, 322, 325-6, 330, 
334-5, 337, 339; see also Classical Latin; Latin 
Old Latvian (OLatv.) 65; see also Latvian 
Old Lithuanian (OLith.) 500, 514-15; see also 
Lithuanian 

Old Low Franconian (Old Dutch) 5; see also 
Dutch 

Old Norse (ONor.) 5, 357, 387, 388, 396, 401-2, 
404, 407-9; phonology 17-18 
Old Norwegian 397, 406; see also Norwegian 
Old Persian (OPers.) 3, 4, 13, 263, 264, 266-7, 
269, 271, 281, 283-4, 421; morphophonology 
272-4; noun cases; pronouns 277; see also 
Persian/Farsi 

Old Prussian (OPruss.) 5, 13, 456, 481-8, 498-9, 
501, 515; tonal oppositions in 495; phonology 
15, 19; see also Baltic 
Old Russian (ORuss.) 17; see also Russian 
Old Saxon/Saxonian (OS) 3, 5, 387, 401-2 


Old Swedish 406; see also Swedish 
Old Welsh (OW) 354, 361, 370; see also Welsh 
Old Welsh, Old Cornish and Old Breton (OWCB) 
353,361,364-5 
Oriya 217 
Oscan (Osc.) 4 

Osco-Umbrian Italic (Sabellian) 4, 24, 25, 357; 

see also Sabellic languages 
Ossetic 266 

Palaic (Pal.) 3, 27, 171-3, 178-82, 185-6, 

188, 190 

Pali 4, 36,215,253,255 
Pamphylian (Pamph.) 33, 84, 175, 288 
Panjabi 217 

Pannonian Slavic 519; see also Slavic 
Parthian 266, 273, 422, 427; Middle 422 
Pashto 1, 4, 208, 266, 273 
Pehlevi 4 

Peloponnesian 287 

Persian/Farsi (Pers.) 1,3, 266, 410; Albanian 
borrowings 558; Middle 267, 422; New 4; 
see also Iranian; Old Persian/Farsi 
Phrygian (Phryg.) 3, 5, 6, 171, 422, 429, 558 
Pisidian 3, 175 
Polabian 519 

Polish (Pol.) 5, 512, 519, 547; syntax 545; 

see also Slavic 
Pomeranian 519 
Pontic 287 
Portuguese 4, 321 
Prakrits 4, 25,215-17, 251 
Prekmurje Slovene 519, 538, 544; see also Slovene 
pre-Proto-Germanic 20; see also Proto-Germanic 
Primitive Breton 361, 364; see also Breton 
Primitive Cornish 361, 364; see also Cornish 
Primitive Irish 359, 361, 366; see also Irish Gaelic 
Primitive Welsh 361, 364; see also Welsh 
Proto- Albanian 557-60 
Proto- Anatolian (PAnat.) 49, 171, 190; accent 
177; compared to Proto-Indo-European 194; 
consonants 176-7; lexicon 193-4; vowels 176 
Proto- Attic-Ionic 294-6 

Proto-Baltic (PBalt.) 482-3, 496, 521; consonants 
496-9 

Proto-Balto-Slavic (PBS1) 22, 521 
Proto-Celtic (PCelt.) 18, 24-5, 33-6, 40, 41, 
47-8,51, 171 

Proto-Germanic (PGmc) 20, 24-5, 30-1, 38, 46, 
49-50, 53, 55, 72-4, 79, 85, 171, 388-9, 391 
Proto-Greek 24-5, 291-4, 299-301 
Proto-Indian 264 
Proto-Indo- Aryan 208, 214, 457 
Proto-Indo-European (PIE): and language 
typology 153-68; orthography 14; see also 
Indo-European (IE); see also Proto-Indo- 
European morphology; Proto-Indo-European 
phonology; Proto-Indo-European syntax 
Proto-Indo-European laryngeals 29, 30, 46-7; 
*#HC- (initial pre-consonantal laryngeal) 

47-8; *Ch,C 46; *Ch 2 C 46-7; *Ch 3 C 47; *eh { 
45; *eh 2 45-6; *eh 3 46; *h,C- 47; *h,e 44; 

*h 2 C- 47-8; *h 2 e 44; *h 3 C- 48; *h 3 e 44-5; 
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*HV (laryngeal + vowel) 43-4; influence on 
neighboring vowels 43; laryngeal metathesis 
52-3; resonants and 36-9; syllabic 52 

Proto-Indo-European morphology 158-9, 163; 
ablaut 61-2; adjectives 79-80; adverbs 

80- 1; conjunctions 105; further reading 
106-7; interjections 106; numerals 89-91; 
particles 105-6; postpositions 104-5; see 
also Proto-Indo-European nouns; Proto-Indo- 
European pronouns; Proto-Indo-European verbs 

Proto-Indo-European nouns 78-9; athematic 
case endings 69-70; athematic stems 62-3, 
67-77; athematic stems (/-stems) 72-3; 
athematic stems (M-, 1-, laryngeal, and 
plosive) 77; athematic stems (w-stems) 76-7; 
athematic stems (r-stems) 74-6; athematic 
stems (w-stems) 73-4; case endings 63; case 
functions 139-143; case syncretism 143-4; 
case system 158-62; compensatory lengthening 
in athematic stems 69-70; declension suffixes 
62-4; genders of 62, 159-60, 163; heteroclitic 
stems 77-8; /-stems 72-3; M-, /-, laryngeal, 
and plosive stems 77; n- stems 76-7; r-stems 
74-6; neuter 63; root nouns 70-2; 5-stems 
78-9; thematic stems 62—7; thematic stems 
(Eh 2 -stems) 62, 66-7; thematic stems (o-stems) 
62-6; w-stems 73-4 

Proto-Indo-European phonology 56, 318; 
accentual paradigms 526-7; allophones 13; 
clusters 52-4; diphthongs 564; fricatives 
29-30; further reading 56-7; phonemes 13; 
reconstructed system 13; rules 54-5; stops 
13-14, 28, 153, 171, 176; three velar series 
in 26-7; Tocharian and 454-7; vowels 40-3; 
see also Proto-Indo-European laryngeals; 
Proto-Indo-European resonants and glides 

Proto-Indo-European pronouns: demonstrative 
85-8; indefinite 88; interrogative 88; personal 

81- 4; possessive 84-5; pronomial adjectives 
89; reflexive 84; relative 88-9 

Proto-Indo-European resonants and glides 30-1; 
asyllabic 31-4; syllabic 34-6; syllabic and 
laryngeals 36-9 

Proto-Indo-European syntax 111-15; absolute 
constructions 138-9; argument structure 
144-5; case functions 139-44; discontinuous 
constituents 116-20; further reading 146; 
positions of the verb 123-38; Wackemagel’s 
Law 120-3; word order 115-16 

Proto-Indo-European verbs 91-2, 468; absolute 
constructions 138-9; aorist 95-7; imperative 
103-4; middle (mediopassive) 99-100; 
non-finite verb forms 104; optative 102; perfect 
97-9; present 92-5; reconstruction of 91-2; 
subjunctive 100-1 

Proto-Indo-Iranian (Pllr) 205-8, 210-11, 223, 
229, 253-4 

Proto-Iranian 208, 214, 264, 269, 457 

Proto-Italic 19, 25, 328-9, 335-6, 349 

Proto- Sabellic 328 

Proto-Semitic 57 

Proto-Slavic (PS1.) 16, 17, 55, 456, 483, 521, 523, 
527, 547 


Proto-Tocharian (PToch.) 454, 456-7, 460, 463, 465 
Punjabi 1,4,214, 252 

Qarashehr 452 
Qiuci 452 
Quechua 18 

Raetic 321 

Romance (Rom.) 317, 389, 393, 454 
Romanian 4, 160, 545-6, 556-7, 559, 579-80, 
599-600 

Rumanian 32 1 , 600 

Runic 388-9, 403,405-7 

Russian (Russ.) 3, 5, 7, 396, 410, 519, 522, 

524, 558; syntax 545; see also Old Russian; 
Slavic 

Russian Church Slavic (RussCS) 483, 497, 516; 

see also Old Church Slavic 
Rusyn 519 

Sabellian (Osco-Umbrian) Italic 4, 24, 25, 357 
Sabellic languages 13, 317, 321, 329, 335, 341-3, 
349; see also Proto-Sabellic 
Saka (Scythian) languages 3, 264, 271, 453 
Samogitian see Zemaitic dialect 
Samoyedic 159 

Sanskrit (Skr.) 3, 4, 13, 241-2, 263, 393, 458, 
470-1, 481; Agni 452; Epic 242; grammar 253; 
and Tocharian 457; Vedic 241-2; voiced stops 
17; see also Old Indie 

Scandinavian languages 5, 389, 396; see also 
Danish; Finnish; Norwegian; Swedish 
Scots 8 

Scottish Gaelic 4, 352; see also Gaelic 
Scythian (Saka) 3, 264, 271, 453 
Selonian 486, 489 

Semitic 7, 57, 90, 157, 159, 162, 166, 356 
Serbian 5, 55, 519, 545; dialects 533, 558; 
see also Slavic 

Serbo-Croatian (SCr.) 5, 481, 527; dialects 557; 
phonology 16-17; see also Bosnian/Croatian/ 
Montenegrin/Serbian 
Shauraseni 4 
Shughni group 266 
Sidetic 3, 171, 175 
Silesian 519 

Sinhala 210, 217, 250, 251 
Sino-Tibetan 153, 155, 159-60 
Slavic (Slav.) 5, 269, 393, 454, 481-2, 486, 489, 
503, 514-15, 558; about 519-20; Albanian 
borrowings 557, 597; deverbatives from CSI 
543; disintegration of 522; Eastern (Balkan) 
South 545; history and migrations 521-2; 
map 520; metatony 482; micro-languages 
519; Pannonian 519; periodization 521; South 
524; syntax 543-6; word formation 542-3; 
see also Balto-Slavic; Common Slavic; East 
Slavic; Proto-Slavic; Slavic morphology; Slavic 
phonology; Slavic verbs; Western Slavic; 
Western South Slavic 

Slavic morphology: adjectives 537; adverbs 539; 
consonantal stem endings 536; indeclinables 
539; nominal declension 535-6; nouns 533-4; 
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pronouns 537-9; vocalic stem endings 536; 
vocalic stems 534-5 

Slavic phonology: accentual paradigms 526-7; 
cluster simplification 529; consonants 528-33; 
contraction 525; loss of final consonants 529; 
loss of post-consonantal j (‘deiotation’) 531-3; 
monophthongization of diphthongs 523-4; 
prosthesis 530; suprasegmentals 525-8; 
syllabic synharmony and palatalization 524; 
velar palatalization 530-2; vowels 522-5; 
Winter’s Law 527-8 

Slavic verbs: ablaut, tense, and aspect 541-2; 
and Indo-European verb morphology 539; 
invictions of non-past finite verb forms 540; 
verb form classes 540- 1 
Slovak (Slk.) 5, 519, 547; syntax 545; see also 
Slavic 

Slovene (Sin.) 5, 519, 522, 547; Prekmurje 519, 
538, 544; syntax 545; see also Slavic 
Slovincian 519 

Sogdian (Sogd.) 4, 263, 266, 274, 281, 284 
Sorbian, Upper and Lower 519, 547; see also 
Slavic 

South Picene 317, 341 

South Slavic 524; see also Slavic; Western South 
Slavic 

South-West British 352; see also Breton; Cornish 

Spanish (Sp.) 4, 8, 20, 32 lex 

Standard Albanian 558; see also Albanian 

Standard Average European 558 

Stokavian (Stok.) 33, 519 

Sumerian 164 

Swabian 388 

Swedish (Sw.) 3,8; Old 406 
Syriac 421, 423 

Tajik 1 
Tamian 489 
Tamil 1 
Thak 155 

Thessalian Greek (Thess.) 287, 294, 297; see also 
Greek 

Thracian, 5, 21, 359, 555-6 

Tocharian (Toch.) 3, 5, 6, 20, 287, 357, 431, 

437, 596; about 452-3; characteristics of 
453-4; development of PIE vowels in 455; 
further reading 474-5; map 453; morphology 
461-9, 461-9; place within Indo-European 
455-8; phonology 15-19, 21, 454-5, 458-61; 
syntax 471-4; word formation 470-1; see 
also Proto-Tocharian; Tocharian A, B, and C; 
Tocharian morphology 
Tocharian A 5, 13, 452-3 
Tocharian B 5, 13, 452-3, 457, 472 
Tocharian C 453, 457 

Tocharian morphology: adjectival declension 464; 
adjectives 462-4; declension of ‘one’ 467; 
nominal declension 462-3; pronouns 464-7; 
verbs 467-70 


Tosk dialect 5, 552-3, 560, 567; Cham 592; 
dialectical variations, nominal 575-6; 
dialectical variations, verbal 592-3; stress 
patterns 572; see also Albanian 
Tsova-Tush (Batsbi) 1 64 
Tumshuqese 266, 453, 457 
Tungusic 7 

Turkic 1,7, 158, 208-10; see also Altaic 
Turkish 1, 165, 452-3; Albanian borrowings 558, 
572, 597 

Ubykh 155 

Ukrainian (Ukr.) 5, 519, 524, 545, 547; see also 
Slavic 

Umbrian 4, 15, 317, 321, 329, 341-2, 349 
Upper/Lower Sorbian (ULSorb.) 519, 547 
Uralic languages 7, 153 
Urdu/Hindi 1, 4 
Uyghur Kucha 452 
Uyghur Qarashehr 452 

Vandal 5 

Vannetais dialect 364 

Vedic (Ved.) 4, 13, 263; verbal system 238-9, 
243-5; phonology 15-21, 221; see also 
Indo-Aryan; Old Indie; Sanskrit; Vedic Sanskrit 
Vedic Sanskrit 241-2; see also Sanskrit; Vedic 
Venetic (Ven.) 5, 21, 318, 321 
Vidzeme dialect 489, 494 
Vietnamese 155 
Volga Finnic 486 

Welsh (W) 2, 3, 4, 352, 363, 368, 377-8; 

Primitive 361, 364; see also Middle Welsh 
West Baltic 5, 482, 486; see also Baltic 
West British 352; see also Welsh 
West Germanic 388-9, 394, 396, 399, 402, 
404-5, 407; see also Germanic 
West Greek (WGr.) 287, 307, 313; 
see also Greek 

West Norse 406; see also Old Icelandic; Old 
Norse; Old Norwegian 
Western Slavic (WSlav.) 519, 524, 531, 545; 
see also Slavic 

Western Armenian 42 1 ; see also Armenian 
Western South Slavic 545; see also Slavic; South 
Slavic 

Yanqi 452 
Yenisean 155 
Yiddish 393 

Young Avestan (YAv. = Late Avestan) 13, 15-17, 
19, 283; see also Avestan 
Younger Avestan 265 
Yup’ik 163 

Zeelandic 397 

Zemaitic dialect 481, 488-9, 496, 508 
Zemgalian 486, 489 
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ablative absolute 166, 346-7 
ablative case: Albanian 573, 575, 588, 598, 600; 
Anatolian 178, 180, 182-3, 191; Armenian 
435; Baltic 499, 501; Balto-Slavic 479, 482; 
Celtic 367-8; Germanic 400; Greek 301-2; 
Indo- Aryan 233, 250-1; Indo-Iranian 207, 210; 
Iranian 274; Italic 329—31, 333-6, 342; Latin 
143, 156; Proto-Indo-European 62-3, 81-2, 
138, 140-3, 158-9; Slavic 534, 542; Tocharian 
456, 462-3 

ablaut: aerodynamic declension types 68-9, 71, 
75, 76; absolute Albanian 573; alternation of 
230-2; amphidynamic declension types 68-9, 
72-4; Anatolian 178, 187, 189; Armenian 

434, 436-7; athematic declension types 
67-9; Baltic 490, 508, 510, 515; Balto-Slavic 
486n2; Celtic 369, 372; Germanic 391, 398; 
Greek 300-1, 303, 305, 306; Indo- Aryan 223, 
230-3, 239; Iranian 273-6, 278-81; Italic 
332; mesodynamic declension types 69, 72, 73; 
Proto-Indo-European 42, 55-6, 57nl0, 61-3, 
67-9, 71-8, 92-4, 96, 98, 102, 112, 155, 163; 
Slavic 539, 541-2; Tocharian 455, 458 

absolute constructions 138-9, 145-6, 166, 168, 
346-7, 373 

absolutives 163-4, 238, 250, 257, 303, 588; 

see also gerunds 
Acaryan, Hr. 423, 447 

accent: akrostatic 274; Albanian 595; Anatolian 
177; Armenian 425; Baltic 495, 504; Celtic 
361, 364, 365; circumflex 297; dynamic 
155, 273, 296-7, 328; Germanic 393, 396, 

398; Greek 296-7; Indo- Aryan 222-3, 232; 
Indo-Iranian 206; Italic 328-9; Latin 328-9; 
PIE 55-6; Proto- Anatolian 177; shift 396, 398, 
462; Slavic 482, 525-7, 547; synchronic 425; 
tonal 155,273,296-7,328 
accusative case: Albanian 573-6, 583, 599-9; 
Anatolian 177-81, 183, 190-2; Armenian 

435, 437-8; Baltic 499-502, 506-8, 513; 
Celtic 359, 368-72; Germanic 400, 402, 404; 
Greek 310, 312-13; Indo-Aryan 233, 249-52; 
Indo-Iranian 207, 210; Italic/Latin 329-33, 
335-6, 342-3,345-7; PIE 31, 62, 115, 136, 
140, 141, 144-5, 156, 158-60, 162-4, 166-7; 
Slavic 535; Tocharian 462-3, 465-6, 474 

Achaemenian period 4, 266 
acute tone 479-81, 486, 496, 502 
Adams, Douglas Q. 8, 453, 456, 474-5 
Adams, J.N. 344, 350 
Adiego Lajara, Ignacio- Javier 171 


adjectives: adnominal 470-4; agreement of 62, 
252, 462; Albanian 580-2, 598; Armenian 
437-40, 446; Baltic 503-5; Balto-Slavic 
480; Celtic 360, 368, 370, 374; comparison of 
80; denominal 480; deverbal 580; Germanic 
401, 403; Greek 301-4, 314; Indo-Iranian 
211; interrogative 465; Iranian 276-7, 281; 
Italic 329-31, 334; Latin 334; PIE 79-80; 
possessive 180, 470; pronomial 89, 183; 
Proto-Indo-European 62, 66-7, 73, 79-80, 89, 
111; relative 465; Slavic 537; Tocharian 453, 
455, 462-4, 472-4; see also participles (verbal 
adjectives) 

adpositions 143, 156-8, 250; Baltic 514; 

Proto-Anatolian 190-1 
Adrados, Francisco R. 194 
adverbial clauses 312-13; Latin 346-7 
adverbs: Albanian 555, 580-2, 598; Armenian 
434, 440; Baltic 499, 501, 512, 513-14; Greek 
293, 297, 302-4; Italic 345; local 191; PIE 
80-1, 157; pronomial 89; Proto- Anatolian 
190-1; Slavic 538-9; verbal 158; see also 
converbs 
Aemilia 317 
Afanasevo culture 456 

affricates 14-15, 21-3, 27, 268, 393, 396, 423, 
433, 454, 457, 530-3, 568, 571; dental 269, 
271,498 

Afghanistan 3-4, 263 
Africa 214 
Agathangelos 42 1 
Agbayani, Brian 123, 192 
agent nouns 180, 314, 332, 349, 427, 432, 441, 
445, 446 , 468,471,480, 503 
Ager Faliscus 317 
Agni, Kingdom of 452 
Agricola, Rodlophus (Roelof Huisman) 2 
Ahlqvist, Anders 372 
Aitzemiiller, Rudolf 547 
Albania 5, 552 
Albanian alphabet 559 
Albanian diaspora 552, 554, 558, 567 
allative case: Anatolian 190; Baltic 499; 

Greek 313; Lycian 180; Old Hittite 178; 
Tocharian 463 
Allen, W.S. 294,314,350 
allophones 13, 14, 29, 31-2, 42, 155, 206, 221, 
223, 269, 325, 359, 363-6, 566-7 
alphabets: Albanian 559; Armenian 421-2; 
Avestan 265-6; Cyrillic 519; derived from 
Greek 171; epichoric 289; Glagolitic 38, 519; 
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Greek 289-90; Latin 324-5, 387; Lugano 359; 
Ogham 361 

Altindische Grammatik (Wackemagel) 254 
ambigender 573 
Ambrazas, Vytautas 514 
Anatolia 171-2, 205; see also Asia Minor 
Andersen, Henning 521, 531, 541 
Andersen, 0. 290 
Anderson, Cormac 367 
Anderson, R K. 249 
Andronovos culture(s) 205, 456 
‘(the) animals (pi.) run (sg.)’ rule 
(id £a>a xpexei) 65 

anlaut 365, 565; absolute 365-7, 565 
Anthony, David 153 
Antonsen, Elmer 387, 405 
aorist 95-7, 162, 185, 279, 307-9, 376; Albanian 
587-60, 593, 595-6; imperative 443-4; 
indicative 442-3; reduplicated 97; sigmatic 
96; Slavic 542; subjunctive 444; thematic 
96-7,341 

apocope 187, 192, 360, 373, 425, 435, 515, 

565, 576 

Arberesh writing tradition 554 
areal patterning 153, 155, 158, 160, 163-4, 
167-8, 177, 211, 356, 358, 497, 544, 558, 

573, 599 

Aristophanes 29 1 
Arkadiev, Peter 515 
Armenia 5, 263 
Armenian alphabet 42 1 -2 
Armenian diaspora 42 1 
Armenian Republic 42 1 
Armenian sound shift 426 
Armorica 352 
Arnold, Matthew 353 

articles: Albanian 574, 580-4; Celtic 367, 370; 
definite 304, 370, 396, 403, 408-9, 464, 522, 
558, 579-80, 582, 596; enclitic postponed 440; 
Homeric 117; indefinite 403, 409, 414nl0, 

440, 558, 574, 600; postpositive 558, 574, 579, 
582; prepositional 582, 583; prepositive 580, 
584, 585 
Arumaa, P. 547 
Aryans 264 

Asia Minor 3, 4, 5, 171-2, 174, 287; see also 
Anatolia 

aspiration 14, 18, 19-21, 30, 45, 47, 57n8, 229, 
268, 287, 289, 293, 392-3, 423 
Assamese ergative 210, 249 
Atharva-veda 215, 217, 252-3 
Aukstatic areas 493 
Australia 214 
Avar Khaganate 521 
Avery, Peter 392 
Avestan orthography 265-6 
Axel, Katrin 132, 396, 397, 406, 407 
Azerbaijan 263 

Bactria-Margiana Archaeological 
Complex 208 
Baesecke, Georg 410 
Bakker, E. J. 314 


Baldi, Philip 350 

Balkan peninsula 287 

Balkan Sprachbund 558, 573, 599, 601 

Balkanindogermanisch 422 

Balkanisms 558 

Balkans 5, 171, 555 

Ball, Martin J. 377 

Bammesberger, Alfred 399, 413 

Bangladesh 214 

Barber, Elizabeth Wayland 474 

Barbosa, Maria do Pilar Pereira 132 

Bardhi, Frang (Franciscus Blanchus) 554 

Barddal, Johanna 145, 211 

Barrack, Charles 56 

Bartholomae, Christian 120, 283 

Bartholomae’s Law 268 

baryton 180, 189, 297, 486n8, 525-6 

Basel Epigram 487 

Bauer, Anna 128, 164, 192 

Baum, Daniel 242 

Bayer, Josef 397 

Beekes, Robert S.P. 106, 146, 163, 240, 284, 430 
Behagel, Otto 403 
Behistun (Bisutun) inscription 42 1 
Belarus 486 

Bendahman, Jadwiga 107 
Benelux 5 
Bengal 205 
Bergin’s Law 111, 377 
Bernik, France 547 
Bhatta, V. P 255 
Bhutan 214 

Biberauer, Theresa 396 
Binchy, D. A. 377 

Birchbark Letters 519, 531, 535, 544, 547 

Bird, Norman 357 

Bimbaum, Henrik 483, 547 

Bloch, Jules 254 

Blom, Alderik H. 353 

Bloomfield, Maurice 253 

Bocek, Vit 547 

Bohtlingk, Otto 254 

Boley, Jacqueline 1 84 

Bolognesi, Giancarlo 422 

Bonnet, Guillaume 557 

Bopp, Franz 3, 112, 479 

Boretzky, Norbert 558, 588 

Bousquette, Joshua 397 

Boxhom, Marcus van 2-3 

Braune, Wilhelm 400, 401-2, 404 

Brereton, Joel P 253 

Britain/British Isles 4, 321, 322-3, 355, 389 

Brittany 4, 352 

Brixhe, Claude 175, 288 

Broderick, George 356 

Bronze Age 320, 486 

Brugmann, Karl 26, 112, 479 

“Brugmann’s grade” 239 

Brugmann’s Law 231, 267, 273 

Bubenik, Vit 107, 168, 249, 250-1, 255 

Buccini, Anthony F. 396 

Buchanan, Scot George 353 

Buchholz, Oda 572, 573, 588, 603 
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Buck, C.D. 314 
Buddhist texts 215, 452, 454 
Bulgaria 5, 552 
Burrow, Thomas 253 
Buzuku, Gjon 554 

Cabej, Eqrem 603 
Caesar (Julius) 323, 343, 387 
Caland, Willem 126 
Caland System 79, 107 
Camaj, Martin 597, 599 
Campania 317 
Campanile, Enrico 357, 375 
Campbell, Lyle 8 
Canada 42 1 

cardinal numerals 89, 90, 157, 440, 466-7, 505, 
584-5, 600 

Cardona, George 107, 255 
Caria 175 
Carroll, Ryan 399 
Carruba, Onofrio 192 

case endings: Anatolian 178-80; Armenian 435; 
Baltic 501; Celtic 361, 368; Classical Latin 
329-30; Indo-Aryan 210, 229, 233-4, 237-8, 
251; Italic 329-30, 334, 343; Old Hittite 178; 
PIE 62-3, 69-70, 163; Tocharian 462; see also 
declensions 

case functions: in the AIELs and PIE 139-45, 

155; Tocharian 454; see also ablative case; 
accusative case; allative case; dative case; 
ergative case; genitive case; instrumental case; 
locative case; nominative case; vocative case 
case loss 387, 390, 400, 412 
Catullus 323, 325 
Caucasus 3, 5, 171, 205 
causal clauses 312, 346 
causatives 94-5, 167, 189, 223, 238, 245, 278, 
340, 374, 399, 404, 445, 460-1, 468-9, 490, 
512, 541, 587, 593; morphological 209, 252; 
oppositions 248 
Cabej, Eqrem 560, 597 
Central Europe 5 

centum languages 6, 21, 497; labiovelars 23-5; 
loss of labiality 25-6; map 28; palatalized 
velars 21-3; reflex of PIE *kw/kw 24; reflexes 
found in satem languages 27 
Chadwick, John 291, 297, 314 
chain shifts 20, 393, 396, 412 
Chantraine, P. 314 
Chatterji, Suniti Kumar 254 
Cheung, Johnny 283 
China 5, 263, 452 
Chinese Empire 452 
Chinese Turkestan (Xinjiang) 263 
Chomsky adjunction 116 
Cicero 323 
Cipo, Kostaq 603 
circumflex 297 

circumflex tone: Balto-Slavic 155, 479-82, 489, 
491, 496-7, 501-2, 510; Lithuanian 492-3, 
495; Old Prussian 495-6; Saussure’s Law 
493-4, 5 1 5n4; Slavic 525-7 
Civita Castellana 317 


Clackson, James 56, 107, 146, 323, 344-5, 350, 
422, 431 

clauses: adverbial 312-13, 346-7; alignment 
of 111, 162-5, 167; adverbial 346-7; causal 
312, 346; conditional 346, 407, 473; dependent 
472-3, 511, 601-2; independent 105, 471; 
purpose 346; relative 111, 165-6, 193, 282, 
344, 347-8, 364, 472-3, 558; temporal 346; 
see also subordinate clauses 
clusters: consonant 431-3; dental 54; 
thorn 53 

Codex Argenteus 40 1 , 408, 410-11 
Codex Marianus 535, 538, 544 
Codex Neumannianus 487 
Codex Zographensis 538, 544 
Coetsem, Frans van 399 
Collinder, Bjorn 153 
Colvin, S. 314 

Common Slavic Linguistic Atlas 547 
comparative linguistics 3, 7, 112, 217, 250, 

252-3, 350, 353 

comparatives 80, 303; Albanian 580; Armenian 
438, 446; Celtic 370; Greek 80, 293, 303-4; 
Iranian 276; Italic 334; Slavic 537 
compensatory lengthening 43, 70, 77, 155, 177, 
294-6, 328, 366-7, 389, 412, 430, 441, 525 
complementation: Albanian 601-2; Greek 311, 
313; Latin 345-6 

complementizers 387, 405, 407-8, 412, 414 
compounding 403, 470, 543, 597, 599; see also 
compounds 

compounds: Albanian 599; Armenian 438, 446-7; 
Baltic 505; Celtic 377; Germanic 403; Greek 
302, 314; Indo-Aryan 222, 229, 250, 251, 

254; Iranian 281; PIE 79, 90; Italic 332, 350; 
nominal 470-1; Slavic 543; Tocharian 470-1; 
see also compounding 
Comrie, Bernard 157, 162, 168, 533, 546 
conditional clauses 346, 407, 473 
conditional mood 238, 242, 372, 375, 508, 511 
conjugations see verbal conjugations 
conjunctions: Albanian 586, 601-2; Anatolian 
192-3; coordinate 138; Germanic 407; Greek 
312; Italic 346-7; Lithuanian 514; PIE 55, 65, 
105, 123, 138; postpositive 123; subordinating 
146, 184, 347, 407, 412, 538, 544; temporal 
312; Tocharian 467 

consonant clusters: Albanian 462, 465-6, 

468-9; Armenian 426, 428, 431, 433; Celtic 
372; Germanic 399; Greek 291; Indo-Aryan 
229-30; PIE 52-4, 62; Tocharian 455, 459 
consonant shifts 18, 20, 392-3, 396 
consonantism 361, 364; Brythonic 365; Celtic 
362-3; Gaelic 366-7 

consonants: Anatolian 176-7; Armenian 426-34; 
Baltic 496-9; Celtic 361-7; dental affricates 
269, 271, 498; dental clusters 54; dentals 14, 

28, 35, 54, 227-8, 291, 342, 454, 528; doubled 
361; ejective 18, 154; Germanic 391-4, 397; 
Indo-Iranian 225-9; intervocalic geminate 364; 
liquids 428; nasals 428-9, 565; palatalized 
361, 367, 374, 460, 491, 524; plosive 225-6; 
pre-Germanic 391; prothetic 529-30, 537; 
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Proto-Anatolian 176-7; Proto-Indo-European 
52-5; retroflex 225, 227, 229-30, 263, 270, 
423; sibilants 177, 206, 217, 228-9, 269, 
427-8; Slavic 528-33; see also consonant 
clusters; consonant shifts; lenition; stops 
Constantine 522 

converbs 158, 211, 238, 250, 257; see also 
gerunds 

Corbett, Greville C. 533, 546 
Corsica 321 

Courtenay, Baudouin de 483nl 
Cowgill, Warren 154, 187, 356, 359, 363, 368, 
371,372, 373,439 
Cretan Hieroglyphic 29 1 
Croatia 552 

Croft, William 143, 168 

cuneiform 3, 171-4, 177-9, 181-3,263, 

266-7 

Curta, Florin 521 
Cuzzolin, Pierluigi 350 
Cypro-Minoa script 29 1 
Cyrillic alphabet 519 
Cyrillo-Methodian tradition 519 

Dacia 321 

Daka, Palok 603 

Dalmatian coast 558 

Daniel of Voskopoja 554 

Darius the Great 421 

dative absolute 166, 511, 544 

dative case: Albanian 558, 573-8, 581, 583, 

600- 1; Anatolian 178-83, 191-2; Armenian 
435, 438-9; Baltic 499, 501-3, 507-8, 

511-13; Balto-Slavic 6, 480, 482; Celtic 361, 
367-8, 370; Germanic 6, 400-2; Greek 144, 
301-2; Indo-Aryan 233, 249, 251; Indo-Iranian 
207; Iranian 274, 283; Italic/Latin 322-3, 
329-31, 333-6, 342-3; PIE 62, 138-9, 141, 
143-5, 147, 158, 166-7; Slavic 544 
Dauksa, Mykalojus 488 
deaspiration 19-20, 229, 392 
De Bernardo Stempel, Patrizia 359, 360, 362, 364, 
369, 370, 376-7 
Debrunner, Albert 254 

declensions: Albanian 574, 576-80, 583; Baltic 
500-7; Armenian 429-30; Avestan 275-6; 
Celtic 368, 370; consonants 235; Greek 
287, 298-300, 302-3; holodynamic ablaut 
69, 71, 72, 73, 76, 77; holostatic ablaut 69, 
78-9; hysterodynamic ablaut 68-9, 75, 77; 
Indo-Aryan 233-9, 250; Indo-Iranian 210; 
Iranian 272, 274, 276-7, 281-2; Italic/Latin 
329, 331-4, 342, 350; Proto-Indo-European 
62-5, 67-8, 77, 81-2, 85, 86, 88, 90, 93, 104, 
159; Slavic 533-7, 539; Tocharian 455, 462, 
464, 467; see also case endings; nouns 
definite articles 304, 370, 396, 403, 408-9, 464, 
522, 558, 579-80, 582, 596 
definiteness 387, 403, 405, 408, 409, 504, 537, 
542, 546, 573, 574, 576 
definites, Germanic 408-9 
Degener, Almuth 264 
de Hoz, Javier 355 


deictic particles 167, 171, 277, 304, 306, 335-6, 
407-9, 508, 538-9, 583-4, 596 
deictic pronouns 182-3, 464-5 
deiotation 524, 530-3 
Delamarre, Xavier 369, 372, 377, 378 
Delbriick, Berthold 111, 120, 124, 127, 249, 254 
Demiraj, Bardhyl 555, 557, 560, 571, 603 
Demiraj, Shaban 560, 573, 577, 578, 579, 603 
demonstratives: Celtic 370-1; Germanic 407-9; 
Latin 335; PIE 160, 165; see also pronouns, 
demonstrative 
Denmark 389 

dental affricates 269, 271, 498; see also affricates 
dental clusters 54 

dentals 14, 28, 35, 54, 227-8, 291, 342, 454, 528 

depalatalization 270, 271, 531 

dependent clauses 472-3, 511, 601-2 

dependent-markedness 156-7 

Derksen, Rick 481, 483, 486n3, 486n6, 515, 547 

desiderative mood 94, 161, 245, 269-70, 340, 

376, 508 

desiderative suffix 307-8 
de Simone, Carlo 354 
Desnickaja, Agnija 557, 560, 577, 578 
De Vogelaer, Gunther 397 
devoicing 20, 22, 25, 229, 272, 392-4, 

568, 596 

diachronic elements 34, 153, 158, 208-11, 

326-8, 334, 403, 443, 475, 515, 595 
diathesis 146, 238-41, 248, 510 
Dickey, Stephen M. 522, 542 
DiGiovine, Paolo 107, 185 
diglossia 323 
Dik, H. 311 
Dillon, Myles 356 
diphthongization 556, 565 
diphthongs: Armenian 424-5; Avestan 266; Baltic 
490; Celtic 364, 366; Germanic 395; Greek 
291, 293-4; Indo-Aryan 223-5; Indo-Iranian 
263; Iranian 266; Latvian 492; Lithuanian 
492-3, 496; long 51, 290; monophthongization 
523-4, 531; PIE 48-51; spurious 290 
Doluxanov, Pavel M. 52 1 
Domi, Mahir 603 
Doric future 307 
Dover, K. J. 3 1 1 
Dresher, B. Elan 398 
Dressier, Wolfgang 112, 128 
Drinka, Bridget 107, 164 
Dryer, Matthew S. 165 
Duden, Konrad 403 
Duhoux, Y. 3 14 
Dumezil, Georges 43 1 
Dunkel, George 192 

Dybo, Vladimir Antono vie 481, 482, 526, 547 
Dybo’s Law 357, 486n7, 526 
Dzaukjan, G.B. 431 

Edel’man, Dzoj Iosifovna 283 
East Prussia 488 
Eastern Europe 5 
Edda 406 
Egetmeyer, M. 314 
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Eggers, Hans 400, 401-2, 404 
Eichner, James 176, 177, 178, 186, 187, 189, 194, 
359, 429, 437, 440 
Einarsson, Stefan 40 1 
ejectives 18, 154 
Ekblom, Richard 495 
Elbing Vocabulary 487, 489, 495, 503 
Elichmann, Johann 2 

Elizarenkova, Tatjana Jakovlevna 253, 255 
Elsie, Robert W. 357 
Emmerick, Ronald 277 
Emonds, Joseph Embley 389 
Enchiridion (Luther) 488 
enclitic particles 408, 440 
enclitics 105, 120, 181, 191-2, 270, 272-3, 275, 
277, 304-5, 335, 344, 396, 408, 438, 507, 514; 
atonic 297; pronomial 171, 182-3, 397 
Endzelin, Jan 479, 482, 488, 494-5, 512, 

514, 515 

EndzelTn’s Law 494-5 
Enger, Hans-Olav 398 
England 7 

epigraphy 323-4, 350 
ergative case 164, 178,210-11,249-50 
ergative construction 65, 163, 167, 210-11, 252, 
257n29 

ergative hypothesis 111, 163-4 
Erlangen School 56 
Emout, Alfred 350 

Eska, Joseph F. 8, 356, 357, 358, 359, 360, 361, 
363, 368, 370, 373, 376, 377 
Etruria 3 1 7 

etymology 192, 194, 218, 378, 595; Armenian 
423; Slavic 547; Tocharian 454 
Evangelienbuch 398, 404, 408 
Evans, D. Ellis 357, 358, 359, 363, 364, 368, 

376, 378 

Evans, (Sir) Arthur 291 
Eythorsson, Thorhallur 406, 407 
Eznik 42 1 

Faarlund, Jan Terje 389, 396 

Faiss, Klaus 284 

Falerii Veteres 317, 321 

Falileyev, Alexander 354, 355, 359, 376 

Fennell, Trevor G. 515 

Ferrara, S. 3 14 

Ferraresi, Gisella 389, 405 

Fertig, David 397 

Fiedler, Wilfried 572, 573, 588, 595, 603 
Fiji 214 

First Sound Shift 392; see also sound changes 

Fleuriot, Leon 358 

Forssman, Bernhard 187, 283, 515 

Forster, R 389 

fortis geminates 388 

fortis stops 396 

Fortson, Benjamin W. IV 56, 106, 146 

Fraenkel, Ernst 515 

France 4, 359, 421 

Francis, Eric D. 43 1 

Frei, Peter 175 

Freising Folia 519, 531, 544, 547 


Fremdzeichen 458 

fricatives: Albanian 568; alveopalatal 498; 
Anatolian 177; Baltic 496-9; Celtic 354; dental 
532; IE 293; Germanic 381, 389, 391-3, 396; 
Greek 291-3; Iranian 264, 268, 271; Italic 
318, 325-8; non-phonemic 14; palatalization 
532; Proto-Indo-European 13-15, 19-23, 27, 
29-30, 33, 53; Slavic 528-30, 532-3 
Friedrich, Johannes 126, 194 
Friedrich, Paul 112, 165 
Fritz, Matthias 107, 146, 159 
FuB, Eric 397, 407 

Gamkrelidze, Thomas (Tamaz) V. 8, 18, 56, 154, 
155, 164, 165, 168,426, 527-8 
Garcia Alonso, Juan Luis 376 
Garcia Ramon 287 
Garde, Paul 547 

Garrett, Andrew 171, 184, 192, 193 

Gast, Volker 403 

Gaters, Alfreds 515 

Gelderen, Elly van 409, 4 1 0 

Geldner, Karl Friedrich 253, 283 

Gelenius, Sigismund (Zikmund Hruby z Jeleni) 2 

Gelsen, Henry 5 1 5 

geminates 364; elimination of 528; fortis 388 
gemination 294, 397 
genitive absolute 1 66, 3 1 3 
genitive case: Albanian 558, 573, 575-7, 580-2, 
599-600; Anatolian 178-80, 182-3; Armenian 
438-40, 447; Baltic 499-502, 506, 512-14; 
Balto-Slavic 479-80, 482; Celtic 357, 360, 

361, 368-71; Germanic 400, 403-4; Greek 
143, 301-2, 313; Indo-Aryan 233, 249-51; 
Indo-Iranian 207, 210-11; Iranian 274, 283; 
Italic/Latin 323, 329-30, 333-6, 342, 347; 
Italo-Celtic 319-20; PIE 62,81, 111, 118, 129, 
138-9, 141, 143-4, 147, 156, 158-9, 163, 
165-7; Slavic 522, 534-5, 546; Tocharian 
462-3,465-6, 472-4 

genitives: adnomial 512; partitive 513 14, 522 
Georgia (Soviet) 263 
Georgiev, V. 555 

gerundives 190, 238, 246, 250, 251, 342, 348, 470 
gerunds 146, 158, 238, 250, 342, 445, 511, 586, 
588, 592; see also absolutives; converbs 
Ghatage, A.M. 255 
Gimbutas, Marija 8 
Girdenis, Aleksas 496 
Gjinari, Jorgji 573 
Glagolitic alphabet 38, 519 
glides: geminate 388; hardening 388, 389; 
intervocalic 299; palatal 388; strengthening 
397; syllabic 14, 39, 42, 62 
glottalized stops 18, 154, 167,426 
glottallic theory 18, 56-7, 154, 426 
Gluhak, Alemko 547 
Godel, Robert 431, 433, 447 
Goedegebuure, Petra 178, 182, 192, 193 
Goeman, Ton 397 
Goldbach, Maria 405 
Golston, Chris 123, 192 
Gonda, Jan 124, 238, 249 
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Goto, Toshifumi 211, 245, 248, 253, 254, 283 
Gottingen Register of Electronic Texts in Indian 
Languages (GRETIL) 253 
Government and Binding model 115-16 
grabar 42 1 

grammars: generative 115; historical 253-4 

Grassmann, Hermann 255 

Grassman’s Law 19-20, 293-4 

Greece 4, 5, 287, 552 

Greek alphabet 289-90 

Greek uncial script 519 

Green, D.H. 387 

Greenberg, Joseph H. Ill, 115, 153, 165, 
168,538 

Greenberg, Marc L. 521, 527 
Greene, David 356, 358, 367 
Greppin, John A. C. 446 
Grierson’s division 2 1 7 
Griffith, Ralph T. H. 130, 372 
Grimm, Jacob 3, 112 
Grimm’s Law 20, 353, 392, 393 
Grunau, Simon 487 
Gruppenflexion 252 
Gumo, Kingdom of 453 
Gusmani, Roberto 175, 192 
gutturals 154, 225,559, 571 
Guyana 214 

Gvozdanovic, Jadranka 107 

Haarmaan, Harald 557 
Haegeman, Liliane 122 
Hajnal, Ivo 115, 166, 176, 180, 187, 189, 190 
Hale, Mark 115, 121, 122, 123, 193 
Halle, Morris 396 
Hallstatt Iron Age culture 354 
Hamp, Eric P. 352, 358, 359, 362, 375, 376, 389, 
436, 558, 603 

Hanna, Patrizia Noel Aziz 398 
Harbert, Wayne 403, 407 
HarSarson, Jon Axel 107 
Harvey, Anthony 361, 364 
Haspelmath, Martin 558, 599 
Haug, D. 290 
Haugen, Einar 397 
Havlik’s Law 525 

Head Preference Principle (HPP| 409 
Heather, Peter 388, 389 
Hecataeus of Miletus 354 
Heiric of Auxerre 353 
Held, Warren H„ Jr. 193 
Hemon, Roparz 378 
Henning, W. B. 266 
Herodotus 355, 421 
heteroclitic stems 77-8, 332, 370 
heteroclitics 8 1 , 100, 159, 167,437 
Hethitologie Portal Mainz 1 94 
Hettrich, Heinrich 139-41 
Hewson, John 107, 168, 251 
hieroglyphic texts 3, 171, 173, 174, 291 
High German Consonant Shift 393; see also 
sound changes 
Hildebrandslied 400, 4 1 0 
Hill, Eugen 404, 405 


Hintze, Almut 284 
Hiniiber, Oskar von 255 
Hirt, Hermann 481 
Hirt’s Law 481, 525 
Hittites 42 1 

Hock, HansH. 211,515, 560 
Hoffmann, Inge 1 94, 272 
Hoffmann, Karl 242, 265, 266, 273, 275, 277, 283 
Hoffner, Harry A. 185, 192, 193, 194 
Holland, Gary B. 139, 253 
holodynamic ablaut declension types 69, 71, 72, 
73, 76, 77 

holostatic ablaut declension types 69, 78-9 

Holst, Jan Henrik 422, 447 

Holtzmann’s Law 388 

Holub, Zbynek 522, 527 

Homer 135-7, 290, 298, 305, 308, 309, 314 

Homeric Kunstsprache 287, 290 

Hopper, Paul 154 

Horace 323 

Horrocks, G.C. 290, 311, 314, 323, 344-5, 350 

Howell, Robert B. 396, 401, 403 

Hruby, Zikmund (z Jeleni) 2 

Hubbard, Philip 603 

Hiibschmann, Heinrich 447 

Huggard, Mattyas 193 

Huisman, Roelof (Agricola, Rodolphus) 2 

Huld, Martin E. 603 

Humbach, Helmut 284 

Hurrian empire 264 

hyperbaton 116-20, 124, 127 

hypervoicing 4 1 3n 1 

hysterodynamic ablaut declension types 68-9, 

' 75, 77 

Iberian Peninsula 4 

Idsardi, William 392 

Illic-Svityc, Vladimir 153, 481, 482, 547 

Illyria 555 

Illyricum 32 1 

imperative mood 161, 281, 338; Albanian 
585-91,597, 601; Anatolian 184, 186, 188; 
Armenian 428, 441-3; Baltic 508, 510-11; 
Germanic 405, 407; Greek 305; Indo-Aryan 
238-42, 246-7; Indo-Iranian 207; Iranian 
278, 281; Italic/Latin 338, 345-6; PIE 91-2, 

97, 102-3, 106, 127-8, 1 6 1 ; Tocharian 460, 
468,471 

/-mutation 178, 190 

indefinite articles 403, 409, 414nl0, 440, 558, 

574, 600 

indefiniteness 312 
independent clauses 105, 471 
India (Republic of India) 3-4, 205; sociolinguistic 
situation in 214-15, 217 
indicative mood: Albanian 586-96, 597, 601-2; 
Anatolian 184-8; Armenian 441-4; Baltic 
508; Celtic 372-5; Germanic 411; Greek 
305-7, 309, 311-13; Indo-Aryan 238, 242; 
Indo-Iranian 207; Iranian 277-9; Italic/Latin 
323, 338,340; PIE 6, 91-2, 100-1, 161-2; 
Slavic 538; Tocharian 468, 473 
INDOLOGY 253 
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Indo-Uralic hypothesis 7 
infinitives: Albanian 552, 558, 586, 588, 592, 
602; Anatolian 178, 180, 190-1; Armenian 
445; Baltic 491, 493, 495, 503, 508-13; 
Balto-Slavic 486; Greek 305, 309-13; 

Indo- Aryan 222, 238; Iranian 278, 281; Italic/ 
Latin 341-2, 345-6; PIE 104, 107, 165-6; 
Slavic 540-1, 544; Tocharian 468-9 
inflection see case endings; declensions; verbal 
conjugations 

injunctive mood 91, 97, 107, 113, 114, 130, 207, 
238-9, 241-2, 246, 277, 279-80, 282, 428, 
441-2 

Insler, Stanley 249, 284 
instrumental case: Anatolian 180; Baltic 
499, 501-2; Balto-Slavic 479; Celtic 361, 
367-8; Germanic 400; Greek 139, 201, 

302; Hieroglyphic Luwian 182, 191; Hittite 
142-3, 178, 183; Indo-Aryan 233, 249-50; 
Indo-Iranian 207, 210, 211; Iranian 274; 

Italic 329-30, 342; Lithuanian 513; Vedic 
139-41; PIE 62, 81, 139, 143-4, 158, 192; 
Tocharian 463 

interjections 106, 158, 440, 558 
internationalisms 558 
intervocalic glide 299 
intrasyllabic harmony 524 
Iran 3-4, 263 
Iraq 263 
Ireland 4, 355 
Iron Age 320, 354 
Irslinger, Britta Sofie 376 
Isaac, G. R. 376 
Isidor 406 
Israel 42 1 

Italy 4, 5, 287, 359, 552; ancient 317 
Ivanov, Vjaceslav Vsevolodovic 8, 18, 56, 154, 
164, 165, 168,483, 527-8 
Iverson, Gregory K. 392, 393, 394, 396 

Jackson, A. V. Williams 263 
Jackson, Kenneth Hurlstone 356, 361 
Jacobs, Neil 413 
Jahowkyan, G. B. 446 
Jain, Dhanesh 255 
Jakobson, Roman 153 
Jamison, Stephanie W. 249, 253 
Janda, Richard D. 8 
Janko, Richard 290 

Jasanoff, Jay H. 107, 184, 187, 368, 369, 372, 
374, 375, 376, 427, 596 
Jaskula, Krzysztof 367 

Jazyki mira (Languages of the World) series 255 

Jeffery, L. H. 289 

jer fall 525 

Jespersen’s Cycle 398 

Jokl, Norbert 557, 558 

Jones, Brandtly 290 

Jones, William 3 

Jordan Colera, Carlos 360 

Joseph, Brian D. 8, 168, 599 

Joseph, Lionel S. 359, 362, 369, 375, 376 

Josephson, Folke 192-3 


Joshi, S. D. 255 
Jutland 389 

Kabelka, Jonas 515 
Kallio, Petri 486 
Kammenhuber, Annelies 1 94 
Kapovic, Mate 107, 526, 547 
Karalitinas, Simas 490 
Karl, Raimund 359, 371, 377 
Karulis, Konstantins 515 
Kathol, A. 397 
Katz, Joshua T. 107, 370 
Kavalioti, Theodor 554 
Kazakhstan 457 
Kellens, J. 278, 283, 284 
Kempgen, Sebastian 533, 543, 546 
Kent, R. G. 284 
Kerswill, Paul 389 

Keydana, Gotz 116, 121, 122, 123, 133, 

138-9, 146 
Killen, John 3 1 4 
Kimball, Sara 176, 187 
Kiparsky, Paul 115, 396, 407 
Klein, Daniel 488 
Klein, Jared S. 128 

Klingenschmitt, Gert 422, 429, 433, 434, 435, 

440, 443, 444, 447, 573, 577, 579, 598 
Kloekhorst, Alwin 176, 177, 187, 189, 194, 430 
Kluge, Friedrich 387, 389, 391 
Kobayashi, Masato 254 
Kober, Alice 291, 3 14 
Koch, John T. 362 
Kodderitzsch, Rolf 360, 603 
Kogan, Anton I. 205 
koineization 389 
Koivulehto, Jorma 389 
Koman culture 555 
Konig, Ekkehard 403 
Kopecny, Frantisek 547 
Koriwn 42 1 

Kortlandt, Frederik H. H. 18, 163, 240, 245, 427, 
428, 430, 434, 435, 448, 481, 486n6, 489, 526, 
527, 528, 547 
Kosovo 5, 552 
Kostallari, Androklli 603 
Krause, Wolfgang 387, 474-5 
Krisch, Thomas 113, 119, 120, 123, 124, 128, 
133-4, 135, 138-9, 144,255 
Kristoffersen, Gjert 396 
Kronasser, Heinz 1 93 
Kroonen, Guus 389 
Kroraina, Kingdom of 457 
Kroraina Prakrit 453 
Kucha, Kingdom of 452, 453 
Kulikov, Leonid Igorevic 144-5, 246-7, 248, 254 
Kiimmel, Martin 211, 242, 248, 254 
Kurgan hypothesis 6-8 
Kuroda, Susumu 404 
Kurschat, Friedrich 488, 493 
Kurylowicz, Jerzy 363, 364, 367, 409 
Kurz, Josef 547 
Kiispert, Klaus-Christian 396 
Kylstra, A.D. 389 



624 SUBJECT-AUTHOR INDEX 


labialization 14 

labials 14-15, 23, 38-9, 50, 228, 427, 454, 
489-90, 532-3; see also labiovelars 
labiovelar glides 388 

labiovelars 14-15, 20, 23-5, 39, 154, 176-7, 

206, 291-2, 324-5, 327, 363, 366, 388, 391-3, 
426-7,441,480, 486n2, 571 
Labov, William 8, 396 
Lahiri, Aditi 398 

Lambert, Pierre- Yves 352, 358, 360, 361, 368, 

372, 376, 378 

Lamberterie, Charles de 439, 447 
Landi, Addolarata 557 
Langslow, David 111, 119, 146 
language typology: phonological 153-5; 

Proto-Indo-European 153-68 
Lanman, C.R. 253 
Laroche, Emmanuel 1 74 
laryngeal hardening 443 
laryngeal metathesis 52-3 
laryngeals 14, 29, 30, 154, 155, 177, 292; 
Indo-Iranian 206; interconsonantal 430; 

PIE 267, 271; syllabic 14 
Lass, Roger 403 

Late Common Slavic period 521 
La Tene Iron Age culture 354 
Latin alphabet 387 
Latium 317,320 
Law of Illic-Svityc 486n7 
law of the penult 328 
Lebanon 42 1 
Ledisma, Jacobus 488 
Lehfeldt, Werner 515, 526 
Lehmann, Winfred P. 8, 1 12, 165, 168, 389, 
408,413 

'Leiden School’ 56, 526 
Lejeune, M. 314, 368, 372 
lenition 177, 361, 364-7 
Leskien, August 540, 547 
Leskien’s Law 493, 507 
Lewis, Henry 361-4, 377, 378 
lexicon: Albanian 557; Anatolian 193; 

Indo-Iranian 207-8; Latin 349-50 
Lhuyd, Edward 353 
Lieberman, Erez 399 
Lindeman, Fredrik Otto 362, 440 
Lindeman’s Law 54, 89 
linguistics, historical 3 
Lippert, Jorg 407 
Livy 323 

locative case: Albanian 575, 577, 578, 598; 
Anatolian 178, 180, 183, 190, 191; Armenian 
435, 439; Baltic, 499, 501, 502-4, 508, 513; 
Balto-Slavic 482; Celtic 367; Germanic 400; 
Greek 297, 313; Indo-Aryan 233, 250-1; 
Indo-Iranian 207, 210; Italic/Latin 329, 331, 
342; PIE 62, 81, 138-41, 143-4, 156, 158, 

165; Slavic 522; Tocharian 462 
Lockwood, W.B. 412 

Lubotsky, Alexander M. 2 1 1 , 242, 255, 284 
Luke (the Evangelist) 1 66 
Lugano alphabet 359 
Liihr, Rosemarie 116, 121, 444 


Luiten, Tyler 403 

Lunt, Horace G. 547 

Luraghi, Silvia 115, 128, 138, 143, 144 

Luther, Martin 488 

Lysistrata 291 

MacAulay, Donald 377 

Macavariani 155 

Macdonell, A. A. 254 

Macedonia 5, 552 

Mailhammer, Robert 389 

Maidive Islands 205, 214 

Mallory, James P. 8, 153, 171, 354, 456, 474 

Malzahn, Melanie 475 

Mann, Stewart 603 

Manx 352 

Marek, Christian 175 
Marggraf’s Law 462 
Martinet, Andre 364 
Martinez, Javier 283 

Martirosyan, Hrach K. 427, 428, 429, 447 
Mara 389 

Masica, Colin P. 2 1 1 , 252, 255 
Matasovic, Ranko 107, 159, 160, 164, 168, 357, 
378, 407,410,411 
Mathiassen, Terje 5 1 5 
Matranga, Luca 554 
Matras, Yaron 252, 601 

Matzinger, Joachim 422, 429, 447, 455, 554, 555, 
557, 560, 567, 571, 573, 577, 578, 579, 580, 
588, 593, 595, 596, 597, 599, 603 
Mayrhofer, Manfred 154. 211, 264, 273, 

283, 284 

Maziulis, Vytautas 489, 490, 495, 515 
Mazvydas, Martynas (Martinus Masvidius) 488 
McCluskey, Stephen C. 487 
McCone, Kim Robert 111, 124, 352, 356, 358, 
359, 360, 361, 362, 366, 367, 370, 371, 372, 
373, 375, 376, 377, 378 
McManus, Damian 361, 375 
McWhorter, John H. 389 
Medes 264 

Medimurje Kajkavian 544 
Meid, Wolfgang 125, 194, 354, 356, 357, 358, 
359, 360 

Meier-Briigger, Michael 56, 106, 146, 185 
Meillet, Antoine 26, 167, 350, 389, 428, 433, 435, 
446-7,448,479, 481 
Meillet’s Law 481, 482 
Meiser, Gerhard 350 
MeiBner, T. 3 14 
Melanesia 214 

Melchert, H. Craig 171, 176. 177, 178, 180, 

182, 185, 187, 188, 189, 190, 191, 192, 

193, 194 

Mendez Dosuna, J. 288 
Menz, Andrea 398 
Merrill, Peter T. 547 
‘Meshari’ 554 

mesodynamic ablaut declension types 69, 72, 73 
Mesopotamia 171 
Mesrop Mastoc c 42 1 
Messing, G. M. 300 
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metathesis 15-17, 52-4, 157,299, 301,432, 

524, 554 

metatony 481, 482, 493 
Methodius 522 
Meyer, Gustav 579, 603 
Michael Attaleiates 553 
Middle East 3, 5 
Mihaescu, Haralambie 557 
Miller, D. Gary 112 
Milyan 174 

Mitanni kingdom 205, 264, 457 
Mongolia 452 

Monier- Williams, Monier 254 
monosyllabic lengthening 38, 54-5, 70, 

83-4, 105 
Montenegro 5, 552 
Morgenstieme, Georg 205, 264 
morphology 7, 8; Albanian 573-97; Armenian 
434-45; Baltic 499-512; Balto-Slavic 479, 
480; Celtic 367-76; Germanic 389, 398-403; 
Greek (adjectives and adverbs) 302-4; Greek 
(nominal) 298-302; Greek (pronouns) 304-5; 
Indo-Iranian 207; inflectional 398-400; Iranian 
266, 274-81; Latin (nominal and pronomial) 
329-33; Latin (verbal) 336-43; nominal 
178-80, 233-4, 274-6, 298-302, 400-3; 
Slavic 533-42; Tocharian 461-9, 461-9; 
verbal 184-90, 238-9 

morphophonology: Indo-Aryan 229-33; Iranian 
272-4; Latin 339; morphotactics 229; PIE 112; 
Tocharian 458-9 

Morpurgo Davies, Anna 177, 185, 301, 314 

Mortensen, David 389 

Moscow Gospel 421 

Moscow School 526 

Movses Xorenac 'i 421 

Muhlenbach, Karl/ Mtilenbachs, Karlis 488, 

494, 515 
Mullen, Alex 361 
Muller, Nicole 377 
Muller, Stefan 391 
Murray, Robert 397, 398 
Mylius, Klaus 255 

Nagomo Karabagh 42 1 
Narten, Johanna 254, 265, 284 
Narten-imperfects 596 
nasalization 429, 490, 492, 561, 565 
National Awakening/Renaissance 
(Albania) 558 
Nau, Nicole 489 
near-constraints 54 
Negau helmet 4, 387 
Nepal 214 
Neri, Sergio 107 
Nesselmann, Georg H.F. 486 
Neu, Erich 194 
Newmark, Leonard 603 
Nibelungenlied 398 

Nichols, Johanna 156, 157, 160, 167, 168,521 
Nielsen, Hans Frede 389, 413 
Niepokuj, Mary 107 
Nolle, Johannes 175 


nominative case: Albanian 573-3, 578, 582-3, 
599-600; Anatolian, 178-80, 183, 189-90, 
192; Armenian 435, 437-8, 444-6; Baltic 
499, 502-4, 511-13; Celtic, 368-70, 374, 

376; Germanic 400, 402, 404, 411; Greek 297, 
312; Indo-Aryan, 233, 250, 252, 255, 257; 
Indo-Iranian, 207, 210-11; Italic/Latin 329-38, 
340-3, 345; PIE 62, 65, 94, 139, 145, 154, 

158- 60, 162-4, 167; Slavic; Tocharian 462-8, 
470, 472-4; see also nouns 

Nooten, Barend A. van 253 
Normans 7 
Normier, Rudolf 43 1 
Norway 389 
Nostratic hypothesis 168 

nouns: abstract 107, 349, 436, 446, 470, 480, 598; 
agent 180, 314, 332, 349, 427, 432, 441, 445, 
446, 468, 471, 480, 503; Albanian 523-30; 
Armenian 434-8, 445-7; Baltic 499-503; 
barytone 180, 189, 297, 486n8, 525-6; 
Common Celtic 367-70; Germanic 400-3; 
Greek 298-302; Indo-Aryan 233-8; Iranian 
274-6; Italic/Latin 329-34; Proto-Indo- 
European 62-3, 67-74, 77-9, 139-44, 

159- 60, 163; Slavic 533-6; Tocharian 
462-3; see also case endings; declensions; 
nominative case 

Nova Scotia 352 

Novgorod Birchbark letters 519, 531, 535, 

544, 547 

numerals: Albanian 584-5; Armenian 440-1; 
Baltic 505-6; cardinal 89, 90, 157, 440, 

466-7, 505, 584-5, 600; Celtic 352-3, 357; 
Iranian 276-7; ordinal 90, 352-3, 441, 467, 
505-6, 580, 585, 600; PIE 89-90, 107, 156-9; 
Tocharian 466-7 

6 hUiginn, Ruairi 364, 370 
Oberlies, Thomas 242, 254, 255 
object ellipsis 135 

obstruents 339, 392-4, 412, 510, 529, 568-71 
Oertel, Hanns 249 

Oettinger, Norbert 178, 180, 184, 187, 189 
Oftedal, Magne 396 
Ogham alphabet 361 
OHG sound shift 20 
Olander, Thomas 106, 547 
Olsen, Birgit Anette 429, 430, 431, 437, 441, 444, 
446, 447 
Oman 263 
O’Neil, Wayne 389 
onomatopoeia 106 

optative mood 102, 161, 281, 309, 468, 596-5 

Oranskaja, Tat’jana Iosifovna 255 

ordinal numerals 90, 352-3, 441, 467, 505-6, 

580, 585, 600 

Orel, Vladimir 389, 558, 560, 573, 577, 579, 

597, 603 

orthography 14, 172, 265-6, 288, 294-5, 323-4, 
361,489-92 

Oscan 317, 321, 328, 329, 330, 342, 349; verb 
system 336 
Oshiro, Terumasa 188 



626 SUBJECT-AUTHOR INDEX 


Osthoff’s Law 30 1 
Otfrid 398, 404, 408 
Otten, Heinrich 126, 127 
Ottoman invasion 558 
Olberg, Hermann 560 
Ovid 323 

Oxford Classical Texts series 350 
oxytone 180, 189, 297, 526 

Pakerys, Antanas 496 
Pakistan 3, 214, 263 
Pal Engjelli (Paulus Angelus) 554 
Palaic state 173 
palatal glides 388 

palatalizations, velar 14-15, 21-3, 26, 28, 154, 
206, 226-7, 269, 293, 458-60, 464, 469, 498, 
524, 530-2, 534,571 

palatovelars 15, 20, 21, 23, 26-7, 206, 496, 498 
Pali 255 

Pamphylia 175, 288 
Pandit, Shankar Pandurang 253 
Parker, H.N. 288 
Parry, Milman 290 

participles (verbal adjectives): in the ablative 
absolute 346; absolute constructions with 
165-7; active 180, 238, 247, 281, 446, 511; 
adjectivization of 599; Albanian 586-8, 590-2, 
597, 601-2; aorist 297, 454, 597; Armenian 
445; Baltic 511-12; c-stem formations in 302; 
future 445, 516; future active 342; future passive 
246; Germanic 398-9, 401, 405, 407; Gothic 

410- 12; Greek 302, 305, 309, 310-13; Iranian 
278-9; Latin 341-2, 346, 348; medio-passive 
190, 461; middle 238, 256n24, 281, 446; middle 
future 309; passive 445; passive perfect 245; 
past 184, 190, 279, 283, 398, 405, 445; past 
active 537; past passive 516; perfect 104, 158, 
342; perfect passive 211, 249, 251-2, 339; PIE 
37, 47, 92, 104-5, 138-9, 158; present active 
104, 158, 234, 334, 342, 500, 511, 537; present 
middle 104, 158, 342; present passive 482; 
preterit 461, 468-9; Slavic 544, 546; stative 
245, 247; Tocharian 464, 468-9, 471 

Patagonia 352 
patronymics 3 14 
P awstos Buzandac i 421 
Perez Orozco, Santiago 175 
Pedersen, Holger 8, 194, 361-4, 378, 427, 428, 
448, 577, 578, 579, 580 

periphrastics 184, 310, 339, 342, 358, 396, 404, 

411- 12,595 
Persians 264 

Petit, Daniel 479, 483, 499, 503 
Peyrot, Michael 475 

phonemes 154, 293, 294; Indo-Aryan 223-5; 
Latin 324; plosives 225-6; sonants 225; 
see also diphthongs; velars; vowels 
phonological rules, PIE 54-5 
phonology 8; Albanian 560-73; Armenian 
423-34; Baltic 489-99; Balto-Slavic 479, 

480; Brythonic 364-5; Celtic 361-4; Gaelic 
365-7; Germanic 391-8, 394-6; Greek 
291-7; Hittite 172; of Indo-Aryan languages 


221-9; Indo-Iranian 206; Iranian 264, 266-72, 
457; Latin 325-6; Slavic 522-33; Tocharian 
453, 458-61; see also Proto-Indo-European 
phonology 
phonotactics 52-4 
Pinault, Georges-Jean 475 
Pindar 130-2 
Pirart, Eric 284 
Pisani, Vittore 427, 558 
Pischel, Richard 255 
Plautus 344, 349 
Pleterski, Andrej 521 
Pliny 387 

Paninian grammatical tradition 217 

plosives 225-6 

Pluralprofilierung 403 

poetic language 115; see also prosody 

Pokomy, Julius 356, 378 

Pokomy’s roots 357 

Polome, Edgar C. 357, 389 

Polybius 555 

polyglossia 216 

Popko, Maciej 194 

portmanteau affix 402 

Portugal 5 

Porzig, Walter 558 

postpositions 55, 63, 66, 81, 104-5, 156, 157, 
210, 327, 434-5, 462-3, 499, 514 
Pott, August 479 
Powell, T.G.E. 354,355 
pre-Anatolians 455 
pre-Indo-Iranians 456 

prepositions: ablative 301; Albanian 575, 582-3, 
598, 600; Armenian 447; Baltic 513-14; 
Balto-Slavic 480-1; Celtic 358, 368, 377; 
Germanic 63, 400, 403-4; Greek 301-2, 313; 
Latin 323, 343-4, 348; PIE 105, 111, 139, 
143-4, 146, 156-7; Tocharian 472; see also 
adpositions; postpositions 
Presamnite 341 

preterit: with ablaut 46 1 ; Anatolian 161-2, 

184, 187: Armenian 441; and the aorist 95, 

307; Baltic 480, 509-10, 512; Celtic 359, 

361, 371, 372, 374-6; dental 359; Germanic 
398-400, 404-5, 41 1 - 12; Hittite 91, 185-8, 
192; Lithuanian 503; non-causative 460; Old 
Prussian 482; and the perfect 97; sigmatic 
461; Slavic 542; suffixless 376; synthetic 542; 
Tocharian 454, 460-1, 468-9; see also verbal 
tenses and conjugations 
preterit participles 461, 468-9 
pre-Tocharians 457 
Preussische Chronik 487 
preverbs 191; Balto-Slavic 480-1; Celtic 377 
Prifti, Peter 603 
Probert, Philomen 193, 296 
Prokosch, Eduard 40 1 2,404,413 
Prokosch’s Law 397 
Pronk, Tijmen 547 

pronouns: Albanian 582-4; anaphoric 171, 
181-3, 440; Baltic 506-607; Balto-Slavic 
482; Celtic 370-1; clitic 507, 514, 600-1; 
deictic 182-3, 464-5; demonstrative 85-6, 
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181-3, 276-7, 304, 334-6, 370, 440, 507-8, 
582-3; dual 401; enclitic 181, 182-3; first 
and second person 465-6; Greek 304-5; 
indefinite 88, 183-4, 439-40; interrogative 88, 
439-40, 465, 471, 508, 538, 584; Latin 334-6; 
personal 81-4, 160, 180-1, 277, 304, 334-5, 
370-1, 438-9, 506-7, 537-8, 582-3; PIE 81; 
possessive 84-5, 183, 439, 583-4; pronomial 
stems 171; reciprocal 439; reflexive 84, 439, 
584; relative 88-9, 183-4, 336, 439-40, 

465, 471-2, 584; Slavic 537-9; third person 
464, 507 

Prosdocimi, Aldo L. 350, 368 
prosody 123; Baltic 492-9; Germanic 
396-8 

Prosper, Blanca Maria 359, 360 
proterodynamic ablaut declension types 68-9 
Proto-Indo-European phonology 56, 318; 
accentual paradigms 526-7; allophones 13; 
clusters 52-4; diphthongs 564; fricatives 
29-30; further reading 56-7; laryngeals 
29-30, 43-53; phonemes 13; reconstructed 
system 13; resonants and glides 30-9; rules 
54-5; stops 13-14, 28, 153, 171, 176; three 
velar series in 26-7; Tocharian and 454-7; 
vowels 40-3 

Proto-Indo-European word order: discontinuous 
constituents (hyperbaton/tmesis) 116-20; 
marked order 1 (verb-initial V-l) 127-132; 
marked order 2 (verb- second and verb-third) 
133-7; moods 161; unmarked order 
(verb-final) 133-8; unmarked order (Celtic) 
125; unmarked order (Hittite) 236-7; 
unmarked order (Mycenean Greek) 124; 
unmarked order (Vedic) 125-6 ; verb stems 
161-2; Wackemagel’s Law 120-4 
pro thesis 530 

Ptolemy (Ptolemeu) 354, 555 
Puhvel, Jaan 194 
purpose clauses 346 

Qawrighul culture 456 
quantitative sociolinguistics 8 

Ramat, Paolo 168 
Ramer, Alexis Manaster 437 
Rask, Rasmus 3 
Rasmussen, Jens 176 
Rastorgueva, Vera Sergeevna 283 
Rau, Jeremy 107 
Raybould, Marilynne 355-6 
reduplication: Albanian 456; Anatolian 189; 
aorist 97, 113-14, 132, 189; Armenian 445; 
Germanic 399, 405, 413n3; Greek 305, 310; 
Indo-Aryan 233, 238-40, 242, 248; Iranian 
273, 278-9; Latin 337, 340; Old Irish future 
375-6; Proto-Indo-European 17, 24, 31, 

62, 83-4, 92, 97-8, 104, 187, 539; Slavic 
539, 546 

Reichelt, Hans 282, 283 
Reis, Marga 397 

relative clauses 111, 165-6, 193, 282, 344, 
347-8, 364, 472-3, 558 


Renan, Ernest 353 

Renfrew, Colin 354 

Renou, Louis 242, 253, 254, 284 

Repansek, Luka 376 

Republic of Cyprus 287 

retroflex 225, 227, 229-30, 263, 270, 423 

Rgveda (RV) 4, 31, 105, 113, 114, 118-21, 

123, 128-30, 140-1, 144-5,205 \ see also 
Vedic texts 

rhotacism 29, 165, 325, 327, 334, 389, 552, 

566-7 

rhotacization 397 
Riad, Tomas 388 

Rieken, Elisabeth 178, 180, 182, 188, 192, 194 
Rijksbaron, Albert 3 14 
Ringe, Don 8, 57, 106, 362, 389, 399, 404, 
405,413 

Rix, Helmut 107, 350, 356 
Rizza, Alfredo 192 
Ro Runestone 405 
Roberge, Paul T. 389 
Roberts, Ian 396 

Robinson, Orrin W. 388, 406, 413 

Rodway, Simon 372, 376 

Roma people 3 

Romani 558 

Romania 5 

root constraints 54 

Roques, Mario Luis 554 

Rosen, Hannah 350 

Roth, Rudolf 253, 254 

ruki rule 6, 29, 206, 229, 269, 428, 441-2, 457, 
497-8, 528-30, 537 
Rumsey, Alan 164 
Ruppel, Antonia 138, 313 
Russell, Paul 355, 361, 364, 371, 375, 

376, 377 
Russia 7 

Sadnik, Linda 547 

Salmasius, Claudius (Claude de Saumaise) 3 
Salmons, Joseph C. 154, 168, 389, 392, 393, 

394, 396 
sandhi 229, 230 
Sardinia 321 
Sasanian period 4, 265 
Sasse, Hans- Jurgen 597 
Sassetti, Filippo 3 

satem languages 6, 21, 426-7, 497, 521, 571; 
centum reflexes found in 27; labiovelars 23-5; 
map 28; palatalized velars 21-3 
Saussure, Ferdinand de 493 
Saussure’s Law 493, 515n4 
Schaefer, Christiane 247 
Schallert, Joseph 52 1 
Schenker, Alexander M. 52 1 , 546 
Schindler, Jochem 429 
Schleicher, August 479, 483, 488 
Schmalstieg, William R. 486n3, 487, 504, 
512,515 

Schmidt, Gemot 107 

Schmidt, Karl Horst 355, 356, 357, 358, 360, 368, 
371,375-6,375 
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Schmitt, Rudiger 115, 211, 263, 264, 266, 273, 
283, 284, 422, 425, 440, 444, 447 
Schmitt-Brandt, Robert 603 
Schrijver, Peter 354, 355, 357, 358, 362, 363, 

370, 372 

Schulte, Michael 387 

Schumacher, Stefan 357, 372, 375, 376, 377, 560, 
567, 571, 573, 577, 578, 588, 595, 596, 597, 
599, 603 

Schumann, Roland 107 
Schwebeablaut 62 
Scotland 4, 352-3, 356 
Second Consonant Shift 393 
Second Sound Shift 394 
SEELRC Network 547 
segmental phonolog problems 57 
Seldeslachts, Herman 435 
Selishchev, Afanansij M. 557 
semivowels 14, 33, 292-3, 424, 429 
Serbat, Guy 143 
Serbia 552 

Shatskov, Andrej 189 

shibilants 423, 433 

Shuteriqi, Dhimiter S. 554 

sibilants 177, 228-9, 427-8; Indo-Aryan 217; 

Indo-European 269; Indo-Iranian 206 
Sicilia 321 

SidePtsev, Andrej V. 192 
Sieg, Emil 474 
Sievers’ Law 54, 396 

Sihler, Andrew L. 57, 106, 306, 350, 391, 535 
Silk Road 5 

Sims- Williams, Patrick 353, 354, 355-6, 358, 
359, 360, 361, 363, 364, 365, 366, 367, 368, 
370, 372, 373, 375, 376, 377 
Sinhala instrumental case 210, 249 
Sintashta archaeological culture 205 
Skjasrvo, P. O. 283 
Slovenia 5 

Smith, Laura Catharine 398 
Smitherman, Thomas 145 
Smoczynski, Wojciech 515 
Smyth, H. W. 300 
Snoj, Marko 521, 538, 543, 547 
sociolinguistics, quantitative 8 
Solta, Georg Renatus 423, 447 
Somers Wicka, Katerina 398 
sdtera rule 297 
Soucek, Vladimir 126, 127 
sound changes 20, 388, 392-4, 396, 402, 412, 
426, 559 

South America 2 1 4 

South Asia 214 

South Montenegro 5 

South-East Asia 214 

Southworth, Franklin C. 205, 211, 255 

Spain 4, 321, 359 

Speijer (Speyer), Jacob S. 254 

spirantization 364, 365, 367, 392 

split-ergative alignment 162-3 

Sprachbund 483 

Sri Lanka 214 

Stalmaszczyk, Piotr 357 


Stang, Christian S. 479, 481, 482, 483, 486n2, 
497, 499, 509, 525, 547 
Stang’s Law 70, 526 
Starke, Frank 178, 180, 191 
Steams, McDonald, Jr. 388 
Steele, P.M. 314 
stele of Xanthos 174 
Stempel, Reinhard 107, 446-7 
Stemefeld, Wolfgang 1 32 
Stifter, David 358, 359, 360, 362, 371, 377 
Stockwell, Robert P. 405, 407 
stops 176-7, 426-7, 498-9; aspirated 14-15; 
dental 498, 499; fortis 396; geminate 388; 
glottalized 18, 154, 167,426; palatalized 498; 
PIE 14-29; Proto-Anatolian 176; velar 14, 

177, 325, 391, 480, 496, 498, 533; voiced 
14-18, 20-1, 54, 153-4, 176-7, 291, 327, 

392, 481, 496, 516n7; voiced aspirated 
stops 14-15, 18-21, 153-4, 176, 206, 222, 
264, 268, 291, 326, 392, 426-7, 432, 454, 

528; voiceless 14-16, 20, 26, 54, 1 53—4, 

171, 176-7, 225, 268, 359, 392-3, 396, 

431-2, 454 
Strand, Richard F. 205 
Stressed Syllable Law 397 
Stuart-Smith, Jane 326 
Stiiber, Karin 369, 370, 377 
Studi Etruschi 350 
Sturtevant, Edgar H. 194, 314 
subjunctive mood 100-1, 161,338,411,468; in 
dependent clauses 473; Greek 309, 3 1 1 
subjunctive stem 375 

subordinate clauses 193; Albanian 60 1 ; Anatolian 
193; Celtic 397; Germanic 405-6, 408; Greek 
305, 311, 312; Indo-Aryan 223; Italic/Latin 
343-6, 348; PIE 105, 139, 166; Slavic 546; 
Tocharian 468 

superlatives: Albanian 580- 1; Armenian 438; 
Baltic 514; Celtic 357, 370; Greek 303-4; 
Iranian 276; Italic 334; Italo-Celtic 319; 

PIE 80 
Surinam 214 
Svane, Gunnar 557 
Svarlien, Diane Amson 130, 132 
Sweden 389 

Syllable Contact Law 397 
syllable structure, PIE 52-4 
syncope patterns 371-2, 425 
syncretisms 138, 143-5, 158-9, 163, 302, 367, 
369, 399-400, 434-5 
syntax: adverbial clauses 346-7; Albanian 

599-602; Armenian 447; Baltic 512-15; Celtic 
377; Classical Armenian 447; complementation 
345-6; configurational 191-3; constituent 
order 343-4; Germanic 405-12; Greek 
311-14; Indo-Aryan 216; Iranian 282-3; 

Latin 343-9; morphosyntax 191; passive and 
ergative constmctions 249; relative clauses 
347-8; sequence of tenses 348; Slavic 543-6; 
subordinate clauses 193, 344-5; subordinating 
structures 348-9 
synthetic passives 389 
Syria 263,421 
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Szemerenyi, Oswald J.L. 56, 106, 107, 154, 156 
Szemerenyi’s Law 70, 76 

Tacitus 323, 387 
Tajikistan 263 
Talmy, Leonard 158 
Tarim Basin 452, 453, 456, 474 
Teeter’s Law 112 
temporal clauses 346 
Temes, Elmar 352, 361, 378 
Thesaurus Indogermanischer Text und 
Sprachmaterialien (TITUS) 253 
Thiersch, Craig L. 406 
Thomai, Jani 603 
Thomas, Werner 474-5 
Thomason, Sarah G. 165 
Thompson, R. J. E. 302 
thorn-clusters 53 

Thumeysen, Rudolf 367, 374, 376, 378 

Tibet 452 

Tichy, Eva 56 

Tischler, Johann 193, 194 

tmesis 111, 116-20, 157, 290,313, 377 

Topalli, Kolec 560, 577, 597, 599 

toponyms 524, 533, 556 

Toporov, Vladimir Nikolaevic 483, 486 

Tovar, Antonio 355, 358, 368 

Trautmann, Reinhold 479 

Tribulato, Olga 314 

triple representation 430 

Trips, Carola 403, 407 

Trubacev, Oleg Nikolaevic 522, 547 

Trudgill, Peter 389 

Turkey 3, 5, 263, 287, 421 

Turkmenistan 263 

Uhlenbeck, Christian 1 63 
Uhlich, Jurgen 359, 376 
Ukraine 7, 552 
Ulfilas (Wulfila) 387 
Umbria 317 

umlaut 387, 395-6, 402, 411, 424, 458, 460-1, 
564-6, 571, 573, 577, 579, 594 
unergatives 144 
United Kingdom 214 
United States 214, 421 
Untermann, Jurgen 350 
Urartians 42 1 
Ussishkin, Adam 398 
Uyghurs 474 
Uzbekistan 263, 452 

Vaan, Michiel de 283, 350, 560 
Vaillant, Andre 163, 496, 547 
Vata§escu, Catalina 557 
Vedas 4, 217; see also Rgveda 
Vedic texts 218-21, 252-3, 526; see also 
Rg-Veda 

velar palatalization 14-15, 21-3, 26, 28, 154, 
206, 226-7, 269, 293, 458-60, 464, 469, 498, 
524, 530-2, 534,571 
velar stops 14, 177, 325, 391, 480, 496, 

498, 533 


velars 226; geminate 396; PIE 269; see also 
labiovelars; palatovelars; velar palatalization; 
velar stops 

Velupillai, Viveka 168 
Venantius Fortunatus 353 
Vendryes, Joseph 370, 371, 377 
Vendryes’ Restriction 377 
Venetian district 558 
Vennemann, Theo 389, 397, 398, 434 
Ventris, Michael 291, 314 
verbal aspect: Albanian 585-6; Anatolian 184-5, 
189; Armenian 441; Baltic 512; Celtic 371, 

377; Germanic 405, 410-12, 414; Greek 305, 
309-12; perfective 95; PIE 92, 95, 97, 107n8, 
146, 161-2, 168; Slavic 533, 541-2; stative 79, 
92, 94, 97-9, 144, 164, 246-7, 256-7, 305, 
309-10, 370, 374, 442-5, 509, 541; Tocharian 
468; see also aorist 

verbal forms and inflections: non-finite 104, 

190, 341-2, 445; stem formation 189-90, 
444-5; verb stems 161-2; see also infinitives; 
participles (verbal adjectives) 
verbal moods: conditional 238, 242, 372, 375, 

508, 511; desiderative 94, 161, 245, 269-70, 
340, 376, 508; injunctive 91, 97, 107, 113, 

114, 130, 207, 238-9, 241-2, 246, 277, 
279-80, 282, 428, 441-2; optative 102, 

161, 281, 309, 468, 596-7; subjunctive 
100-1, 161,280-1,309,311,338,411,468, 
473; see also imperative mood; indicative 
mood 

verbal positions: Germanic 405-7; marked 
order 1 (verb-initial) 127-33; marked order 
2 (verb-second/verb-third) 133-7; unmarked 
order (verb-final) 124-7 

verbal tenses and conjugations: Albanian 585-97; 
Anatolian 184-90; Armenian 441-5; Baltic 
486-7, 508-12; Celtic 371-6; Germanic 
410-12; Greek 305-10; Indo- Aryan 238-47; 
Indo-Iranian 207; Iranian 277-81; Latin 

336- 43; old Indian 240-2; Oscan 336; 
Proto-Anatolian 184-90; Proto-Indo-European 
91-7, 99-104, 138-9; Slavic 539-42; 
Tocharian 454, 467-9; see also ablaut; aorist; 
preterit 

verbal voices: active 185-8, 207, 337-8, 467-8; 
mediopassive (middle voice) 99-100, 188, 207, 
209, 308, 338, 372, 389, 444, 467-8; passive 

162, 245-7, 339, 373-4; proterokinetic 
68, 190 

verbs: athematic roots 239, 306; auxiliary 162, 

184, 283, 546, 585-7, 592; 94, 180, 189, 

337- 8, 340, 350, 374, 376, 399, 404, 443-6, 
464, 468, 470; deponent 99, 308, 326, 338-9, 
371-2, 374, 376, 591; ditransitive 144; 
injunctives 91, 97, 107, 108nl2, 113, 114, 130, 
207, 238-9, 241-2, 246, 277, 279-80, 282, 
428, 441-2, 594; intransitive 144-5, 162-4, 
167, 190, 248, 256n24, 283, 307-8, 404, 
467-8, 472, 483-4, 510, 539, 588, 592; labile 
144-5, 209, 248; periphrastic 184, 310, 339, 
342, 358, 396, 404, 411-12, 595; transitive 65, 
144-5, 162-4, 167, 190, 240, 247-8, 256-7, 
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281, 287, 307-8, 447, 468, 510, 587-8, 600; 
unaccusative 144, 171 
Vergil 323 

Verkerk, Annemarie 158 
Vermeer, Willem 528, 529, 531, 535, 560 
Vemer, Karl 393 
Vemer’s Law 20, 393, 396 
Verscharfung 388 

Villar, Francisco 164, 360, 364, 368, 372 

Vine, Brent 350 

Vishva Bandhu 255 

Viredaz, Remy 434 

Viret, Jeremie 360 

vocalism: Brythonic 364-5; Celtic 362; 

Gaelic 366 

vocative case: Anatolian 178, 183; Armenian 
434; Baltic 499, 502; Celtic 368-9; 

Germanic 400, 402; Indo- Aryan 223, 233; 
Indo-Iranian 207; Italic/Latin 330-1, 333; PIE 
62, 80-1, 105-6, 158-9; Slavic 535; Tocharian 
462-3 

voiced aspirated stops 14-15, 18-21, 176, 206, 
222, 264, 268, 291, 326, 392, 426-7, 432, 

454, 528 

voiced stops 15-18, 20-1, 54, 153-4, 176-7, 
291, 327, 392, 481, 496, 516n7 
voiceless fricatives 15, 264, 326, 392-3 
voiceless stops 15-16, 20, 26, 54, 154, 171, 
176-7, 225, 268, 359, 392-3, 396, 431-2, 454 
Vorlage: Greek 407; Latin 406, 409 
vowels: Albanian 561-2; anaptyctic 266; 
Anatolian 176; Armenian 423-6; auslaut 
565; Baltic 489-92; Celtic 362, 364, 366-7; 
composition 377; contraction of 425-6; 
doubling 324, 328; dummy 291; East Baltic 
490; epenthetic 53, 266; Germanic 394-6; 
Greek 293-6; Indo- Aryan 221-5; Indo-Iranian 
206-7; Iranian 266-8, 272; Italic/Latin 
325-6, 328; Latvian 491-2; lengthening 154, 
324, 328, 397; Lithuanian 490-1; pre-tonic 
364; prothetic 34, 48, 52, 53, 292, 293, 428, 
429, 430, 432, 440; Proto- Anatolian 176; 
proto-Germanic 395; Proto-Indo-European 
39-43, 154-5, 455; Slavic 522-5; Tocharian 
455; weakening of 425; see also diphthongs; 
semivowels 
Voyles, Joseph 56 

Wachter, Rudolf 350 

Wackemagel, Jacob 111, 119, 120, 146,249, 

254,405 

Wackemagel clitics and particles 116-17, 123 
Wackemagel’s Law 120-3, 146, 192, 311, 371, 
373, 377, 543; in Baltic 514-15 
Wagner, Heinrich 356 
Wales 4 

Wallace, Rex 350 


Watkins, Calvert 107, 111, 113, 119, 120, 165, 
192, 194, 247, 356, 357, 358, 370, 372, 373, 
375, 376, 377 
Weerman, Fred 407 
Weight Law 397 
Weise, Oskar 397 
WeiB, Helmut 396, 397 
Weiss, Michael 350, 357, 375 
Weitenberg, Joseph J. S. 180, 192, 435 
Werba, Chlodwig H. 255 
West, Martin L. 282, 284 
West Macedonia 5 
Whaley, Lindsay J. 168 
Whitney, William Dwight 242, 253, 254, 255 
Wiesinger, Peter 396 
Willis, David W. E. 377 
Windluhr, Gemot 282, 283 
Winter, Werner 428, 430, 452, 462, 481, 527 
Winter’s Law 18, 154, 481, 486n6, 489, 506; in 
Baltic 516nl3; in Slavic 527-8 
Witczak, Krzysztof T. 357 
Witzel, Michael 253 
Woodard, Roger 194 

word formation: Albanian 597-9; Armenian 
445-7; Celtic 376-7; Germanic 403-5; 
Greek 313-14; Iranian 281-2; Italic 
349-50; Latin 349-50; Slavic 542-6; 
Tocharian 470-1 

word order 164-5, 191-3, 311; see also 
Proto-Indo-European word order 
Wujek, Jakub 488 
Wulfila (UMlas) 387 

Xhuvani, Aleksander 597 
Xinjiang province 5, 452 

Yakubovich, Ilya 172, 173, 188 
Ylli, Xhelal 557 

Yoshida, Kazuhiko 186, 187, 188 
Young, Steven 495 
Young, Thomas 3 

Zadar 552 

Zair, Nicholas 357, 362 
Zaliznjak, Andrej Anatol’evic 254, 544, 547 
Zarathustra 4, 457 
Zeilfelder, Susanne 1 1 6 
Zemaitija 492-3 
Zemgale 494 
Zeps, Valdis 487 
Ziegler, Sabine 361 
Zimmer, Stefan 192, 377 
Zinkevicius, Zigmas 502 
Zirmunskij [Schirmunski], Viktor 
Maksimovic 388 

Zograf, Georgij Aleksandrovic 252, 255 
Zoroastrian scriptures 264 



